Skip to content
New issue

Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? # to your account

added gsoc idea for KubeStellar #1385

Merged
merged 2 commits into from
Feb 11, 2025
Merged
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
31 changes: 31 additions & 0 deletions programs/summerofcode/2025.md
Original file line number Diff line number Diff line change
Expand Up @@ -177,6 +177,37 @@ Note that the initial idea is to solve this with **3-way Git merges**. However,

- Upstream Issue: [WIP - Proposal: Automating Operator Maintenance: Driving Better Results with Less Overhead](https://github.com/kubernetes-sigs/kubebuilder/pull/4302)

#### KubeStellar

##### AI/ML Model Monitoring and Drift Detection in Disconnected Clusters using KubeStellar

- Description: AI/ML models deployed in disconnected environments, such as edge clusters and air-gapped systems, often suffer from model drift—a degradation in model performance due to changes in input data distributions. Without continuous monitoring, models may become inaccurate, leading to unreliable predictions.

This project aims to integrate model monitoring and drift detection into KubeStellar, enabling Kubernetes-based AI workloads to detect data drift locally and sync monitoring metrics when connectivity is restored. The solution will use lightweight monitoring agents deployed alongside ML models to track data distribution changes and alert mechanisms to trigger model retraining when necessary.

The system will also include policies for efficient metric storage and synchronization between disconnected and central clusters while minimizing bandwidth usage.

- Expected Outcome:
- A KubeStellar-compatible AI/ML monitoring component that tracks model drift in disconnected clusters.
- Efficient local storage and synchronization of monitoring metrics when connectivity is restored.
- Policies for adaptive model retraining triggers based on drift detection signals.
- Integration with existing ML tools (e.g., Prometheus, TensorFlow Extended, OpenTelemetry).
- Open-source documentation and example workflows demonstrating how KubeStellar manages AI model monitoring across disconnected clusters.
- Recommended Skills:
- Kubernetes and container orchestration
- AI/ML model deployment & monitoring
- Python, Go (for Kubernetes integrations)
- Experience with logging/monitoring tools (Prometheus, OpenTelemetry)
- Familiarity with KubeStellar (preferred but not required)
- Expected Project Size: Large (~350 hours)
This project requires implementing multiple components: local monitoring, drift detection, synchronization, and integration with KubeStellar. It also involves research into efficient data synchronization strategies for low-bandwidth environments.

- Mentor(s):
Andy Anderson (@clubanderson, andy@clubanderson.com) - Primary Mentor
[Second Mentor's Name] (@second-mentor-github, second-mentor-email)
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We can merge this in without the second mentor for now, but for GSoC it's important for us to have one.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

understood

Upstream Issue (URL):
[GitHub Issue Link] (https://github.com/kubestellar/kubestellar/issues/2791)

#### Kubewarden

##### Allow policies to be written using JavaScript
Expand Down