feat: kubernetes operator for agent lifecycle management#2139
Open
alexm-redhat wants to merge 1 commit into
Open
feat: kubernetes operator for agent lifecycle management#2139alexm-redhat wants to merge 1 commit into
alexm-redhat wants to merge 1 commit into
Conversation
Add a kube-rs based Kubernetes operator (openshell-operator crate) that provides CRD-driven declarative sandbox lifecycle management. Components: - AgentSandbox CRD with spec for image, resources, policy, and provider refs - Reconciler loop with exponential backoff and status condition reporting - Admission webhooks (validating + mutating) for CRD validation - Manifest builders for sandbox Pod, Service, and RBAC resources - Label conventions for sandbox discovery and ownership tracking - SandboxRuntimeManager gRPC service for operator-gateway communication - Gateway integration via multiplex listener and config flags New crate: crates/openshell-operator (16 files) New proto: proto/sandbox_runtime_manager.proto Modified: openshell-core (config, proto), openshell-server (cli, grpc, multiplex) Tests: 2,239 passed, 0 failed Closes NVIDIA#1719
Author
|
I have read the DCO document and I hereby sign the DCO. |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Summary
Adds a kube-rs based Kubernetes operator (
openshell-operatorcrate) that provides CRD-driven declarative sandbox lifecycle management, porting the operator pattern fromKagenti.
New crate:
openshell-operator(16 files)AgentSandboxCRD with spec for image, resources, policy, and provider refsopenshell.nvidia.com/managed-by,openshell.nvidia.com/agent-name) for discovery and ownershipGateway integration:
SandboxRuntimeManagergRPC service (proto/sandbox_runtime_manager.proto)openshell-server--enable-operatorflag and config file support28 files changed, +2,832/-9 lines across
openshell-operator,openshell-core, andopenshell-server.Test plan
cargo build -p openshell-operatorcompilescargo test -p openshell-operator— 69 passed, 0 failedcargo test -p openshell-core— 297 passed, 0 failedcargo test --workspace(excl. z3-dependent crates) — 2,239 passed, 0 failedAddresses #1719