Roadmap¶

sikifanso is an AI agent infrastructure platform. Development follows a phased approach, building from cluster foundations toward AI-powered operations.

Phase 0: Foundation & Identity -- shipped¶

Replaced the original homelab catalog with AI agent infrastructure tools. 17 curated tools across 7 categories: gateway, observability, guardrails, RAG, runtime, models, and storage. Retained the existing k3d bootstrapping, GitOps catalog system, doctor checks, snapshots, TUI browser, dashboard, and dual-track app model.

Phase 1: Agent Cluster Profiles -- shipped¶

sikifanso cluster create --profile <name> enables a pre-defined set of catalog apps. Profiles are composable: --profile agent-dev,rag enables the union of both. Available profiles: agent-minimal, agent-full, agent-dev, agent-safe, rag.

See Profiles for details.

Phase 2: Agent Isolation & Network Policies -- shipped¶

sikifanso agent create <name> scaffolds a namespace with resource quotas, network policies, and service account. Cilium NetworkPolicies enforce default-deny egress, allowlisted data layer access, no cross-agent traffic, and no Kubernetes API access.

See Agent Sandboxes for details.

Phase 3: MCP Server Interface -- shipped¶

sikifanso mcp serve exposes cluster operations as MCP tools (stdio transport). 25 tools across cluster management, catalog, agents, ArgoCD, Kubernetes, and health checks. Any MCP-compatible client can manage the cluster.

See MCP Server for details.

Phase 4: AG-UI Protocol Integration¶

Stream agent reasoning and actions to terminal/web UI in real-time.

AG-UI SSE endpoint in the dashboard server
Event flow: RUN_STARTED -> STEP_STARTED/FINISHED -> TOOL_CALL_* -> TEXT_MESSAGE_* -> STATE_SNAPSHOT/DELTA
Terminal renderer for structured output
Web dashboard live activity feed
sikifanso agent watch <name> for live agent activity streaming

Phase 5: AI-Powered Operations¶

Natural language cluster operations powered by MCP tools and AG-UI streaming.

sikifanso ai "<prompt>" -- natural language cluster operations via Claude API with tool use
AI uses the same MCP tools that external agents use (Phase 3)
Reasoning streamed via AG-UI (Phase 4)
Intelligent troubleshooting: sikifanso ai "why is langfuse unhealthy"
AI-driven catalog: sikifanso ai "I need to run agents with guardrails and cost tracking"

Phase 6: Advanced Features¶

Production hardening, community, ecosystem.

Shareable agent cluster profiles (OCI artifacts / git refs)
Community catalog contributions (curated AI tool definitions)
Multi-cluster agent federation
GPU scheduling support (Ollama/vLLM with NVIDIA GPUs)
Cloud cluster support (bootstrap EKS/GKE with the same catalog)
Agent marketplace: pre-built agent templates that run on sikifanso clusters

Phase dependencies¶

Phase 0 (Foundation) -- DONE
  +---> Phase 1 (Profiles) -- DONE
  +---> Phase 2 (Isolation) -- DONE
           +---> Phase 3 (MCP Server) -- DONE
                    +---> Phase 4 (AG-UI)
                             +---> Phase 5 (AI Ops)
                                      +---> Phase 6 (Advanced)

Other ideas¶

These are directions worth exploring, not committed to any phase:

Remote GitOps repos -- support remote git repos as the gitops source for collaboration and CI/CD. See Remote GitOps.
Additional CNI options -- Calico, Flannel, or bring-your-own via a --cni flag.
Cluster templates -- shareable templates defining node count, resource limits, pre-installed apps, and network policies.
Multi-node topologies -- HA control plane, agent nodes for workload isolation.
Terraform / OpenTofu integration -- a provider or module that wraps sikifanso.