Kash turns your documents into self-contained AI agents with embedded GraphRAG — packaged into ultra-lightweight Docker containers.
No Python. No vector DB. No infrastructure. Just Go.
Think of Kash like a static site generator — but for AI brains. Data ingestion happens at build time. Runtime only serves queries.
kash init scaffolds your agent project with a Dockerfile, config, and data directory.
Drop PDFs, Markdown, and text files into data/. That's your knowledge base.
kash build chunks text, generates embeddings, and extracts knowledge graph triples.
docker build packages everything into a tiny container sized to your data. Push and run anywhere.
Traditional RAG stacks are bloated. Kash collapses the entire pipeline into a single binary.
docker run and doneA complete agent framework with zero external dependencies at runtime.
Every query runs through both vector similarity search and knowledge graph traversal, merged and optionally reranked.
Ship agents as lightweight Docker images. Image size scales with your data — the Go runtime itself is tiny. Multi-arch support included.
REST API (OpenAI-compatible), MCP Server (for IDEs), and A2A Protocol (for multi-agent orchestration) — all on one port.
Works with any OpenAI-compatible API. Use OpenAI, Anthropic via proxy, Ollama, LiteLLM — your choice.
Automatic extraction of entity-relationship triples via LLM during build. Deep reasoning, not just keyword matching.
Optional Bearer token auth across all interfaces. Set one env var to secure everything. Open access for local dev.
One port, three protocols. Plug your agent into any ecosystem.
# Drop-in OpenAI replacement — works with LibreChat, Open WebUI, etc. curl http://localhost:8000/v1/chat/completions \ -H "Content-Type: application/json" \ -d '{ "model": "gpt-4o", "messages": [{"role": "user", "content": "Explain the key concepts"}] }'
// Add to Cursor / Windsurf / Claude Desktop settings { "mcpServers": { "my-agent": { "url": "http://localhost:8000/mcp" } } }
# JSON-RPC for multi-agent frameworks (AutoGen, CrewAI) curl http://localhost:8000/rpc/agent \ -H "Content-Type: application/json" \ -d '{ "jsonrpc": "2.0", "id": 1, "method": "agent.query", "params": {"query": "What are the key architectural decisions?"} }'
From personal study tools to enterprise knowledge agents.
Feed company docs, runbooks, or research papers. Every dev gets a domain expert on tap.
Compile textbooks into a Socratic tutor that quizzes you and explains concepts accurately.
Turn API docs, changelogs, and FAQs into a support agent that plugs into any chat UI.
Spin up specialized agents (legal, finance, eng) and wire them together via A2A protocol.
No external database servers. No Python runtime. Pure Go from CLI to serving.
Get started in under 5 minutes. Ship an AI agent today.