Persistent, searchable, three-tier memory for AI coding agents. Store decisions, learn from patterns, and share knowledge across sessions — automatically.
Your AI agents are brilliant — but they have amnesia. Every conversation, every decision, every pattern learned, gone the moment the session ends.
Agents forget decisions, architecture choices, and rationale. You re-explain the same context every single session.
Without memory, agents make the same mistakes and propose solutions that were already rejected in prior sessions.
When team members leave or switch projects, institutional knowledge disappears. Onboarding starts from scratch.
Information naturally flows from ephemeral conversations to permanent organizational knowledge — just like human memory.
Ephemeral context within active conversations. Fast, disposable, high-volume.
Decisions, summaries, and artifacts scoped to a project. The working memory of your team.
Validated patterns and best practices promoted to org-wide knowledge. Permanent, multi-tenant.
Find relevant memories by meaning, not just keywords. 1536-dimension embeddings powered by Azure OpenAI with hybrid search combining 70% vector + 30% keyword scoring.
Bidirectional links between memories — supersedes, implements, rationale, related-to. Graph traversal up to 3 hops with cycle detection and auto-surfacing.
Automatic capture of every file read, edit, and command via Claude Code hooks. Local-first JSONL logs with optional git branch-per-session isolation.
Transparent API proxy captures every prompt and response between Claude Code and Anthropic. Zero latency overhead, automatic secret redaction, two-file JSONL logging.
Architects get high proactivity and frequent storage. Developers get minimal interruption. Five integration levels from menu-only to full automation.
Three-state fallback: full MCP memory → local-only session tracking → session-only context. Never breaks the agent. Works offline, syncs when available.
Every interaction flows through a capture pipeline — from real-time conversation proxying to permanent organizational knowledge.
Add the MCP server to your project. Deploy Cosmos DB with the included ARM template or connect to an existing instance.
npm install @acme-corp/agent-memory-server
Point Claude Code at the MCP server. Add your Azure Cosmos DB and OpenAI credentials. Optionally start the conversation proxy.
// .claude/settings.local.json
"mcpServers": { "bmad-memory": { ... } }
Your agents now have memory. Decisions persist. Patterns compound. Knowledge promotes from project to organization — automatically.
bmad_hybrid_search("auth pattern")
→ 3 relevant memories found