How I built local-first memory for Claude Code, Cursor, and Codex – 94.5% LoCoMo recall@10, 70ms p50
TL;DR. PMB (Personal Memory Brain) is an open-source MCP server that gives AI coding agents persistent, long-term memory. Everything runs on your machine: SQLite + LanceDB, zero cloud, zero API keys. On the LoCoMo benchmark it reaches 94.5% recall@10 at a p50 latency of 70ms, matching or beating cloud-based memory services. This post mostly covers the techniques that got it there: predicate-aware reranking, multilingual verb expansion, no-LLM atomic fact extraction, RRF query splitting, and a durable async embed queue […]
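For readers who have not met RRF before, here is a minimal sketch of reciprocal rank fusion, the textbook technique behind the "RRF query splitting" item. It is not PMB's actual code. It assumes a query has already been split into sub-queries and that each sub-query returned a ranked list of memory IDs. The function name `rrf_fuse`, the memory IDs, and the constant `k=60` (a common default in the literature) are illustrative assumptions.

```python
from collections import defaultdict


def rrf_fuse(ranked_lists, k=60, top_n=10):
    """Fuse several ranked lists of IDs with reciprocal rank fusion.

    Each item scores 1 / (k + rank) per list it appears in (rank starts
    at 1); the scores are summed across lists. k=60 is an assumed default,
    not a value taken from PMB.
    """
    scores = defaultdict(float)
    for ranking in ranked_lists:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Highest fused score first; break ties by ID so output is deterministic.
    return sorted(scores, key=lambda d: (-scores[d], d))[:top_n]


# Hypothetical results for two sub-queries derived from one user query.
results_a = ["m3", "m1", "m7"]
results_b = ["m1", "m9", "m3"]

fused = rrf_fuse([results_a, results_b])
print(fused)
```

Items that rank well in several sub-query lists (here `m1` and `m3`) rise above items that appear in only one. Because fusion uses ranks rather than raw scores, it needs no score normalization across retrievers.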