A libre/open-source cognitive runtime that gives Claude Code, Codex, Claude Desktop, and other MCP clients the same local brain. Persistent memory, multimodal memory refs, pre-compaction auto-flush, a public claim wiki, readable memory export, inspectable user-state adaptation, enforceable protocol discipline, durable workflow execution, configurable automation, and recovery-aware self-healing on top of 150+ MCP tools.
$ npx nexo-brain
NEXO is not a hosted memory SaaS with a thin MCP layer. It is AGPL software you can inspect, fork, run locally, adapt to your stack, and contribute back to. That matters for trust, for longevity, and for teams that do not want their agent memory trapped behind a vendor wall.
The runtime, doctor checks, workflows, prompts, and public Evolution loop all live in the open. You can audit what the system is doing before you trust it with your work.
Plugins, scripts, prompts, skills, and policies are all meant to be adapted. If your workflow is different, NEXO is supposed to bend with it instead of hiding the real behavior.
Your memories, workflows, vectors, and operational history stay on your machine. If you want to stop using NEXO, your data is still local and the system remains understandable.
If someone asks "what does this actually feel like?", these are the shortest honest answers: install, shared brain across clients, decision-to-outcome loops, runtime discipline beyond memory, and the public Evolution loop.
The one-command path: runtime install, client detection, local brain wiring, and the first nexo chat / nexo doctor loop.
One local runtime, two client surfaces, the same operator memory. See what survives when you move from one session environment to the other.
Opt in, open a Draft PR, pause while it stays open, help review other proposals, then resume cleanly after merge or close.
See the v5 story in one loop: alternatives are evaluated, outcomes are checked, and repeated wins can become reusable behavior.
Understand the category correctly: one local working brain with runtime discipline, not only a memory layer plugged into a larger stack.
NEXO is not just an MCP endpoint. The product flow is: install the runtime once, connect the clients you actually use, and let the same local brain drive both interactive sessions and background automation.
npx nexo-brain sets up the local runtime, doctor/update commands, background schedules, embeddings, and the shared-brain MCP layer. This is the primary path.
Claude Code, Codex, and Claude Desktop can all point to the same local NEXO brain. You choose which client nexo chat opens and which backend runs automation.
The value is not just persistence. It is one operator memory, one runtime truth, one recovery model, and one set of cognitive tools across your daily workflow.
The primary path is npx nexo-brain: it installs the runtime, configures the shared-brain MCP layer, and wires Claude Code, Codex, and Claude Desktop when available. OpenClaw and ClawHub remain supported alternate entry points built on the same cognitive engine.
The main NEXO experience. Install with npx nexo-brain and get the local runtime, shared-brain MCP server, client sync for Claude Code / Codex / Claude Desktop, and the full automation layer.
Swap OpenClaw's default memory for NEXO's cognitive runtime while staying inside the OpenClaw lifecycle and tool registry. Best when OpenClaw is already your main operating surface.
OpenClawMarketplace-friendly install path for people who already browse skills via ClawHub. It is the same NEXO engine underneath, just packaged for a faster discovery and one-click workflow.
MarketplaceNEXO Brain stays libre and open-source. NEXO Desktop is the separate closed-source Mac product built on top of that runtime for operators who want the guided setup, managed updates, and product UX without living in Terminal.
Desktop is for teams who want onboarding, settings, email routing, automations, backups, and updates inside the app. It consumes the same Brain contracts, but presents them as a closed product surface instead of a repo workflow.
Managed experienceDesktop does not replace the open runtime. It packages and manages it for non-technical operators while the public Brain package remains installable on its own via npx nexo-brain.
If you want the Desktop product for a business deployment, contact info@wazion.com and ask for NEXO Desktop. Public downloads and source in this repo cover NEXO Brain only.
Commercial inquiryNEXO is now multi-client by design, but the three clients are not identical surfaces. The honest recommendation today is still Claude Code first, Codex second, Claude Desktop for shared-brain access.
The deepest path today: strongest hook surface, most battle-tested headless automation, and the cleanest fit for overnight jobs. Recommended default profile: Opus 4.6 with 1M context.
Now a real first-class option for terminal sessions and headless automation. It shares the same brain, model sync, and managed bootstrap, but still has fewer native hook surfaces than Claude Code.
Useful as a shared-brain client through MCP, but not the main terminal/automation surface. Best understood as another entry point into the same memory system, not the primary runtime operator.
Not just storage — a complete cognitive architecture that learns, forgets naturally, detects conflicts, and prevents repeated mistakes.
Three-store model: Sensory Register captures raw input, STM holds working context with rehearsal, LTM consolidates with semantic vectors. Just like human cognition.
Vector search with fastembed (BAAI/bge-base-en-v1.5). Query across all memory stores with cosine similarity. Retrieve what matters, not just what matches.
Pre-edit checks that inject known errors, real schemas, and blocking rules before your agent writes code. Prevents repeating past mistakes.
Memories naturally fade over time following the Ebbinghaus forgetting curve. Rehearsal strengthens important memories. No manual cleanup needed.
0-100 alignment index that adjusts based on corrections, successes, and proactive actions. Controls internal rigor: low trust = more paranoid checks.
Detects when new instructions contradict existing strong memories. Surfaces the conflict and asks for resolution instead of silently overwriting.
Change logs, decision records with alternatives and reasoning, session diaries with mental state continuity. Full audit trail of what happened and why.
Hot-reload plugins at runtime. Add new tool categories without restarting the server. Ship your own extensions as Python files.
All data stored in local SQLite databases. Vectors computed on-device with ONNX Runtime. Nothing ever leaves your machine. Zero cloud dependencies.
5-question onboarding that creates a unique agent personality. Your agent adopts a consistent voice, tone, and behavioral style from day one.
23 non-negotiable principles every NEXO agent follows. From memory hygiene to error prevention, the codex defines what it means to be a reliable co-operator. See the wiki.
Run NEXO Brain in a container with two commands. Mount your data directory, and the cognitive engine runs isolated and portable across any environment.
Complex questions are automatically split into sub-queries. Each component is retrieved independently, then fused for a higher-quality answer. Improves recall on multi-faceted prompts.
Adaptive chunking strategy that respects sentence and paragraph boundaries. Produces semantically coherent chunks instead of arbitrary token splits, reducing retrieval noise.
After initial vector retrieval, a cross-encoder model rescores candidates for precision. The top-k results are reordered by true semantic relevance before being returned to the agent.
Automatic end-of-session summarization that distills key decisions, errors, and follow-ups into a compact diary entry. The next session starts with full context, not a cold slate.
Combined vector + BM25 keyword search via SQLite FTS5. Best of both worlds: semantic understanding for concept-level retrieval plus exact keyword matching for precise lookups.
Upgraded from 384 to 768 dimensions (BAAI/bge-base-en-v1.5). Doubled semantic precision for richer memory representations — still CPU-only, no GPU required.
Redundancy-aware Ebbinghaus forgetting curve. Unique memories decay 4x slower than duplicates — no information loss in sparse stores, automatic cleanup in dense ones.
Automatic date extraction and temporal query boosting. "When" questions get smarter filtering — memories are ranked not just by relevance but by recency when context demands it.
Transparent 384→768 embedding upgrade on first startup. All existing memories are re-embedded automatically with zero user action required — no data loss, no manual steps.
Signal weights learn from real feedback via Ridge regression. 2-week shadow mode validates new weights before promoting. Weight momentum and automatic rollback keep the system stable.
Pain memory per file and area. Guard warns on HIGH RISK (>0.5) and CRITICAL (>0.8). Validated recovery on clean checks — the system forgets pain when the problem is fixed.
Vibe, corrections, brevity, topic, tool errors, and git diff. Emergency bypass for urgent sessions. Severity-weighted decay keeps personality calibrated over time without manual resets.
Weekly self-improvement cycle. Analyzes patterns, proposes changes, validates via snapshot/rollback. Circuit breakers and budget caps for safety. See the full procedure.
nexo chat opens your configured terminal client, nexo update syncs the runtime, and nexo doctor audits or repairs it. One binary, full control from any terminal.
First-class managed scripts with lifecycle tracking, schedule associations, and recovery awareness. 9 MCP tools for create, reconcile, sync, classify, and schedule management.
Health checks before every interactive session. Safe migrations, dependency verification, and environment validation ensure the cognitive engine starts clean every time.
Boot and wake catch-up for core and personal jobs. Explicit recovery contracts define what runs after sleep, restart, or missed schedules — no silent failures.
Click to view full size
Instead of burying the home page in long release writeups, here is the short version: NEXO now behaves much more like a real shared-brain runtime across clients, with stronger memory surfaces, enforceable protocol discipline, durable execution, and public product surfaces that match what the runtime can actually do.
v7.1.7 makes email-monitor carry the calibrated operator language through its prompt contract and localizes direct needs_interactive escalation emails, so Spanish operators stop receiving fallback English monitor mail.
F0.6 makes the runtime structure honest: core, personal, and runtime live as separate buckets, fresh installs land there directly, and transition-aware path helpers let updates survive the migration without pretending the flat legacy tree is still the product.
Personal automations gained first-class enable/disable/status controls. The cron wrapper respects the operator flag at every tick, so disabling a script is a reversible product action instead of a manual LaunchAgent intervention.
Multiple inboxes now live in a proper email_accounts table with credential pointers, interactive and machine-safe setup flows, and a Desktop bridge that lets non-technical operators manage accounts without editing files.
Extended sentiment/entity schemas, labelled rule fixtures, telemetry loops, and the local classifier skeleton landed in 6.3.0; 6.3.1 immediately stripped operator-specific data back out of the public preset so privacy stayed aligned with the product claim.
By v5.1.0, Evolution, adaptive, cognitive, and skills subsystems were closing under evidence. NEXO was no longer only a memory layer: it had become a runtime that could measure, learn, review, and reuse what actually worked.
One command to install. NEXO Brain sets up the runtime, downloads the embedding model, configures shared brain for detected clients, and lets you choose your terminal client and automation backend.
A condensed run of `npx nexo-brain`: language, install path, shared-brain clients, default terminal client, automation backend, and recommended model profiles.
People can see the setup experience before they install: NEXO does not just wire MCP. It configures the shared brain, lets you choose the client that `nexo chat` opens, and decides which backend runs background automation.
Claude Code remains the primary recommended path because it has the deepest hook and headless automation surface. The recommended Claude profile is now Opus 4.6 with 1M context.
These are public use cases, not fabricated testimonials. They are the operating patterns the current product already serves well and the kinds of real case studies worth capturing next.
One developer bouncing between feature work, bugs, and local ops who wants the agent to remember prior decisions and stop repeating the same mistakes.
Teams or individuals using more than one terminal agent surface who want shared memory, one runtime truth, and fewer fractured setup paths.
People who need workflows, doctor diagnostics, followups, self-audit, and overnight synthesis to stay connected instead of living in separate scripts and notes.
Users who care about privacy, inspectability, and not sending operational memory to a hosted third-party memory layer.
Maintainers and opt-in contributors who want an improvement loop that proposes safely, pauses on open PRs, and routes review back through humans.
Public package and directory surfaces for the product, plus the main places to read and watch NEXO in public.
Short paths for people deciding whether NEXO is the right runtime, not just browsing package registries.
Core package, codebase, and MCP directory surfaces. ClawHub remains supported as the marketplace install path in the integration section above.
Public explainers, walkthroughs, and launch content for people evaluating the product before they install it.
Common questions about NEXO Brain, how it works, and how to get started.
npx nexo-brain in your terminal. The installer sets up the local runtime, configures shared brain support for Claude Code, Codex, and Claude Desktop when available, lets you choose your terminal client and automation backend, downloads the embedding model, and creates the SQLite databases.nexo_guard_check) is a pre-action safety system. Before your agent edits code or makes changes, it checks for known errors, blocking rules, and relevant learnings. It prevents the agent from repeating mistakes it has already encountered.plugins/ directory are automatically loaded at startup. You can add, remove, or update plugins at runtime without restarting the server. Each plugin can register new MCP tools.CLAUDE.md baseline.npx nexo-brain and the installer handles everything automatically. Code and data are cleanly separated, the shared brain is configured once for supported clients, and your runtime keeps self-healing its managed schedules on updates.nexo contributor on. Read how Evolution works.nexo_personal_plugin_create to scaffold a personal MCP plugin in NEXO_HOME/plugins/. Personal plugins persist across updates and never become part of the core product.Why NEXO exists
NEXO Brain was born out of necessity. I was building WAzion — a WhatsApp marketing SaaS — as a solo founder, working with AI agents every day. But every time I closed a session, the agent forgot everything. The same corrections, the same mistakes, starting from zero every morning.
I needed an agent that remembered. That learned from its errors. That detected when it was about to repeat a mistake it had already made. That maintained continuity between sessions as if it were the same person.
What started as a personal memory layer for Claude Code evolved into a full cognitive runtime shared across Claude Code, Codex, Claude Desktop, and other MCP workflows — shaped by six months of daily production use, co-created between me and NEXO itself. Every feature was born from a real problem, not a roadmap.
Today NEXO Brain is open source and free for everyone. Because if one solo founder needed this, thousands of developers do too.
Francisco Cerdà Puigserver
Founder of WAzion · Creator of NEXO Brain
Open source, AGPL-3.0 licensed, and built in the open. Install it, stress it, open issues, ship PRs, and help shape how agent memory should work.
Translate this page