Wenlan
A living personal knowledge library for the AI-native age. Agents capture what they learn, you add sources you trust, and Wenlan keeps source-cited pages current.
The problem
Every new AI session starts cold.
The work happened, but the context did not survive. Decisions, fixes, and project instincts stay trapped in old chats instead of helping the next agent.
One missing handoff is enough to make the next conversation repeat the last one.
What Wenlan brings
A handoff loop for AI work.
Wenlan captures decisions, lessons, and next steps as work happens, then loads the handoff when the next agent starts.
The next conversation starts from the handoff instead of reconstructing the past.
Deliberate distillation
Wenlan turns repeated context into source-backed pages.
Run /distill when repeated captures should become a readable page. Optional model or API-key paths can add background extraction and page refresh work.
The next run starts from cited context, not transcript residue.
Knowledge pages
The work becomes reusable pages.
Cleaned decisions and lessons become durable pages instead of buried chat logs. They are organized enough for agents to use and concrete enough for humans to read.
Your work stops being transcript history and starts becoming project knowledge.
Hybrid storage
The daemon owns recall. Readable artifacts stay inspectable.
Wenlan keeps raw captures in the local daemon store for retrieval, then projects pages, handoffs, and status files you can open, diff, and move.
Agents recall from the daemon. You inspect the readable files.
Hybrid retrieval, measured
96% fewer tokens. Honest retrieval metrics.
Hybrid retrieval finds the right local context without replaying chat history.
| Surface | Scope | Result |
|---|---|---|
| Full replay | No retrieval | 4,505 tokens / query |
| LME_Oracle | CE-reranked, 500 Q | 168 tokens / query · 93.6% R@5 · 0.883 NDCG@10 |
| LME_S | CE-reranked, N=90 deep-S | 168 tokens / query · 87.7% R@5 · 0.822 NDCG@10 |
Retrieval-only snapshots. LME_Oracle also records 0.857 MRR; LME_S records 0.815 MRR on 84 gradeable rows from the 90-question deep-S fixture. Token comparison is full replay vs retrieved context. Run the harness yourself.
FAQ
Common questions.
What is Wenlan?+
How is Wenlan different from built-in AI memory?+
What retrieval quality does Wenlan reach?+
Is my data private?+
Is Wenlan just another memory MCP?+
What AI tools work with Wenlan?+
Is Wenlan a replacement for Notion or Obsidian?+
How do I set it up?+
Does Wenlan work on Windows or Linux?+
Can I keep work and personal memory separate?+
Is Wenlan free?+
Open source
Open where it matters.
The local runtime, CLI, MCP server, and Claude Code plugin are Apache-2.0.
Get release updates.