Two-faced AI agent.

One side reads, catalogs, connects. The other acts: writes documents, analyzes data, answers questions. Every document you drop in makes answers sharper. Every answer reveals what's still missing.

Knowledge and action, in a loop.

Get started in seconds.

Clone the repo, open it with any coding agent, and start building your knowledge base.

```shell
$ git clone https://github.com/Lover0ne/JanusLM.git
$ cd JanusLM
$ claude   # or codex, gemini, cursor

> ingest raw/my-document.md
> What are the main themes?
```

What is JanusLM?

JanusLM turns any AI coding agent into a general-purpose assistant backed by a structured, multi-project knowledge base.

It builds on the LLM Wiki concept by Andrej Karpathy: a knowledge management system fully maintained by an AI agent. You drop documents in, and the agent decomposes them into structured, interlinked pages: sources, entities, concepts, and syntheses. It maintains an index, a living overview, and a knowledge graph.

Where we saw room to grow.

As powerful as the LLM Wiki idea is, working with it across multiple projects revealed two natural limits.

Limit 1

No project separation

The wiki treats all knowledge as a single flat space. Ingesting documents from different projects means entities and concepts merge together: a page like OpenAI.md blends information from every context. Great for discovery, harder when you need a clean, scoped answer.

Limit 2

Single-purpose agent

The instruction file is entirely dedicated to wiki workflows. The agent excels at knowledge operations, but asking it to generate a Word report or a slide deck falls outside its scope. The agent is a librarian, not an assistant.

The idea.

Two goals, one system.

1

A general-purpose agent

The wiki becomes a knowledge base, not the entire identity. The agent is free to do anything else: Word documents, slides, analysis, web research, and more. Skills are modular: add a new workflow without touching the core instructions.

2

Project separation

Every ingested document belongs to a specific domain. Queries can filter by project without cross-contamination. Your tenth project works exactly like your first: tags scale naturally.

Three-axis search.

The knowledge base is navigable across three dimensions. The agent classifies each query and applies the right search strategy automatically.

Think of the KB as a sparse matrix: rows are pages, columns are project tags. Every query triggers a classification step that determines how to slice this matrix.

Axis 1

Project Search

Vertical

Filter by tag. Read only pages from that project. Isolated answer, zero contamination.

Column slice – read only pages in that column

"What is BP59 in project alpha?"

Axis 2

Concept Search

Cross-cutting

Read the concept page that aggregates knowledge from all projects. Answer organized by project, never blended.

Row slice – read one page across all columns

"What is a PoC?" – shows how each project uses it

Axis 3

Cross-Project Search

Intersection

Cross-reference tags and concepts. Show shared patterns and differences across projects.

Join – intersect rows and columns, find patterns

"Which projects use RAG? What do they have in common?"
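The three slicing strategies can be sketched over a toy knowledge base. Everything below is illustrative: the page names, tags, and dictionary layout are hypothetical, not JanusLM's actual storage format.

```python
# Illustrative model of the knowledge matrix: rows are pages,
# columns are project tags. All names here are hypothetical.
kb = {
    "BP59.md":   {"tags": ["alpha"],          "kind": "entity"},
    "PoC.md":    {"tags": ["alpha", "beta"],  "kind": "concept"},
    "RAG.md":    {"tags": ["alpha", "gamma"], "kind": "concept"},
    "OpenAI.md": {"tags": ["beta"],           "kind": "entity"},
}

def project_search(tag):
    """Axis 1 - column slice: only pages tagged with this project."""
    return [page for page, meta in kb.items() if tag in meta["tags"]]

def concept_search(page):
    """Axis 2 - row slice: one page, organized per project it touches."""
    return {tag: page for tag in kb[page]["tags"]}

def cross_project_search(page):
    """Axis 3 - join: which projects share this concept."""
    return sorted(kb[page]["tags"])

print(project_search("alpha"))        # isolated, zero contamination
print(concept_search("PoC.md"))       # answer split by project
print(cross_project_search("RAG.md")) # shared usage across projects
```

The point of the sketch is the access pattern, not the storage: because every page carries its tags, each axis is a cheap filter rather than a separate index.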

Diagram – how the three axes slice the knowledge matrix


Smarter search decisions

The agent classifies intent before any retrieval. If the query is operational (generate a doc, convert a file), it skips the knowledge base entirely. If uncertain, it checks the index quickly; if nothing matches, it moves on without forcing wiki content into the answer.

Before

The agent searched the wiki for any question. Risk of polluting answers with irrelevant content or blending contexts from different projects.

After

The agent classifies each query into one of the three axes before searching. It knows when not to search. It filters by tag when needed. It separates contexts in the answer.
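The decision order described above can be expressed as rules. This is only a sketch: in JanusLM the classification is done by the LLM itself, and the tag set and action-verb list below are made-up placeholders.

```python
import re

# Hypothetical intent router. The real classification is semantic,
# done by the LLM; this rule-based version shows the decision order only.
KNOWN_TAGS = {"alpha", "beta", "gamma"}           # assumed project tags
OPERATIONAL = {"generate", "convert", "export"}   # assumed action verbs

def classify(query: str) -> str:
    words = set(re.findall(r"\w+", query.lower()))
    if words & OPERATIONAL:
        return "operational"      # skip the knowledge base entirely
    tags = words & KNOWN_TAGS
    if "projects" in words or len(tags) > 1:
        return "cross-project"    # axis 3: join rows and columns
    if tags:
        return "project"          # axis 1: column slice by tag
    return "concept"              # axis 2: row slice via the concept page

print(classify("generate a slide deck"))          # operational
print(classify("What is BP59 in project alpha?")) # project
```

Note the ordering: operational intent is checked first, so "knowing when not to search" is the cheapest branch, not an afterthought.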

Blind domain review.

Before every ingest, the agent validates that the new document actually belongs to the declared project. The validation uses a blind review process: the two evaluations run independently to avoid anchoring bias.

Step 1

Parallel evaluation

The agent reads the document and the existing wiki pages for that project, then forms its own semantic judgment – blind, without seeing any score. Independently, a deterministic script computes the quantitative analysis: TF-IDF cosine similarity, entity overlap, and concept overlap.

Step 2

Comparison and verdict

The agent sees both evaluations side by side, flags any discrepancy, and produces the final affinity decision. When the two disagree, the discrepancy itself becomes a valuable signal for the user.

Example output – deterministic analysis

| Metric | Score | Detail |
| --- | --- | --- |
| Lexical similarity (TF-IDF) | 72% | cosine similarity on TF-IDF vectors |
| Entity overlap | 4/7 (57%) | OpenAI, RAG, LangChain, Anthropic |
| Concept overlap | 3/5 (60%) | PoC, Fine-tuning, Embeddings |
| Composite score | 63% | weights: lexical 0.4, entity 0.3, concept 0.3 |
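The composite score can be reproduced directly from the three metrics. The 0.4/0.3/0.3 weights come from the example output itself; whether the real script truncates or rounds the final percentage is an assumption here (truncation reproduces the 63%).

```python
def composite(lexical, entity_hits, entity_total, concept_hits, concept_total,
              w_lex=0.4, w_ent=0.3, w_con=0.3):
    """Weighted blend of the three affinity metrics (weights from the example)."""
    entity = entity_hits / entity_total      # 4/7 ~ 57%
    concept = concept_hits / concept_total   # 3/5 = 60%
    return w_lex * lexical + w_ent * entity + w_con * concept

score = composite(0.72, 4, 7, 3, 5)  # the values from the example output
print(f"{int(score * 100)}%")        # 0.639... truncates to 63%
```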

The validation is advisory, never blocking: the user always has the final word. If the user declines the ingest, the document is tracked with the reason for rejection. If the same document is submitted again later, the agent completes the full blind review first and only then surfaces the previous rejection, keeping the new evaluation free from bias.

The knowledge graph.

The wiki can be visualized as an interactive knowledge graph. Running build graph extracts all connections between pages (both explicit wikilinks and implicit relationships inferred by the agent) and renders them in a self-contained HTML file you can open in any browser. No server needed.

The graph uses Louvain community detection to cluster related pages, and vis.js for interactive visualization with search, filtering, and a detail drawer for each node.

Knowledge graph – nodes and connections across the wiki

Pass 1

Extracted edges

Deterministic. Parses all [[wikilinks]] across wiki pages to build the explicit connection graph.

Pass 2

Inferred edges

Semantic. The agent reads each page and identifies implicit relationships not captured by explicit links, with confidence scores.

Analysis

Health report

Orphan nodes, god nodes, fragile bridges between communities, phantom hubs – with suggested actions to strengthen the graph.
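Pass 1 is simple enough to sketch: a regex over page contents that collects [[wikilink]] targets, ignoring aliases and anchors. The regex and the sample pages below are illustrative, not JanusLM's exact parser.

```python
import re

# Capture the link target, stopping before any ]] close, | alias, or # anchor.
WIKILINK = re.compile(r"\[\[([^\]|#]+)")

# Hypothetical page contents standing in for real wiki files.
pages = {
    "RAG.md": "Used in [[Project Alpha]] with [[Embeddings|embedding models]].",
    "PoC.md": "See [[RAG]] and [[Fine-tuning#methods]].",
}

# Explicit connection graph: one (source page, link target) edge per wikilink.
edges = [
    (source, target.strip())
    for source, text in pages.items()
    for target in WIKILINK.findall(text)
]
print(edges)
```

Because this pass is deterministic, it can run as a plain script; only Pass 2, the inferred edges, needs the agent.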

Works with every agent.

Open the repo in any coding agent: each one picks up its own instruction file automatically. No configuration, no setup. The same knowledge base, the same workflows, regardless of which tool you prefer.

Claude Code – CLAUDE.md
Codex – AGENTS.md
Gemini CLI – GEMINI.md
Cursor – .cursorrules
Cline – .clinerules

Architecture – how agents interact with the knowledge base


No server, no database: everything is plain markdown files. The Python scripts in tools/ can also run standalone from the terminal, without a coding agent.

Why this matters.

One entry point, no more session sprawl

You don't need endless parallel sessions with your AI assistant, each with its own fragile context. One interface, one knowledge base that grows with you. You go back to interacting with AI the way it felt at the beginning (a single, general conversation), except now it has memory.

But you're not locked in

Nothing stops you from running multiple sessions if you want to. They won't contaminate each other: project tags keep domains clean. Use one session or ten, the knowledge stays organized either way.

Maintainable and scalable

Skills are modular: add a new workflow without touching the core instructions. Tags scale naturally: your tenth project works exactly like your first. The knowledge base grows without degrading, because every piece of information has a domain and a structure.

Works with any AI assistant

The knowledge base is plain markdown with frontmatter and wikilinks: no proprietary format, no vendor lock-in. The wiki structure works with any AI tool that can read files: Claude Code, Cursor, Windsurf, GitHub Copilot, Gemini CLI, Codex, or any future assistant. The knowledge you build is yours, portable, and readable by humans and machines alike.
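That portability is easy to verify: a page is just frontmatter plus markdown, readable with a few lines of standard-library code. The frontmatter keys and page text below are made up for illustration; the format itself (YAML frontmatter, [[wikilinks]]) is the one described above.

```python
import re

# A hypothetical wiki page in the format described above.
page = """---
type: concept
tags: [alpha, beta]
---
A PoC validates an idea cheaply. See [[RAG]] and [[Project Alpha]].
"""

# Split the frontmatter block from the body, then read key: value pairs.
_, front, body = page.split("---", 2)
meta = {
    key.strip(): value.strip()
    for key, value in (line.split(":", 1) for line in front.strip().splitlines())
}
links = re.findall(r"\[\[([^\]]+)\]\]", body)

print(meta["type"], meta["tags"])  # concept [alpha, beta]
print(links)                       # ['RAG', 'Project Alpha']
```

Any tool that can split a string and match a regex can read the knowledge base; nothing here depends on a specific assistant.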

Search, transform, deliver

Every task follows the same empowered workflow: the agent searches the knowledge base first, enriches its understanding with what you've already validated, then transforms that into the output you need. The result isn't generic: it's informed, refined, and grounded in your accumulated knowledge.

The workflow

01

Research

Search the knowledge base, gather context

→
02

Transform

Enrich with validated knowledge

→
03

Result

Grounded, informed output

How it works.

Knowledge and action, in a loop. Drop documents in, get sharper answers out.

01

Drop a document

Place any source file in raw/ – articles, reports, notes, papers.

02

Ingest

The agent reads it, extracts entities, concepts, and connections. Tags it to your project.

03

Ask questions

Query in natural language. The agent classifies your intent and searches the right axis.

04

Get grounded answers

Answers cite your validated knowledge with [[wikilinks]]. No hallucination, no guessing.

Workflow – the full ingest-to-answer pipeline
