second-brain

A personal RAG assistant: point it at your notes, docs, and code, then ask questions and get answers cited to your own sources. Runs on the home AI lab — free local models by default, one env-flip to Claude quality.

uv sync
cp .env.example .env
artjeck                    # start talking to Artjeck directly
sb ingest ~/Notes            # or any file/folder (md, txt, pdf, code…)
sb ask "what did I decide about the gateway routing?"
sb learn "My preferred invoice format is PDF with line items."
sb agent                    # ask questions, teach it with /learn <fact>
sb chat                      # interactive
sb overnight                 # safe nightly scan: ingest changed files + write a morning report
sb morning                   # daily briefing from overnight runs, tasks, and cited RAG
sb status
sb eval                      # retrieval benchmark (add --answers for chat-model checks)
SB_COLLECTION=second_brain_regression sb eval evals/regression.json --ingest-corpus
sb eval --answers --trace-output data/eval-traces.json
sb eval-intake               # build a private 100-question benchmark in the terminal

Why it's built this way (the interesting decisions)

OpenAI-compatible everywhere. llm.py talks to one OpenAI-style endpoint. Default is the GTX Ollama directly (free, no key, both models warm). Switch to .env.gateway and the same code routes through the LiteLLM gateway — free local embeddings + Claude for the answer, with a budget cap. Develop for free, ship with quality, zero code change.
Citations are mandatory. The system prompt forces the model to answer only from retrieved context and cite [n]; the CLI prints the source files + distances. No source, no claim — that's the trust signal that separates a real RAG product from a demo.
Hybrid retrieval rescues exact lookups. Vector search finds semantic matches; a tiny local BM25 pass rescues exact section/list lookups embeddings miss. Results are fused and de-duplicated, keyword hits expand to adjacent chunks, and every answer still has to be cited. SB_HYBRID toggles it for A/B checks.
Learning is explicit and inspectable. sb learn and /learn in agent mode write Markdown memory files under SB_MEMORY_DIR, then ingest them through the same cited RAG pipeline. The model does not silently save its own guesses as truth.
Agent workflow is explicit. Artjeck routes each console turn through a LangGraph workflow that can answer, learn, ingest files, and manage tasks.
Tasks are structured. SB_STATE_DB is a SQLite database for durable tasks and assistant state. It is separate from semantic memory.
Lean, not framework-soup. Small codebase, the OpenAI SDK + Chroma/Qdrant, no LangChain. You can read every step of the pipeline (chunk → embed → store → retrieve → ground → cite), which is the point — it shows you understand RAG, not just how to import it.
Local-first, lab-ready vector store. Chroma on disk is the default zero-infra path. Set SB_STORE=qdrant to use the AI Lab's hosted Qdrant service without changing CLI code.

Architecture

files ──▶ ingest.py ──chunk──▶ llm.embed ──▶ Store (Chroma or Qdrant)
                                                  │
learn ──▶ memory.py ──markdown──▶ ingest.py ──────┤
                                                  │
question ──▶ LangGraph ──ask──▶ Store.query ──top-k──▶ llm.answer ──▶ cited answer
              │
              ├─learn──▶ memory.py
              ├─ingest─▶ ingest.py
              └─task───▶ SQLite (`SB_STATE_DB`)

Roadmap (each item maps to a JD checkbox)

Swap Chroma → lab Qdrant (hosted vector DB)
Explicit learned memory (sb learn, /learn in agent mode)
LangGraph turn workflow + SQLite task store
Hybrid search (vector + BM25 keyword) — keyword rescue for exact section/list lookups, off via SB_HYBRID=0
Wrap as an MCP server so Claude Desktop / Claude Code can query your brain directly (docs)
Eval harness: benchmarks, rubric checks, local traces/spans, and trajectories (docs)
Web UI (Next.js) + deploy → the public-URL portfolio piece

MCP server

Expose the brain to any MCP client (Claude Code, Claude Desktop, a native iPad/MacBook client) so an agent can query and teach it directly. It reuses this engine, so it inherits the free-local-model gateway defaults — no extra cost. Full guide: docs/MCP.md.

uv run sb-mcp                       # stdio (local clients launch it as a subprocess)
SB_MCP_TRANSPORT=http \
SB_MCP_HOST=0.0.0.0 SB_MCP_PORT=8848 \
SB_MCP_TOKEN=$(openssl rand -hex 24) uv run sb-mcp   # Streamable HTTP, token-protected

Tools: ask (cited answer), recall (raw chunks, no model call), ingest, learn, list_tasks, add_task, complete_task, status. Claude Code auto-loads the stdio server from the repo's .mcp.json — just run claude here (works over SSH from the iPad). Inspect interactively with npx @modelcontextprotocol/inspector.

Config

.env (see .env.example / .env.gateway.example): OPENAI_BASE_URL, OPENAI_API_KEY, EMBED_MODEL, CHAT_MODEL, SB_STORE, SB_COLLECTION, SB_MEMORY_DIR, SB_STATE_DB, and retrieval knobs SB_CHUNK_SIZE / SB_CHUNK_OVERLAP / SB_TOP_K / SB_HYBRID.

Chroma default

SB_STORE=chroma
SB_DATA=./data/chroma
SB_MEMORY_DIR=./data/memory
SB_STATE_DB=./data/artjeck.sqlite3

Qdrant on the AI Lab

SB_STORE=qdrant
QDRANT_URL=http://127.0.0.1:6333
QDRANT_API_KEY=<from /opt/ai-lab/.env on the Alienware>

After switching stores, ingest again because Chroma and Qdrant keep separate collections.

Storage Policy

On ArtJack's machines, the Mac Mini should stay light:

Mac Mini keeps the Artjeck code and small SQLite state DB.
Shared Disk E keeps learned Markdown memories, large source files, exports, and backups.
Alienware Qdrant should hold the heavy vector index once QDRANT_API_KEY is configured.

Current recommended local .env shape:

SB_MEMORY_DIR=/Volumes/DISK/AI/artjeck/memory
SB_STATE_DB=./data/artjeck.sqlite3
QDRANT_URL=http://127.0.0.1:6333

Put large documents under /Volumes/DISK/AI/artjeck/inbox, then ingest from there:

artjeck
/ingest /Volumes/DISK/AI/artjeck/inbox

Learning

Teach a fact directly:

sb learn "ArtJack prefers short implementation summaries with exact file paths."
sb ask "How does ArtJack prefer implementation summaries?"

Use agent mode:

artjeck
/learn The AI Lab gateway is the preferred front door for shared model routing.
What is the preferred front door for shared model routing?
/ingest ~/Notes
/task Review new lease documents
/tasks
/done 1
/exit

Learned memories are normal Markdown files in SB_MEMORY_DIR, so you can inspect, edit, delete, or re-ingest them. This keeps learning controlled: the system learns from what you teach it, not from unverified model guesses.

Overnight worker

The overnight worker is the safe "make him smarter while I sleep" loop. It scans configured folders, ingests new or changed supported files, extracts likely tasks into a report, and writes an audit trail under data/overnight/. It does not edit, move, delete, send, or buy anything.

First run it manually:

uv run sb overnight --dry-run
uv run sb overnight

The first run creates data/overnight/config.json. Edit targets there to point at your real inboxes or synced Alienware folders. Reports land in data/overnight/reports/.

To schedule it daily on the Mac Mini:

chmod +x deploy/install-overnight-service.sh
SB_OVERNIGHT_HOUR=3 SB_OVERNIGHT_MINUTE=15 deploy/install-overnight-service.sh

Launchd runs deploy/run-nightly.sh, which refreshes project-context notes, performs sb overnight, syncs tasks, runs health checks, and then writes sb morning. Logs land in data/overnight/logs/. Keep the first few runs read-only and review the reports before giving Artjeck any action permissions beyond ingesting and reporting.

Morning briefing

After the overnight worker runs, generate a daily briefing:

uv run sb morning

It writes a Markdown briefing under data/morning/ with the latest overnight run, open tasks, possible follow-ups found in recent reports, and a project inventory from configured scan targets. To also ask the indexed brain with model calls:

uv run sb morning --rag

Refresh source-backed project notes any time:

uv run sb project-context --ingest

Sync explicit follow-ups into the durable task store:

uv run sb task-sync
uv run sb task-sync --dry-run

Run read-only health checks for lab services and projects:

uv run sb health

Fetch a website into the brain as a cited source:

uv run sb web-check https://example.com
uv run sb web-check https://example.com --no-ingest

For JavaScript-rendered pages, use a real browser capture:

uv run sb browser-check https://example.com
uv run sb browser-check https://example.com --no-screenshot --no-ingest

sb agent starts the same conversation loop. The artjeck command is just the named shortcut intended for daily use.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
.github/workflows		.github/workflows
deploy		deploy
docs		docs
evals		evals
examples		examples
src/secondbrain		src/secondbrain
tests		tests
.env.example		.env.example
.env.gateway.example		.env.gateway.example
.gitignore		.gitignore
.mcp.json		.mcp.json
README.md		README.md
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

second-brain

Why it's built this way (the interesting decisions)

Architecture

Roadmap (each item maps to a JD checkbox)

MCP server

Config

Chroma default

Qdrant on the AI Lab

Storage Policy

Learning

Overnight worker

Morning briefing

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

second-brain

Why it's built this way (the interesting decisions)

Architecture

Roadmap (each item maps to a JD checkbox)

MCP server

Config

Chroma default

Qdrant on the AI Lab

Storage Policy

Learning

Overnight worker

Morning briefing

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages