NoteBrain CLI Skill for AI Agents

NoteBrain is a high-performance Go CLI tool that indexes an Obsidian vault into a local ChromaDB vector database. It provides semantic search, graph traversal, backlink discovery, hidden relationship finding, full note retrieval, and tag analysis across markdown notes.

Core Execution Principles & Rationale

To operate efficiently and prevent wasted tokens or hung sessions, follow these foundational principles:

NoteBrain Only — No Generic File Search: Never use grep, find, ls or ad-hoc shell scripting against the vault's markdown files to answer the user's question. NoteBrain's commands are purpose-built on top of the indexed vector and graph database and will always produce higher-quality, more relevant results than a filesystem-level text search. Treat notebrain as the only interface to the vault's content. If a notebrain command appears to return nothing useful, refine the notebrain query (different phrasing, broader --limit, alternate command) rather than falling back to a generic file search.
Non-Interactive Execution (--format json): By default, NoteBrain launches an interactive TUI (Bubble Tea) designed for human browsing, which will hang automated agent sessions. Always pass --format json (or --format ndjson / --format tsv) on query commands so you receive structured, parseable data immediately. All JSON envelope fields use snake_case keys (note_slug, title, file_path, score, tags, text).
AI Agent Command Chaining (--jsonpath): Whenever you only need specific fields (like extracting note slugs or text to pipe into follow-up commands), use --jsonpath (e.g., --jsonpath="$.results[0].note_slug" or --jsonpath="$.results[*].note_slug"). When --jsonpath is used, scalar values are output as raw unquoted strings and arrays print each item on a new line, enabling seamless shell variable extraction without needing jq.
Retrieve Complete Notes (notebrain get): When search returns a relevant note chunk, use notebrain get <note-slug-or-path> to retrieve the complete markdown note, reconstructed from all of its indexed chunks, rather than guessing chunk indices. Never reach for cat on the underlying vault file: get reconstructs chunked notes correctly and respects the indexed state of the vault.
Retrieve Content for Synthesis (--include-text): By default, search and traversal commands return lightweight metadata envelopes (titles, file paths, tags, similarity scores). Whenever your task requires summarizing or reasoning about individual chunks directly from search results, append --include-text. The returned text field contains rich markdown with preserved code blocks, so code snippets are copy-pasteable.
Binary Resolution & Quoting: In development environments, execute ./notebrain if notebrain is not on the system PATH. Note titles often contain spaces or brackets (e.g., "Q3 Planning [Draft]"), so always enclose note titles, tags, and search queries in double quotes.
Embedded Persistent Storage: NoteBrain stores its index exclusively in an embedded ChromaDB database (~/.notebrain/chroma). No standalone ChromaDB Docker containers or HTTP servers are required or supported.
Automated Ingestion: Ingestion is handled via OS schedulers (cron/systemd timers every 3 hours). You do not need to manually run ingest before queries unless the user explicitly requests an immediate re-index after editing notes.
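
The binary-resolution and non-interactive rules above can be folded into one small wrapper. This is a minimal sketch, not part of NoteBrain itself: `resolve_notebrain` and `nb_json` are hypothetical helper names, and the sketch assumes `--format json` is accepted after the subcommand arguments, as in the examples in this document.

```shell
# Hypothetical helper: print the notebrain binary to use.
# Prefers the one on PATH, falls back to ./notebrain (development checkouts).
resolve_notebrain() {
  if command -v notebrain >/dev/null 2>&1; then
    printf '%s\n' "notebrain"
  elif [ -x ./notebrain ]; then
    printf '%s\n' "./notebrain"
  else
    printf '%s\n' "notebrain binary not found on PATH or in ./" >&2
    return 1
  fi
}

# Hypothetical helper: run any query command non-interactively.
# Arguments are passed through "$@", so quoted titles keep their
# spaces and brackets intact.
nb_json() {
  nb_bin=$(resolve_notebrain) || return 1
  "$nb_bin" "$@" --format json
}

# Usage (requires an indexed vault):
#   nb_json search "Q3 Planning [Draft]" --limit 5
```

Passing arguments through `"$@"` rather than `$*` is what preserves the double-quoting rule: each quoted title or query reaches the binary as a single argument.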

Command Selection Guide

Select the specialized command tailored to the user's analytical goal:

| User Intent | Command | Why & How to Use |
| --- | --- | --- |
| "What do my notes say about X?" | `search` | Performs vector similarity search across all note chunks. Use `--tag="TagName"` to filter by tag. |
| "Read the full content of note Y" | `get` | Retrieves and reconstructs the complete note text and metadata by slug or file path. |
| "What links directly to this note?" | `backlinks` | Finds explicit `[[wikilink]]` references pointing to the target note. |
| "What is structurally nearby in the graph?" | `connections` | Executes breadth-first search (BFS) over wikilinks up to `--hops N`. Keep `--hops 1` or `--hops 2` to avoid exponential blowup. |
| "What is related in meaning but NOT linked?" | `hidden` | Surfaces unlinked-but-semantically-similar notes. Highly valuable for discovering conceptual bridges. |
| "Find concepts related to X centered around note Y" | `boosted` | Combines vector similarity with graph proximity to a `--seed` note. |
| "What notes share tags with X?" | `tags` | Analyzes tag overlap. Returns clean tag strings in the `tags` array. |
| "Is the database up to date?" | `stats` | Outputs collection counts (`chunks`, `links`). Supports `--format=json` and `--jsonpath`. |
| "Index or re-index the vault" | `ingest` | Synchronizes markdown notes into ChromaDB. Re-ingestion is idempotent. |

Command Syntax

Semantic Search & Tag Filtering (`search`)

notebrain search "kubernetes reconciliation" --tag="Kubernetes" --limit 5 --format json --include-text

Key search flags

| Flag | Purpose | Default |
| --- | --- | --- |
| `--top-k N` | Maximum chunks to retain per note. Prevents one long note from dominating results while preserving depth. | `3` |
| `--context-window N` | Fetches ±N adjacent chunks around each match and populates the `context` array. Use `1` or `2` for most tasks. | `0` |
| `--has-tasks` | Only return chunks that contain task lists (checkboxes). | off |
| `--has-code` | Only return chunks that contain fenced code blocks. | off |
| `--section` | Filter results to chunks under a specific heading path (e.g., `"Architecture > Components"`). | none |
| `--limit N` | Maximum total results to return. | `10` |
| `--tag "TagName"` | Filter or search by tag name. | none |
| `--min-score F` | Suppress results below this similarity score (0–1). | `0` |

When to use `--context-window` vs `notebrain get`

Use --context-window 1 when you need lightweight surrounding context for multiple search results at once. This is efficient — one search call gives you both the matched chunks and their neighbors, enough to understand the local context without fetching entire notes.
Use notebrain get when you need the complete reconstructed note text — for instance, when the user asks to read or summarize a specific note, or when you need to see the full document structure.

The windowing approach is especially valuable when processing many search results, because fetching full notes for every result would be wasteful. Fetch the window first, then selectively get only the notes that truly need full context.
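
The window-first strategy described above can be sketched as two steps. The helper names `search_with_window` and `get_full_notes` are hypothetical, the `--limit 5` value is an arbitrary choice for illustration, and the decision about which slugs deserve a full fetch is left to whoever reads the windowed results.

```shell
# Step 1 (hypothetical helper): one search call returns the matched chunks
# plus their ±1 neighbouring chunks in the `context` array.
search_with_window() {
  notebrain search "$1" --context-window 1 --include-text --limit 5 --format json
}

# Step 2 (hypothetical helper): fetch the full reconstructed note only for
# the slugs passed in, i.e. the notes that still need full context.
get_full_notes() {
  for slug in "$@"; do
    notebrain get "$slug" --format json
  done
}

# Usage (requires an indexed vault):
#   search_with_window "etcd storage"
#   get_full_notes "02areaskubernetes-architecture"
```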

Complete Note Retrieval (`get`)

notebrain get "02areaskubernetesckadkubernetes-native-applications" --format json

Graph Connections & Hidden Links (`connections`, `hidden`)

notebrain connections "Distributed Systems" --hops 2 --format json
notebrain hidden "Microservices" --limit 5 --format json --include-text

JSON Output Schema

Understanding the result structure is essential for reliable field extraction. Every query command wraps results in this envelope:

{
  "command": "search",
  "query": "Semantic Search: \"kubernetes\"",
  "total": 3,
  "results": [
    {
      "note_slug": "02areaskubernetes-architecture",
      "title": "Kubernetes Architecture",
      "file_path": "02.Areas/Kubernetes/Architecture.md",
      "score": 0.82,
      "chunk_index": 2,
      "tags": ["Kubernetes", "DevOps"],
      "heading_path": "Components > Control Plane",
      "text": "The API server validates and configures...",
      "context": [
        "Previous chunk text about etcd storage...",
        "The API server validates and configures...",
        "The scheduler watches for newly created Pods..."
      ]
    }
  ]
}

| Field | Present When | Description |
| --- | --- | --- |
| `note_slug` | Always | URL-safe identifier derived from the file path. |
| `title` | Always | Note title from frontmatter or filename. |
| `file_path` | Always | Relative path within the vault. |
| `score` | Always | Similarity score (0–1) for search; hop count for connections. |
| `chunk_index` | Search, hidden, boosted | Which chunk of the note matched (0-indexed). |
| `tags` | When note has tags | Array of tag strings. |
| `heading_path` | When chunk is under a heading | Breadcrumb path like `"Section > Subsection"`. |
| `text` | When `--include-text` is passed | The matched chunk's full markdown text, with code blocks preserved. |
| `context` | When `--context-window N` > 0 | Array of chunk texts covering ±N neighbours of the match, ordered by chunk index. In the example above the array also includes the matched chunk itself. |
| `extra` | Connections, tags, boosted | Command-specific info (e.g., `"2 hop(s)"`, `"graph-boosted"`). |
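
Given the envelope above, field extraction with `--jsonpath` might look like the following sketch. `search_slugs` and `unique_slugs` are hypothetical helper names. The sketch relies on the behaviour described earlier in this document: array results are printed one item per line as raw strings. Single quotes around the JSONPath expression keep the shell from touching the `$`; the double-quoted form used elsewhere in this document is equivalent in bash.

```shell
# Hypothetical helper: print the note slugs for a query, one per line.
search_slugs() {
  notebrain search "$1" --limit 5 --format json \
    --jsonpath='$.results[*].note_slug'
}

# Hypothetical helper: drop duplicate slugs while keeping first-seen order,
# since several chunks of the same note can match one query.
unique_slugs() {
  awk '!seen[$0]++'
}

# Usage (requires an indexed vault):
#   search_slugs "kubernetes" | unique_slugs
```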

Deepening Answer Quality: Multi-Signal Retrieval

Before answering any "what do I know about X" or "summarize my notes on X" style question, don't stop at a single search call. Search results alone only surface chunks that are semantically close to the query — they miss the surrounding context the user has deliberately built into their graph. Always widen the picture:

1. Search with context: run search with --format json --context-window 1 --include-text --top-k 2 to get the most relevant chunks with their surrounding paragraphs. This gives you both precision (the matched chunk) and context (what comes before and after) in a single call.
2. Pull backlinks for each seed: run notebrain backlinks <seed-slug> --format json --include-text to retrieve every note that explicitly links to the seed. These are notes the user has manually curated as related, so their content is almost always high-signal and should be weighted heavily in your synthesis.
3. Walk connections outward: run notebrain connections <seed-slug> --hops 2 --format json to map the local graph neighborhood around the seed. This reveals structurally adjacent notes (e.g., notes two links away) that may not show up in a pure vector search but are part of the same knowledge cluster.
4. Check for hidden links: run notebrain hidden <seed-slug> --format json --include-text --context-window 1 to catch conceptually related notes the user hasn't linked yet. Call these out explicitly to the user as potential missing links in their vault, since this is one of NoteBrain's most valuable differentiators over plain search.
5. Synthesize, don't just list: combine the seed note(s), their backlinks, their connections neighborhood, and any hidden results into a single coherent answer, distinguishing what's explicitly linked (high confidence) from what's only semantically similar (worth double-checking).

This search → backlinks → connections → hidden chain should be the default workflow for any non-trivial exploratory question, not just a single search call, because it surfaces both the explicit structure the user built and the implicit structure NoteBrain can detect.

AI Agent Chaining Pipeline

# 1. Search with surrounding context — top-k keeps results diverse across notes
SLUG=$(notebrain search "message broker backpressure" --limit 3 --top-k 2 \
  --context-window 1 --include-text --format json \
  --jsonpath="$.results[0].note_slug")

# 2. Fetch the complete reconstructed note text for the top hit
notebrain get "$SLUG" --format json --jsonpath="$.text"

# 3. Find every note that links to this note, with its content,
#    to ground the answer in what the user has explicitly linked
notebrain backlinks "$SLUG" --format json --include-text

# 4. Walk the graph neighborhood to surface structurally adjacent notes
#    that vector search alone would miss
notebrain connections "$SLUG" --hops 2 --format json

# 5. Surface unlinked-but-related notes as potential missing connections,
#    with windowed context so you can assess relevance without fetching full notes
notebrain hidden "$SLUG" --limit 5 --format json --include-text --context-window 1
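
The pipeline above follows only the top hit. To apply the same chain to every seed, a loop along these lines could be used. This is a sketch under stated assumptions: `expand_seeds` is a hypothetical helper name, and it expects slugs on standard input, one per line, as `--jsonpath` is described to print them.

```shell
# Hypothetical helper: run the backlinks -> connections -> hidden chain
# for every slug read from standard input (one slug per line).
expand_seeds() {
  while IFS= read -r slug; do
    [ -n "$slug" ] || continue   # skip blank lines
    # </dev/null keeps each command from consuming the remaining slugs
    notebrain backlinks "$slug" --format json --include-text </dev/null
    notebrain connections "$slug" --hops 2 --format json </dev/null
    notebrain hidden "$slug" --limit 5 --format json --include-text \
      --context-window 1 </dev/null
  done
}

# Usage (requires an indexed vault):
#   notebrain search "message broker backpressure" --limit 3 --top-k 2 \
#     --format json --jsonpath='$.results[*].note_slug' | sort -u | expand_seeds
```

Deduplicating with `sort -u` before the loop avoids repeating the chain when several chunks of the same note appear in the search results.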

Targeted Retrieval Patterns

# Find code examples about a topic
notebrain search "docker compose networking" --has-code --include-text --format json

# Find actionable tasks related to a project
notebrain search "sprint planning" --has-tasks --include-text --format json

# Search within a specific section hierarchy
notebrain search "authentication" --section="Architecture > Security" --format json

# Get multiple chunks per note for comprehensive coverage of a deep topic
# (--limit caps total results, so it is set high enough for --top-k 5 to take effect)
notebrain search "distributed consensus" --top-k 5 --limit 15 --include-text --format json

Configuration Hierarchy

NoteBrain resolves settings in priority order, highest first:

1. CLI command flags (--vault-path, --vault-name, --chroma-path, --top-k, --context-window)
2. Configuration file (~/.notebrain/config/config.toml or specified via --config)

License

This skill is distributed under the Apache License, Version 2.0. See LICENSE.txt for the full license text.

Allowed Tools

This skill is restricted to invoking the notebrain CLI binary (and its local ./notebrain form) via the Bash tool. No other shell commands — including generic file-search utilities like grep, find or ls against vault files — are sanctioned for use within this skill's workflows.

nmdra/notebrain-cli
