CommunityResearch & Data Analysisgithub.com

steipete/notcrawl

Notion archive: desktop/API sync, Markdown export, page search, read-only SQL.

What is notcrawl?

notcrawl is a Claude Code agent skill that notion archive: desktop/API sync, Markdown export, page search, read-only SQL.

Works with~Claude Code~Codex CLI~Cursor
npx skills add https://github.com/steipete/agent-scripts/tree/main/skills/notcrawl

Installed? Explore more Research & Data Analysis skills: obra/superpowers, affaan-m/ECC, affaan-m/quarkus-verification · View all 6 →

Ask in your favorite AI

Open a new chat with this agent skill pre-loaded.

Documentation

What does notcrawl do?

Use this for local Notion archive questions. Desktop reads local cache; API sync needs NOTION_TOKEN and page access.

Sources

  • DB: ~/.notcrawl/notcrawl.db
  • Pages: ~/.notcrawl/pages
  • CLI: notcrawl

Refresh

notcrawl doctor
notcrawl status
notcrawl sync --source desktop

Use API only when needed and credentials/page access are available:

notcrawl sync --source api

Query Workflow

  1. Check freshness for recent/current Notion questions.
  2. Search pages first; use read-only SQL for exact counts or schema-level analysis.
  3. Use export-md when the user needs Markdown files refreshed.
  4. Report date spans, page/database IDs, counts, and source limits.

Use root or subcommand help for syntax: notcrawl --help, notcrawl search --help, notcrawl sql --help.

Common commands:

notcrawl search --limit 20 "query"
notcrawl databases
notcrawl sql "select count(*) from pages;"
notcrawl export-md

Individual skills in this repo

This repo contains 20 individual skills — each has its own dedicated page.

steipete/agent-transcript

GitHub PR/issue agent transcripts: redact, preview, and insert safely.

steipete/beeper

Beeper cache: contact hints, room lookup, WhatsApp/iMessage traces, FTS.

steipete/browser-use

Existing Chrome automation: Chrome plugin first, mcporter fallback.

steipete/clawsweeper-status

ClawSweeper status: URLs, workflow health, active workers, ops snapshot.

steipete/clickclack

ClickClack ops: chat app, Hetzner deploy, DNS/docs/app, Docker rollout.

steipete/cloudflare-registrar

Cloudflare Registrar: domain availability, prices, registration via mcporter.

steipete/codex-debugging

Codex debugging: codex-rs core/tui/exec/cli/app-server/config.

steipete/create-cli

CLI UX/spec: args, flags, help, output, errors, config, dry-run.

steipete/discord-clawd

Discord-backed OpenClaw agent/session relay; not archive search.

steipete/domain-dns-ops

DNS/domain ops: registrars, zones, redirects, DNS/HTTP verify, manager truth.

steipete/frontend-design

Frontend UI: pages, apps, components, polished non-generic design.

steipete/github-author-context

GitHub contributor context: identity, activity, trust, company/team signal.

steipete/github-cache-hygiene

GitHub quota/cache hygiene: gh, ghx, xcache, gitcrawl, mirrors, limits.

steipete/github-deep-review

GitHub deep review: bugs, PRs, best fix, stale-or-real, read code first.

steipete/github-project-triage

GitHub issue/PR triage: queues, CI, blockers, risk, proof, next actions.

steipete/hopper-debugger

Hopper debugging: macOS/iOS binaries, ObjC/Swift symbols, dyld, LLDB.

steipete/instruments-profiling

Instruments/xctrace profiling: macOS/iOS traces, binaries, args, exports.

steipete/mac-maintenance

Mac upkeep: brew update/upgrade, pull clean repos, empty Trash.

steipete/maintainer-orchestrator

Open-source maintainer orchestration: Codex app workers, work recovery, dependencies, vision, releases.

steipete/markdown-converter

Markdown conversion: PDF, Office, HTML, data, OCR, audio, ZIP, YouTube.

Related Skills