CommunityKunst & Designgithub.com

sid732/LocalContextRouter

Preflight router that decides locally whether each document page reaches a multimodal model as text, on-device OCR, or an image. Cuts vision-token cost. macOS.

Funktioniert mit~Claude Code~Codex CLI~Cursor
npx skills add sid732/LocalContextRouter

Ask in your favorite AI

Open a new chat with this agent skill pre-loaded.

Dokumentation

Local Context Router

A multimodal model reads a PDF by extracting its text and rendering every page to an image, then paying for both. On a page that is mostly prose, the image is wasted spend. Run this preflight first and send the model only what each page needs.

When to use

Before reading, summarizing, or extracting from a PDF, scan, or screenshot the user has shared.

Requirements

The localcontextrouter package must be installed (pip install localcontextrouter, macOS). It provides the localctx command used below.

Run

Route the document and read the JSON, rendering any visual pages into a folder:

localctx <path-to-document> --json --vision-dir ./lcr-pages

If the localctx command is not available, run the bundled script with a Python that has localcontextrouter installed (the script imports the same package):

python scripts/preflight.py <path-to-document> --json --vision-dir ./lcr-pages

Use the result

The JSON has tokens_saved and a pages array. Each page carries source, text, text_tokens, image_tokens, and image:

  • source: "text": use text directly; do not attach the image.
  • source: "ocr": the page was image-only and has been OCR'd on-device; use text.
  • source: "vision": the page is a table, chart, or diagram; attach the image at image so the model can read it. The text is a rough fallback only.

Assemble the text and OCR pages in reading order, attach images only for the vision pages, and mention tokens_saved if the user cares about cost.

Notes

Everything runs locally and offline; the document does not leave the machine.

Verwandte Skills

neyoon/lindle

An AI orchestration workbench for building visual flows, agent-driven skills, and OpenAI-compatible plugin workflows.

community

jingyibi93/ppt-speaker-notes

Codex skill for generating natural speaker notes from PPT/PPTX/PDF decks, with slide visual analysis and optional PPTX notes insertion.

community

narsinghlaga124/ARIS-in-AI-Offer

Streamline your AI research and campus recruiting tasks with the ARIS agent framework to secure job offers.

community

giuseppe-trisciuoglio/nestjs-best-practices

Provides comprehensive NestJS best practices including modular architecture, dependency injection scoping, exception filters, DTO validation with class-validator, and Drizzle ORM integration. Use when designing NestJS modules, implementing providers, creating exception filters, validating DTOs, or integrating Drizzle ORM within NestJS applications.

community

Postpartum-genushyacinthus29/dotnet-skills

Teach AI agents modern .NET skills for ASP.NET Core, EF, Blazor, MAUI, and more with a growing community catalog

community

jeffallan/rust-engineer

Writes, reviews, and debugs idiomatic Rust code with memory safety and zero-cost abstractions. Implements ownership patterns, manages lifetimes, designs trait hierarchies, builds async applications with tokio, and structures error handling with Result/Option. Use when building Rust applications, solving ownership or borrowing issues, designing trait-based APIs, implementing async/await concurrency, creating FFI bindings, or optimizing for performance and memory safety. Invoke for Rust, Cargo, ownership, borrowing, lifetimes, async Rust, tokio, zero-cost abstractions, memory safety, systems programming.

community