Communitygithub.com

graph-robots/open-robot-skills

Skill and tool bundles for gap (graph as policy) — Anthropic Agent Skills format, discovered by path

相容平台Claude CodeCodex CLI~CursorGemini CLI
npx skills add graph-robots/open-robot-skills

Ask in your favorite AI

Open a new chat with this agent skill pre-loaded.

說明文件

vlm

API-backed vision-language Q&A. Zero GPU: every provider is a remote endpoint. Images are gap-native uint8 [H, W, 3] numpy arrays, PNG-encoded on the wire.

Providers

Selected by GAP_VLM_PROVIDER (default openrouter); each tool also accepts a per-call provider= override.

ProviderBackendConfig (env)
openrouterOpenRouter's OpenAI-compatible chat-completions APIOPENROUTER_API_KEY (or GAP_VLM_API_KEY); GAP_VLM_MODEL (default gemini-3.1-flash-lite-preview, see DEFAULT_MODEL in tools.py); set GAP_VLM_BASE_URL to point at another OpenAI-compatible server (e.g. a local vLLM)
vertexVertex AI via google-genai (Gemini models)GAP_VLM_MODEL, GAP_VLM_PROJECT_ID, GAP_VLM_REGION

The vertex provider lazy-imports google-genai — install the engine's vertex extra first: pip install "graph-as-policy[vertex]".

When to use

  • Semantic scene checks and checkpoint verification (vlm.query_yes_no).
  • Free-form scene descriptions or attribute queries (vlm.query).
  • Prefer gemini-er.detect when you need pixel-space bounding boxes, and molmo.point_prompt when you need a single click point.

Notes

  • vlm.query_yes_no coerces with the source-verbatim rule: answer is true iff "yes" appears in the lowercased reply.
  • Requests carry no system prompt and no temperature knob (mirrors the original vlm.v1 proto); both providers pin temperature: 0.0.

相關技能