farhanic017/vision-tool
Image & video analysis for AI coding assistants without native vision. Works with any model - CLI, MCP, or opencode skill. 12 vision backends (Gemini, GPT-4o, Claude, etc.). Zero hardcoded secrets.
Image & video analysis for AI coding assistants without native vision. Works with any model - CLI, MCP, or opencode skill. 12 vision backends (Gemini, GPT-4o, Claude, etc.). Zero hardcoded secrets.
npx skills add farhanic017/vision-toolImage & video analysis for AI coding assistants without native vision. Works with any model - CLI, MCP, or opencode skill. 12 vision backends (Gemini, GPT-4o, Claude, etc.). Zero hardcoded secrets.
System audio capture + multi-provider ASR + local-first AI review workspace. Floating live captions, 12 ASR backends, 60+ languages, AI summary/chat/mindmap, Open API, MCP server, and Agent Skill.
Diagrams that move — a Claude AI skill that generates animated flowcharts & architecture diagrams as self-contained HTML/SVG. Flowing connectors, requests traveling as light dots.
Pixazo Agent Skills — install one with: npx skills add Pixazo-AI/skills --skill <model>. 70 skills covering image, video, music, voice, 3D, virtual try-on. Pixazo API key required.
Claude Code skill for Avito Реклама public API — DSP cabinet (CPM/CPC banners, HTML, video) with read access, group budget/price control, multi-account transfers, and ORD advertiser/contract setup. OAuth client_credentials + sandbox-first.
Production multi-agent Claude Code skill — turns a voiceover MP3 into a full whiteboard-explainer YouTube episode. Brief-on-disk + SHA256 verify, gated 2-wave generation, vision review loops.
🎵 简谱练习视频生成器 | MusicXML → MP4 (竖版/横版) | Cursor & Codex Skill