farhanic017/vision-tool

Image & video analysis for AI coding assistants without native vision. Works with any model - CLI, MCP, or opencode skill. 12 vision backends (Gemini, GPT-4o, Claude, etc.). Zero hardcoded secrets.

対応✓Claude Code~Codex CLI~Cursor✓Gemini CLI✓OpenCode

npx skills add farhanic017/vision-tool

オリジナルを見る→すべてのスキルを見る

Ask in your favorite AI

Open a new chat with this agent skill pre-loaded.

ChatGPT Claude Gemini Grok Perplexity DeepSeek

ドキュメント

farhanic017/vision-tool

Image & video analysis for AI coding assistants without native vision. Works with any model - CLI, MCP, or opencode skill. 12 vision backends (Gemini, GPT-4o, Claude, etc.). Zero hardcoded secrets.

関連スキル

XimilalaXiang/DeLive

System audio capture + multi-provider ASR + local-first AI review workspace. Floating live captions, 12 ASR backends, 60+ languages, AI summary/chat/mindmap, Open API, MCP server, and Agent Skill.

community

csthink/dashmotion

Diagrams that move — a Claude AI skill that generates animated flowcharts & architecture diagrams as self-contained HTML/SVG. Flowing connectors, requests traveling as light dots.

community

Pixazo-AI/skills

Pixazo Agent Skills — install one with: npx skills add Pixazo-AI/skills --skill <model>. 70 skills covering image, video, music, voice, 3D, virtual try-on. Pixazo API key required.

community

atomachinskiy/claude-skill-avito-ads

Claude Code skill for Avito Реклама public API — DSP cabinet (CPM/CPC banners, HTML, video) with read access, group budget/price control, multi-account transfers, and ORD advertiser/contract setup. OAuth client_credentials + sandbox-first.

community

Alexander-Kz/video-layer-skill

Production multi-agent Claude Code skill — turns a voiceover MP3 into a full whiteboard-explainer YouTube episode. Brief-on-disk + SHA256 verify, gated 2-wave generation, vision review loops.

community

jasonzhangshuo/solfege-video

🎵 简谱练习视频生成器 | MusicXML → MP4 (竖版/横版) | Cursor & Codex Skill

community

← More ビデオ＆アニメーション skills