farhanic017/vision-tool
Image & video analysis for AI coding assistants without native vision. Works with any model - CLI, MCP, or opencode skill. 12 vision backends (Gemini, GPT-4o, Claude, etc.). Zero hardcoded secrets.
Image & video analysis for AI coding assistants without native vision. Works with any model - CLI, MCP, or opencode skill. 12 vision backends (Gemini, GPT-4o, Claude, etc.). Zero hardcoded secrets.
npx skills add farhanic017/vision-toolImage & video analysis for AI coding assistants without native vision. Works with any model - CLI, MCP, or opencode skill. 12 vision backends (Gemini, GPT-4o, Claude, etc.). Zero hardcoded secrets.
Claude Code skills for analysing YouTube interview transcripts into structured Chinese markdown notes
Provide data-driven YouTube growth insights with Claude Code, including channel audits, SEO, content strategy, and video optimization tools.
AgentCall lets AI Agents join meetings with voice, video & screen-share to build together. Supports Google Meet, Teams, Zoom (Beta)
Universal audio/video transcription with 8 interchangeable backends — local Whisper (offline), YouTube subtitles, Gemini, Groq, OpenAI, Deepgram, AssemblyAI, custom OpenAI-compat. Works as Claude Code skill, slash command, or standalone CLI.
Automate TikTok slideshow marketing for any app or product. Researches competitors, generates AI images, adds text overlays, posts via Postiz, tracks analytics, and iterates on what works. Use when setting up TikTok marketing automation, creating slideshow posts, analyzing post performance, optimizing app marketing funnels, or when a user mentions TikTok growth, slideshow ads, or social media marketing for their app. Covers competitor research (browser-based), image generation, text overlays, TikTok posting (Postiz API), cross-posting to Instagram/YouTube/Threads, analytics tracking, hook testing, CTA optimization, conversion tracking with RevenueCat, and a full feedback loop that adjusts hooks and CTAs based on views vs conversions.
System audio capture + multi-provider ASR + local-first AI review workspace. Floating live captions, 12 ASR backends, 60+ languages, AI summary/chat/mindmap, Open API, MCP server, and Agent Skill.