n24q02m/imagine-mcp
Production-grade MCP server for image and video understanding + generation across Gemini, OpenAI, and Grok.
Production-grade MCP server for image and video understanding + generation across Gemini, OpenAI, and Grok.
npx add-skill n24q02m/imagine-mcpProduction-grade MCP server for image and video understanding + generation across Gemini, OpenAI, and Grok.
Skill for Claude, Cursor & Copilot that automates the Karpathy LLM Wiki workflow: ingest web, GitHub, and YouTube URLs into a well-structured, citable, cross-referenced knowledge base with automatic linting.
Give Claude a smarter video input. Scene-aware frame extraction, per-frame OCR, auto-chunked Whisper transcripts. Skill for Claude Code, claude.ai, and Codex.
Turn an article, essay, or Markdown note into a narrated video package: spoken script, AI voiceover, sidecar SRT subtitles, generated images, and a final MP4.
AgentCall lets AI Agents join meetings with voice, video & screen-share to build together. Supports Google Meet, Teams, Zoom (Beta)
System audio capture + multi-provider ASR + local-first AI review workspace. Floating live captions, 12 ASR backends, 60+ languages, AI summary/chat/mindmap, Open API, MCP server, and Agent Skill.
Hermes Agent and OpenClaw skills for XRToken API — AI image and video generation