DevvGwardo/grok-imagine-video
Recently updated agent-skill-related GitHub repository: DevvGwardo/grok-imagine-video.
Recently updated agent-skill-related GitHub repository: DevvGwardo/grok-imagine-video.
npx skills add DevvGwardo/grok-imagine-videoRecently updated agent-skill-related GitHub repository: DevvGwardo/grok-imagine-video.
video-speech 是一个 Codex Skill,用于从抖音、B站、小红书视频中提取口播文案,并输出清稿后的 Markdown 文本
Lip-sync a face to a specific audio track on RunComfy via the `runcomfy` CLI. Routes across ByteDance OmniHuman (audio-driven full-body avatar from a portrait + audio), Sync Labs sync v2 / Pro (state-of-the-art mouth sync onto a video), Kling lipsync (audio-to- video and text-to-video with synced speech), and Creatify lipsync. The skill picks the right endpoint for the user's actual intent — portrait still + audio (avatar-style), source video + audio (mouth- swap on existing footage), or generate-and-sync from a script. Triggers on "lip sync", "lipsync", "make this video speak", "match audio to mouth", "dub video", "sync lips to voice", "Sync Labs", "voiceover sync", or any explicit ask to drive a face's mouth from an audio track.
Agent skill for multi-speaker meeting transcription with FunASR speaker diarization and LLM cleanup. Supports zh/en/ja/ko/yue. GPU & CPU. Packaged as a Claude Code plugin.
A Claude Code skill that watches videos for you — frames + transcript together. Tuned for creators reverse-engineering what makes hooks and retention beats actually work.
AI agent skills for video generation, lipsync, subtitles and background removal. Works with Claude Code, Cursor, and any agent that supports Markdown skills.
Local AI filmmaking studio — skills, canvas, timeline — driven from your coding agent.