google-gemini/gemini-api-dev

Use this skill when building applications with Gemini API hosted models, including Gemini and Gemma 4, working with multimodal content (text, images, audio, video), implementing function calling, using structured outputs, or needing current model specifications. Covers SDK usage (google-genai for Python, @google/genai for JavaScript/TypeScript, com.google.genai:google-genai for Java, google.golang.org/genai for Go), model selection, and API capabilities.

Compatible platforms: Claude Code, Codex CLI, Cursor, Gemini CLI
npx skills add https://github.com/google-gemini/gemini-skills/tree/main/skills/gemini-api-dev

Documentation

Individual skills in this repo

This repo contains 3 individual skills, each with its own dedicated page.

Related skills

michaelczesun/claude-ai-ugc-diy

AI UGC video skill pack for Claude Code — fork of arcads-claude-code, rebuilt on fal.ai (pay-per-use, no 70€/mo subscription). Honest about what works and what doesn't.

community

SearchNova/ai-tool-video-creator

Claude Code Skill - automated creation of videos that showcase AI tools

community

jimliu/baoyu-post-to-weibo

Posts content to Weibo (微博). Supports regular posts with text, images, and videos, and headline articles (头条文章) with Markdown input via Chrome CDP. Use when user asks to "post to Weibo", "发微博", "发布微博", "publish to Weibo", "share on Weibo", "写微博", or "微博头条文章".

community

badrnewgames/claude-subagent-editor

🔧 Manage your AI agents easily with Claude Subagent Editor, a web-based tool for editing configuration files using a simple drag-and-drop interface.

community

doany-ai/kling-3-0

Kling 3.0 video generation on RunComfy. Kling 3.0 (also called Kling V3.0) is Kuaishou Technology's third-generation multi-shot video model with native synchronized audio and consistent character identity across shots. This skill covers all six Kling 3.0 endpoints, spanning three rendering tiers (Standard, Pro, 4K) and two modes (text-to-video, image-to-video). Calls runcomfy run kling/kling-3.0/<tier>/<mode> through the local RunComfy CLI. Triggers on "kling", "kling 3.0", "kling v3", "kling pro", "kling 4k", "kling text to video", "kling image to video", or any explicit ask to generate or animate with Kling 3.0.

community

derek-zhuolin/interflow-video-cut

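An Agent Skill that automatically cuts a local talking-head video into a finished card-style video: extract the audio track → transcribe with ElevenLabs → AI writes an HTML card for each segment → render to MP4. 10 visual styles × 4 layouts. Transcription goes through ElevenLabs only and never downloads a local model. Turn a talking-head video into an AI-composed card-based video.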

community