sfkislev/photoshop
Control Adobe Photoshop from the shell via Flue - ExtendScript bridges without an MCP server.
Control Adobe Photoshop from the shell via Flue - ExtendScript bridges without an MCP server.
npx skills add https://github.com/sfkislev/flue/tree/main/skills/photoshopControl Adobe Photoshop from the shell via Flue - ExtendScript bridges without an MCP server.
This repo contains 9 individual skills — each has its own dedicated page.
Control Adobe desktop apps - Photoshop, Illustrator, Premiere, After Effects, InDesign, Audition - from the shell via Flue, without an MCP server.
Control Adobe Audition from the shell via Flue - ExtendScript bridges without an MCP server.
Control Autodesk desktop apps - 3ds Max and friends - from the shell via Flue, without an MCP server.
Control Blender from the shell via Flue - a Python bridge to bpy without an MCP server.
Let agents control software, including Adobe Photoshop, Illustrator, After Effects, Premiere, Autodesk 3DS Max, Blender, Unity, Houdini, and Microsoft Office.
Control SideFX Houdini from the shell via Flue - a Python bridge to hou without an MCP server.
Control Adobe Illustrator from the shell via Flue - ExtendScript bridges without an MCP server.
Control Adobe InDesign from the shell via Flue - ExtendScript bridges without an MCP server.
Control Adobe Premiere Pro from the shell via Flue - ExtendScript bridges without an MCP server.
[Deprecated: use baoyu-imagine] AI image generation with OpenAI, Azure OpenAI, Google, OpenRouter, DashScope, Z.AI GLM-Image, MiniMax, Jimeng, Seedream and Replicate APIs. Supports text-to-image, reference images, aspect ratios, and batch generation from saved prompt files. Sequential by default; use batch parallel generation when the user already has multiple prompts or wants stable multi-image throughput. Use when user asks to generate, create, or draw images.
Vision-driven Android device automation using Midscene. Operates entirely from screenshots — no DOM or accessibility labels required. Can interact with all visible elements on screen regardless of technology stack. Control Android devices with natural language commands via ADB. Perform taps, swipes, text input, app launches, screenshots, and more. Trigger keywords: android, phone, mobile app, tap, swipe, install app, open app on phone, android device, mobile automation, adb, launch app, mobile screen, test android app, verify mobile app, QA on phone, check the app on android, test on device, see if the app works on phone, end-to-end test on android, visual verification on mobile Powered by Midscene.js (https://midscenejs.com)
Generate Excalidraw diagrams from natural language descriptions. Use when asked to "create a diagram", "make a flowchart", "visualize a process", "draw a system architecture", "create a mind map", or "generate an Excalidraw file". Supports flowcharts, relationship diagrams, mind maps, and system architecture diagrams. Outputs .excalidraw JSON files that can be opened directly in Excalidraw.
Automate All Images AI tasks via Rube MCP (Composio). Always search tools first for current schemas.
Mask-driven image inpainting on RunComfy via the `runcomfy` CLI. Routes to Tongyi MAI Z-Image Turbo Inpainting (the dedicated inpainting endpoint with mask, strength, and control-scale) and to identity-preserving edit models (Nano Banana 2 Edit, GPT Image 2 Edit, FLUX Kontext Pro) when a mask isn't available and the region must be described instead. Use for object removal, watermark removal, region replacement, blemish cleanup, and any controlled local edit where a binary mask defines the target area. Triggers on "inpaint", "inpainting", "image inpaint", "remove from image", "fill region", "mask-driven edit", "remove watermark", "remove object", "patch the photo", "fill the hole", or any explicit ask to edit a specific masked region of a still.
Generates images and text via reverse-engineered Gemini Web API. Supports text generation, image generation from prompts, reference images for vision input, and multi-turn conversations. Use when other skills need image generation backend, or when user requests "generate image with Gemini", "Gemini text generation", or needs vision-capable AI generation.