Community생산성 & 협업github.com

rajpra808/browser-agent

Vision-based browser automation CLI — LLM sees screenshot, clicks coordinates, repeats. No selectors. Supports Claude, Gemini, OpenAI, Ollama (local/free).

지원 대상Claude CodeCodex CLI~CursorGemini CLI
npx skills add rajpra808/browser-agent

Ask in your favorite AI

Open a new chat with this agent skill pre-loaded.

문서

rajpra808/browser-agent

Vision-based browser automation CLI — LLM sees screenshot, clicks coordinates, repeats. No selectors. Supports Claude, Gemini, OpenAI, Ollama (local/free).

Source: https://github.com/rajpra808/browser-agent

Discovered during the daily awesomeskills.dev agent-skill hunt.

관련 스킬