rajpra808/browser-agent

Vision-based browser automation CLI — LLM sees screenshot, clicks coordinates, repeats. No selectors. Supports Claude, Gemini, OpenAI, Ollama (local/free).

対応Claude CodeCodex CLI~CursorGemini CLI
npx skills add rajpra808/browser-agent

Ask in your favorite AI

Open a new chat with this agent skill pre-loaded.

ドキュメント

rajpra808/browser-agent

Vision-based browser automation CLI — LLM sees screenshot, clicks coordinates, repeats. No selectors. Supports Claude, Gemini, OpenAI, Ollama (local/free).

Source: https://github.com/rajpra808/browser-agent

Discovered during the daily awesomeskills.dev agent-skill hunt.

関連スキル