CommunityProductivity & Collaborationgithub.com

rajpra808/browser-agent

Vision-based browser automation CLI — LLM sees screenshot, clicks coordinates, repeats. No selectors. Supports Claude, Gemini, OpenAI, Ollama (local/free).

Works withClaude CodeCodex CLI~CursorGemini CLI
npx add-skill rajpra808/browser-agent

rajpra808/browser-agent

Vision-based browser automation CLI — LLM sees screenshot, clicks coordinates, repeats. No selectors. Supports Claude, Gemini, OpenAI, Ollama (local/free).

Source: https://github.com/rajpra808/browser-agent

Discovered during the daily awesomeskills.dev agent-skill hunt.

Related Skills