rajpra808/browser-agent
Vision-based browser automation CLI — LLM sees screenshot, clicks coordinates, repeats. No selectors. Supports Claude, Gemini, OpenAI, Ollama (local/free).
Source: https://github.com/rajpra808/browser-agent
Discovered during the daily awesomeskills.dev agent-skill hunt.