browser-use/browser-use
A library that enables agents to use browsers naturally like humans to perform web-based tasks.
npx add-skill browser-use/browser-use
This repo contains 4 individual skills — each has its own dedicated page.
Automates browser interactions for web testing, form filling, screenshots, and data extraction. Use when the user needs to navigate websites, interact with web pages, fill forms, take screenshots, or extract information from web pages.
Documentation reference for using Browser Use Cloud, the hosted API and SDK for browser automation. Use this skill whenever the user needs help with the Cloud REST API (v2 or v3), browser-use-sdk (Python or TypeScript), X-Browser-Use-API-Key authentication, cloud sessions, browser profiles, profile sync, CDP WebSocket connections, stealth browsers, residential proxies, CAPTCHA handling, webhooks, workspaces, skills marketplace, liveUrl streaming, pricing, or integration patterns (chat UI, subagent, adding browser tools to existing agents). Also trigger for questions about n8n/Make/Zapier integration, Playwright/Puppeteer/Selenium on cloud infrastructure, or 1Password vault integration. Do NOT use this for the open-source Python library (Agent, Browser, Tools config); use the open-source skill instead.
Documentation reference for writing Python code using the browser-use open-source library. Use this skill whenever the user needs help with Agent, Browser, or Tools configuration, is writing code that imports from browser_use, asks about @sandbox deployment, supported LLM models, Actor API, custom tools, lifecycle hooks, MCP server setup, or monitoring/observability with Laminar or OpenLIT. Also trigger for questions about browser-use installation, prompting strategies, or sensitive data handling. Do NOT use this for Cloud API/SDK usage or pricing — use the cloud skill instead. Do NOT use this for directly automating a browser via CLI commands — use the browser-use skill instead.
Controls a local browser from a sandboxed remote machine. Use when the agent is running in a sandbox (no GUI) and needs to navigate websites, interact with web pages, fill forms, take screenshots, or expose local dev servers via tunnels.
Provider-agnostic parallel agent coordination for LLMs — Anthropic, OpenAI, Ollama, or any OpenAI-compatible endpoint. Parallel worktree isolation, interface contracts, and tier-gated execution at scale.
Agent-agnostic Claude Code / Qoder / Cursor / Custom GPT skills for Git workflows (GitHub / GitLab / Gitea). Includes release-sop; pr-review / issue-triage / changelog-bot / hotfix-flow planned.
Claude Code skill enforcing Gitflow, conventional commits, semantic versioning, and issue-driven branching.
The open-source managed agents platform. Turn coding agents into real teammates — assign tasks, track progress, compound skills.
Provide specialized AI agents that develop, review, debug, and deploy production-ready code efficiently across various programming tasks.
Copilot agent examples and experiments for developer workflows.