Description

Browser automation CLI for AI agents. Runs a full browser engine: navigation & interaction, data extraction & network capture, screenshots, form automation, multi-browser parallel operation, user-configured proxy support, and human-agent collaboration.

README (SKILL.md)

browser-act

Name: browser-act
Author: 21551594

Browser automation CLI for AI agents. Runs a full browser engine: navigation & interaction, data extraction & network capture, screenshots, form automation, multi-browser parallel operation, user-configured proxy support, and human-agent collaboration.

Features

Lightweight extraction: fast JS-rendered content fetch without opening a browser session; an advanced WebFetch/curl replacement
Session management: multi-browser isolation, multi-account parallel operation
Verification assistance: when automation encounters interactive challenges, assists completion with user authorization
Complex interaction: network capture (XHR/fetch/HAR), screenshots, form filling, file upload
Human-agent collaboration: headed mode plus remote assist for manual steps
Safety controls: the Confirmation Gate protocol requires explicit user approval before browser creation, deletion, and sensitive operations
Universal compatibility: works with Cursor, Claude Code, Codex, Windsurf, etc.

Install: `uv tool install browser-act-cli --python 3.12`

Start here

Before running any browser-act command, load the usage guide from the CLI:

```
browser-act get-skills core --skill-version 2.0.0   # start here: workflows, common patterns, troubleshooting
```

**Do NOT skip this step regardless of how simple the command seems.**

**Do NOT truncate the output.** It contains operational directives and
environment state that are critical for correct operation. Truncating will
cause you to miss browser selection rules and safety constraints.

`get-skills core` provides environment status, available browsers, operational
directives, and the complete interaction workflow, none of which are available
through `--help`.
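
One way to honor the no-truncation rule is to write the guide to a file and read it in full, rather than piping it through `head`, `tail`, or a pager. The following is a minimal sketch for a POSIX shell. The helper name `load_guide` and the output file name are hypothetical; the only browser-act command involved is the one shown above.

```shell
# Hypothetical helper: run a command and save its complete stdout and stderr to a file.
# Usage: load_guide OUT_FILE CMD [ARGS...]
load_guide() {
  out_file="$1"
  shift
  # Redirect instead of piping through head/tail, so nothing is cut off.
  "$@" > "$out_file" 2>&1
  rc=$?
  # Report the size so a suspiciously short guide gets noticed.
  echo "saved $(wc -c < "$out_file" | tr -d ' ') bytes to $out_file (exit status $rc)" >&2
  return "$rc"
}
```

A call might look like `load_guide core-guide.txt browser-act get-skills core --skill-version 2.0.0`, after which the whole file is read before any other browser-act command is issued.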

Usage Guidance

Install only if you trust the BrowserAct CLI source. Prefer a pinned, audited CLI version, review the runtime guide before following it, use a dedicated browser profile, and require explicit confirmation before logins, form submissions, uploads, account changes, or CAPTCHA assistance.
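
The pinning and confirmation advice could look roughly like this in practice. It is a sketch under stated assumptions: `X.Y.Z` is a placeholder for a version you have reviewed yourself (the source names none), and `confirm` is a hypothetical helper, not part of browser-act. The `package==version` form is ordinary uv requirement syntax.

```shell
# Hypothetical gate: ask the user before running a sensitive command.
# Usage: confirm "description of the action" && some-command
confirm() {
  printf '%s [y/N] ' "$1" >&2
  read -r reply
  case "$reply" in
    y|Y|yes|YES) return 0 ;;
    *) return 1 ;;   # anything else, including an empty answer, means "no"
  esac
}

# Assumption: X.Y.Z stands for a version you have audited; the source does not name one.
# confirm "Install browser-act-cli X.Y.Z?" && uv tool install "browser-act-cli==X.Y.Z" --python 3.12
```

Defaulting to "no" keeps an accidental Enter keypress from approving a login, upload, or form submission.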

Capability Assessment

ℹ Purpose & Capability

The stated purpose is coherent for a browser automation tool, but the documented capabilities include high-impact actions such as authenticated sessions, form submission, file upload, network capture, screenshots, and verification assistance.

⚠ Instruction Scope

The skill requires the agent to run a CLI-provided guide before any command and treat its output as operational directives. Those runtime instructions are not included in the reviewed artifact and could steer the agent beyond the static SKILL.md.

⚠ Install Mechanism

The registry has no install spec or code files, while SKILL.md instructs installation of an external, unpinned PyPI CLI. The scanner therefore did not review the executable code that would perform the browser automation.

ℹ Credentials

Network, filesystem, browser-profile, local Chrome CDP, and optional verification-assistance API use are disclosed and generally aligned with browser automation, but they are broad and should be used only with explicit user intent.

ℹ Persistence & Privilege

The skill discloses persistent browser profiles, login sessions, cookies, credentials, and rotated session logs. This is expected for session automation but creates local sensitive-data exposure if profiles or logs are mishandled.
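
One common mitigation for this kind of exposure is to restrict filesystem permissions on the directories that hold profiles and logs. The sketch below is illustrative only: `restrict_dir` is a hypothetical helper, and the source does not say where browser-act stores its profiles or logs, so the path must come from your own installation.

```shell
# Hypothetical helper: remove group/other access from a directory tree.
# Usage: restrict_dir DIR
restrict_dir() {
  dir="$1"
  [ -d "$dir" ] || { echo "not a directory: $dir" >&2; return 1; }
  # go-rwx strips read/write/execute for group and others, recursively.
  chmod -R go-rwx "$dir"
}
```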

Version History

v1.0.0

Initial release of browser-act skill.
- Provides browser automation for AI agents, including navigation, data extraction, interaction, and multi-browser support.
- Handles JavaScript-rendered content, verification prompts, sessions, form filling, screenshots, uploads, and network capture.
- Enforces user safety with approval required for sensitive actions and browser/session management.
- Stores all data and credentials locally; nothing is uploaded except captcha images for solve-captcha.
- Strongly recommends starting with the get-skills core command for proper workflows and safety guidelines.

Metadata

Slug browser-act-skill-v1

Version 1.0.0

License MIT-0

All-time Installs 0

Active Installs 0

Total Versions 1

Frequently Asked Questions

What is browser-act?

Browser automation CLI for AI agents. Runs a full browser engine: navigation & interaction, data extraction & network capture, screenshots, form automation, multi-browser parallel operation, user-configured proxy support, and human-agent collaboration. It is an AI Agent Skill for Claude Code / OpenClaw, with 112 downloads so far.

How do I install browser-act?

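Run "/install browser-act-skill-v1" in the OpenClaw or Claude Code chat to install the skill in one step. The skill's README additionally instructs installing the CLI itself with `uv tool install browser-act-cli --python 3.12`.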

Is browser-act free?

Yes, browser-act is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does browser-act support?

browser-act is cross-platform and runs anywhere OpenClaw or Claude Code is available.

Who created browser-act?

It is built and maintained by 21551594 (@21551594); the current version is v1.0.0.
