AgentBrowse
/install agentbrowse
AgentBrowse is the browser layer for agent tasks that happen on a real website.
Use this skill when the agent needs to:
- launch a browser or attach to an existing one;
- inspect the current page and decide from visible state;
- click, type, select, and otherwise act on returned target refs;
- navigate directly to a known URL;
- extract structured data from the page;
- capture screenshots or recover a stuck browser session.
AgentBrowse works well on its own for browser automation. It can also be paired with MagicPay later when a broader flow reaches an approved login, identity, or payment step.
Open source:
- Browser library and docs:
https://github.com/MercuryoAI/agentbrowse - CLI package:
@mercuryo-ai/agentbrowse-cli
Setup
agentbrowsemust be available onPATH. If it is missing or outdated, runnpm i -g @mercuryo-ai/agentbrowse-cli@latest, then verify withagentbrowse --version.agentbrowse launchneeds an environment that can start a browser.agentbrowse attach \x3Ccdp-url>needs a reachable CDP endpoint.- Core browser commands such as
launch,attach,navigate,act,browser-status,screenshot, andclosedo not need any API key. - AI-assisted features —
observewith a natural-language goal andextract— call an LLM through the gateway. Configure API access withagentbrowse init \x3CapiKey>before using them. Pass a non-default API URL during init if needed. agentbrowse doctorinspects the local config. Use it afterinitwhen AI-assistedobserveorextractstill fails.
Core Loop
- Start or connect to a browser with
agentbrowse launch [url]oragentbrowse attach \x3Ccdp-url>. - Read the page with
agentbrowse observe. - Act on the returned refs with
agentbrowse act \x3CtargetRef> \x3Caction> [value]. - Re-run
agentbrowse observeafter navigation or meaningful UI changes. - Use
agentbrowse navigate \x3Curl>when the destination is already known. - Use
agentbrowse extract '\x3Cschema-json>' [scopeRef]when you need structured output instead of another page action. - Use
agentbrowse screenshotoragentbrowse browser-statusonly for evidence and debugging. - Finish with
agentbrowse closewhen the browser session is no longer needed.
When To Bring In Another Tool
Bring in a companion protected-flow tool when the site reaches:
- a login step that needs approved protected values;
- an identity form with protected personal data;
- a payment step with protected card details or approval flow.
At that point AgentBrowse can stay the browsing layer around the protected step, but it should not invent its own secret-handling flow.
Ask-User Boundary
Ask the user only when:
- the correct next step is still ambiguous after re-observing the page;
- the environment cannot launch or attach to a browser;
- the task crosses into a protected approval or payment boundary.
Operating Rules
- Trust the visible page state, not assumptions about what should have happened.
- Re-observe after meaningful page changes instead of reusing stale refs.
- Keep browser work and protected-step handling separated.
closeis only teardown or recovery. Never treatcloseas a success signal — task success comes from the visible page state beforeclose.
More Detail
Open an extra reference only when it helps:
- Operating guide for resume and recovery.
- Command guide for every CLI command.
- Failure recovery for common runtime states.
- Boundaries and escalation for safety rules.
If a term (session, ref, targetRef, scopeRef, fillRef, pageRef)
is unfamiliar, check the
AgentBrowse API reference glossary.
- 确保已安装 OpenClaw(本地或 Docker 部署)
- 在对话框中输入安装命令:
/install agentbrowse - 安装完成后,直接呼叫该 Skill 的名称或使用
/agentbrowse触发 - 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
AgentBrowse 是什么?
Browser automation workflows through the agentbrowse CLI for launch, attach, observe, act, extract, navigation, and screenshots. 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 248 次。
如何安装 AgentBrowse?
在 OpenClaw 或 Claude Code 对话框中运行命令「/install agentbrowse」即可一键安装,无需额外配置。
AgentBrowse 是免费的吗?
是的,AgentBrowse 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。
AgentBrowse 支持哪些平台?
AgentBrowse 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。
谁开发了 AgentBrowse?
由 Dmitry Ukhanov(@xor777)开发并维护,当前版本 v0.1.22。