xor777

AgentBrowse

Author: Dmitry Ukhanov · GitHub · v0.1.22 · MIT-0
cross-platform ✓ Security check passed
Total downloads: 248 · Favorites: 0 · Current installs: 0 · Versions: 22
Install in OpenClaw
/install agentbrowse
Description
Browser automation workflows through the agentbrowse CLI for launch, attach, observe, act, extract, navigation, and screenshots.
Usage (SKILL.md)

AgentBrowse is the browser layer for agent tasks that happen on a real website.

Use this skill when the agent needs to:

  • launch a browser or attach to an existing one;
  • inspect the current page and decide from visible state;
  • click, type, select, and otherwise act on returned target refs;
  • navigate directly to a known URL;
  • extract structured data from the page;
  • capture screenshots or recover a stuck browser session.

AgentBrowse works well on its own for browser automation. It can also be paired with MagicPay later when a broader flow reaches an approved login, identity, or payment step.

Open source:

  • Browser library and docs: https://github.com/MercuryoAI/agentbrowse
  • CLI package: @mercuryo-ai/agentbrowse-cli

Setup

  • agentbrowse must be available on PATH. If it is missing or outdated, run npm i -g @mercuryo-ai/agentbrowse-cli@latest, then verify with agentbrowse --version.
  • agentbrowse launch needs an environment that can start a browser. agentbrowse attach <cdp-url> needs a reachable CDP endpoint.
  • Core browser commands such as launch, attach, navigate, act, browser-status, screenshot, and close do not need any API key.
  • AI-assisted features (observe with a natural-language goal, and extract) call an LLM through the gateway. Configure API access with agentbrowse init <apiKey> before using them. Pass a non-default API URL during init if needed.
  • agentbrowse doctor inspects the local config. Use it after init when AI-assisted observe or extract still fails.
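
A minimal preflight sketch in Python, assuming only what the Setup notes state: the binary is named agentbrowse and must be on PATH, and only extract and goal-driven observe need an API key. The helper names (cli_available, needs_api_key) are hypothetical and not part of AgentBrowse, and treating plain observe as key-free is an inference from the wording above.

```python
import shutil

# Commands the Setup notes list as working without an API key.
CORE_COMMANDS = {"launch", "attach", "navigate", "act",
                 "browser-status", "screenshot", "close"}


def cli_available(binary: str = "agentbrowse") -> bool:
    """Return True if the binary can be found on PATH."""
    return shutil.which(binary) is not None


def needs_api_key(command: str, has_goal: bool = False) -> bool:
    """Hypothetical helper: does this command call the LLM gateway?

    Assumption: observe without a natural-language goal is not AI-assisted.
    """
    if command == "extract":
        return True
    if command == "observe":
        return has_goal
    if command in CORE_COMMANDS:
        return False
    raise ValueError(f"command not covered by the Setup notes: {command}")
```

If cli_available() returns False, the install command from the first Setup bullet is the documented fix.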

Core Loop

  1. Start or connect to a browser with agentbrowse launch [url] or agentbrowse attach <cdp-url>.
  2. Read the page with agentbrowse observe.
  3. Act on the returned refs with agentbrowse act <targetRef> <action> [value].
  4. Re-run agentbrowse observe after navigation or meaningful UI changes.
  5. Use agentbrowse navigate <url> when the destination is already known.
  6. Use agentbrowse extract '<schema-json>' [scopeRef] when you need structured output instead of another page action.
  7. Use agentbrowse screenshot or agentbrowse browser-status only for evidence and debugging.
  8. Finish with agentbrowse close when the browser session is no longer needed.
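
The loop above can be sketched as a set of argument-list builders in Python. This follows only the command shapes shown in the steps; the ref value "ref-1", the action name "click", and the schema shape are placeholders, since real refs come from observe output and the accepted action names and schema format are defined by the CLI, not by this sketch.

```python
import json
from typing import Optional


def launch(url: Optional[str] = None) -> list[str]:
    """agentbrowse launch [url]"""
    return ["agentbrowse", "launch"] + ([url] if url else [])


def observe() -> list[str]:
    """agentbrowse observe"""
    return ["agentbrowse", "observe"]


def act(target_ref: str, action: str, value: Optional[str] = None) -> list[str]:
    """agentbrowse act <targetRef> <action> [value]"""
    argv = ["agentbrowse", "act", target_ref, action]
    if value is not None:
        argv.append(value)
    return argv


def extract(schema: dict, scope_ref: Optional[str] = None) -> list[str]:
    """agentbrowse extract '<schema-json>' [scopeRef]"""
    argv = ["agentbrowse", "extract", json.dumps(schema)]
    if scope_ref is not None:
        argv.append(scope_ref)
    return argv


def close() -> list[str]:
    """agentbrowse close"""
    return ["agentbrowse", "close"]


# Hypothetical plan: every ref, action, and schema value here is a placeholder.
plan = [
    launch("https://example.com"),
    observe(),
    act("ref-1", "click"),
    observe(),                      # re-observe after the UI changed
    extract({"title": "string"}),
    close(),
]
```

Each entry can be passed to subprocess.run(argv, check=True). Building a list rather than a shell string avoids quoting problems with the JSON schema argument.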

When To Bring In Another Tool

Bring in a companion protected-flow tool when the site reaches:

  • a login step that needs approved protected values;
  • an identity form with protected personal data;
  • a payment step with protected card details or approval flow.

At that point AgentBrowse can stay the browsing layer around the protected step, but it should not invent its own secret-handling flow.

Ask-User Boundary

Ask the user only when:

  • the correct next step is still ambiguous after re-observing the page;
  • the environment cannot launch or attach to a browser;
  • the task crosses into a protected approval or payment boundary.
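
As an illustration, the three conditions above can be encoded as a small Python check. The function name and its boolean inputs are hypothetical; how an agent decides each flag is outside this sketch.

```python
def ask_user_reasons(ambiguous_after_reobserve: bool,
                     cannot_start_browser: bool,
                     protected_boundary: bool) -> list[str]:
    """Return the reasons to stop and ask the user; empty means keep going."""
    reasons = []
    if ambiguous_after_reobserve:
        reasons.append("next step still ambiguous after re-observing the page")
    if cannot_start_browser:
        reasons.append("environment cannot launch or attach to a browser")
    if protected_boundary:
        reasons.append("task crosses into a protected approval or payment boundary")
    return reasons
```

An empty list means none of the listed conditions applies and the agent should continue on its own.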

Operating Rules

  • Trust the visible page state, not assumptions about what should have happened.
  • Re-observe after meaningful page changes instead of reusing stale refs.
  • Keep browser work and protected-step handling separated.
  • close is only teardown or recovery. Never treat close as a success signal — task success comes from the visible page state before close.
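
One way to enforce the stale-ref rule is a small tracker that forgets every ref whenever the page changes. This is a sketch in Python under stated assumptions: RefTracker and StaleRefError are hypothetical names, and refs are treated as opaque strings taken from the latest observe result.

```python
class StaleRefError(RuntimeError):
    """Raised when a ref from an earlier page state is about to be reused."""


class RefTracker:
    def __init__(self) -> None:
        self._fresh: set[str] = set()

    def record_observe(self, refs) -> None:
        """Replace the known refs with those from the latest observe."""
        self._fresh = set(refs)

    def mark_page_changed(self) -> None:
        """Call after navigation or a meaningful UI change."""
        self._fresh.clear()

    def require_fresh(self, ref: str) -> str:
        """Return the ref if it came from the latest observe, else raise."""
        if ref not in self._fresh:
            raise StaleRefError(f"re-observe before acting on {ref!r}")
        return ref
```

Calling require_fresh before every act turns a silent misclick on an outdated ref into an explicit prompt to re-observe.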

More Detail

Open an extra reference only when it helps:

If a term (session, ref, targetRef, scopeRef, fillRef, pageRef) is unfamiliar, check the AgentBrowse API reference glossary.

Security guidance
This skill appears to do what it says: automate and interact with a real browser using the agentbrowse CLI. Before installing or using it, verify the npm package owner and version on the npm/GitHub pages, and inspect the CLI source if you can. Do not enable or use AI-assisted features (observe with free-text goals or extract) on pages that contain credentials, payment details, or other sensitive personal data — those features will send visible page content to whatever LLM gateway you configure. If your flow reaches login/identity/payment, follow the skill's guidance to switch to a protected flow tool that is designed to handle secrets. Finally, be aware the CLI stores the API key in local config (use a dedicated key with least privilege and inspect/secure the config file).
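
The advice about sensitive pages can be approximated with a conservative pre-check before any AI-assisted command. The Python sketch below is a heuristic, not an AgentBrowse feature: it assumes the caller already has each form field's HTML type and autocomplete attribute (how those are obtained is not covered here), and it only recognises standard HTML autofill tokens for passwords, one-time codes, and card data.

```python
# Standard HTML autofill tokens for credentials and payment card data.
SENSITIVE_AUTOCOMPLETE = {
    "current-password", "new-password", "one-time-code",
    "cc-name", "cc-number", "cc-exp", "cc-exp-month", "cc-exp-year", "cc-csc",
}


def field_is_sensitive(input_type: str, autocomplete: str = "") -> bool:
    """True for password inputs and fields tagged with sensitive autofill tokens."""
    if input_type.strip().lower() == "password":
        return True
    tokens = autocomplete.lower().split()
    return any(token in SENSITIVE_AUTOCOMPLETE for token in tokens)


def page_allows_ai_features(fields: list[tuple[str, str]]) -> bool:
    """Hypothetical gate: refuse AI-assisted commands if any field looks sensitive.

    `fields` is a list of (input_type, autocomplete) pairs.
    """
    return not any(field_is_sensitive(t, a) for t, a in fields)
```

A False result is a reason to skip extract and goal-driven observe on that page. A True result is not proof that the page is safe, because unlabeled fields and personal data in plain text are not detected.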
Functional analysis
Type: OpenClaw Skill
Name: agentbrowse
Version: 0.1.22

The agentbrowse skill provides a structured interface for browser automation using the @mercuryo-ai/agentbrowse-cli tool. The documentation (SKILL.md and references/) includes clear operating rules and safety guardrails, specifically instructing the agent to avoid handling sensitive data like credentials or payments directly and instead defer to protected-flow tools or user intervention. No indicators of malicious intent, such as data exfiltration or unauthorized persistence, were found in the code or instructions.
Capability tags
crypto · can-make-purchases · requires-sensitive-credentials
Capability assessment
Purpose & Capability
Name/description (browser automation) align with requirements: the skill only requires the agentbrowse CLI and installs via an npm package that provides the 'agentbrowse' binary. There are no unrelated credentials, binaries, or config paths requested.
Instruction Scope
SKILL.md instructs the agent to run agentbrowse CLI commands (launch, attach, observe, act, extract, screenshot, close). This stays within the browsing domain. Important caveat: AI-assisted commands (observe with natural-language goal and extract) call an LLM gateway and will send page content to that gateway; the CLI stores an API key locally via 'agentbrowse init <apiKey>'. The skill explicitly warns to switch to a protected flow for logins/payments, but the runtime behavior can transmit visible page data to an external LLM if used — review before running on pages with sensitive content.
Install Mechanism
Install is a single npm package (@mercuryo-ai/agentbrowse-cli) that provides the 'agentbrowse' binary. This is a typical mechanism for CLIs and not an arbitrary download; risk is moderate (npm registry), so verify package provenance and version before installing.
Credentials
The skill does not declare required environment variables or credentials. AI-assisted features require an API key provided at runtime via 'agentbrowse init' (stored in local config) — this is proportional to the advertised LLM features and is optional for core browsing commands.
Persistence & Privilege
always is false and the skill does not request elevated/persistent platform privileges or to modify other skills. Default autonomous invocation is allowed (normal). The skill may store an API key in its own local config via the CLI, which is expected behavior for LLM integration.
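How to use
  1. Make sure OpenClaw is installed (locally or via Docker).
  2. Enter the install command in the chat box: /install agentbrowse
  3. After installation, invoke the skill by name or trigger it with /agentbrowse.
  4. Provide the inputs described in the skill's parameter notes to get structured output.
Version history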
v0.1.22
Release agentbrowse-v0.1.22
v0.1.21
Release agentbrowse-v0.1.21
v0.1.20
Release agentbrowse-v0.1.20
v0.1.19
Release agentbrowse-v0.1.19
v0.1.18
Release agentbrowse-v0.1.18
v0.1.17
Release agentbrowse-v0.1.17
v0.1.16
Release agentbrowse-v0.1.16
v0.1.15
Release agentbrowse-v0.1.15
v0.1.14
Release agentbrowse-v0.1.14
v0.1.13
Release agentbrowse-v0.1.13
v0.1.12
Release agentbrowse-v0.1.12
v0.1.11
Release agentbrowse-v0.1.11
v0.1.10
Release agentbrowse-v0.1.10
v0.1.9
Release agentbrowse-v0.1.9
v0.1.8
Release agentbrowse-v0.1.8
v0.1.7
Release agentbrowse-v0.1.7
v0.1.6
Release agentbrowse-v0.1.6
v0.1.5
Release agentbrowse-v0.1.5
v0.1.4
Release agentbrowse-v0.1.4
v0.1.3
Release agentbrowse-v0.1.3
Metadata
Slug agentbrowse
Version 0.1.22
License MIT-0
Total installs 0
Current installs 0
Versions 22
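FAQ

What is AgentBrowse?

Browser automation workflows through the agentbrowse CLI for launch, attach, observe, act, extract, navigation, and screenshots. It is an AI agent skill plugin for Claude Code / OpenClaw, with 248 total downloads so far.

How do I install AgentBrowse?

Run the command "/install agentbrowse" in the OpenClaw or Claude Code chat box. The skill installs in one step with no extra configuration; the agentbrowse CLI itself must still be available on PATH, as described under Setup.

Is AgentBrowse free?

Yes. AgentBrowse is free and released under the MIT-0 license, so it can be downloaded, installed, and used freely.

Which platforms does AgentBrowse support?

AgentBrowse is cross-platform and can be used in any environment where OpenClaw / Claude Code is deployed.

Who develops AgentBrowse?

It is developed and maintained by Dmitry Ukhanov (@xor777). The current version is v0.1.22.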
