← 返回 Skills 市场
tangweigang-jpg

Browser Use Agent

作者 Tang Weigang · GitHub ↗ · v0.1.0 · MIT-0
cross-platform ⚠ suspicious
64
总下载
0
收藏
0
当前安装
1
版本数
在 OpenClaw 中安装
/install browser-use-agent
功能描述
Browser-Use:把 LLM 变成网页操作员的异步 Python 库(Python 3.11+)。Agent 步循环采集 DOM + 截图 → LLM 一次调用产出 thinking / evaluation / next_goal / action[] → 经 CDP 执行。 Browser-Use: a...
使用说明 (SKILL.md)

这个 skill 适合什么用户?能做哪些任务?

概览

Browser-Use 是把 LLM 变成网页操作员的异步 Python 库(github.com/browser-use/browser-use)。Agent 步循环:(1) 通过 14 个 watchdog 围绕 bubus.EventBus 采集 BrowserStateSummary(带数字索引的 DOM、截图、tab 列表、页面状态);(2) 一次 LLM 调用同时产出 thinking + evaluation_previous_goal + memory + next_goal + action[];(3) 经 CDP 原语在双层 page-change 守卫下执行动作。

CD...

Doramagic 晶体页: https://doramagic.ai/zh/crystal/browser-use-agent

知识规模

  • 40 条约束 (4 fatal + 36 non-fatal)
  • 上游源码: browser-use/browser-use @ commit f3878b0e
  • 蓝图 ID: finance-bp-133

用法

Host AI(Claude Code / Cursor / OpenClaw)读 references/seed.yaml,按其中的:

  • intent_router 匹配用户意图
  • architecture 理解项目架构
  • constraints 应用 anti-pattern 约束
  • business_decisions 参考核心设计决策

FAQ 摘要

这个 skill 适合什么用户?能做哪些任务?

适合做网页自动化的工程师:表单填写、信息抓取、回归测试、跨站点数据采集等。Agent 把视觉理解 + 操作规划合并到一次 LLM 调用,比传统 Playwright 脚本更适合非确定性页面。访问 doramagic.ai/r/browser-use 查看完整用例。

需要准备什么环境?依赖什么?

Python 3.11+,Chromium 系浏览器(local_browser_watchdog 自动启动或通过 cdp_url 接管),至少一个 LLM provider 配置(默认 ChatBrowserUse项目自家微调模型)。要求 async event loop——Agent / BrowserSession 接口是 async-native。

会踩哪些坑?这个 skill 怎么防护?

本 skill 内置 40 条约束(4 条 fatal)。典型踩坑:(1) alert/confirm/beforeunload 自动接受,破坏性确认(如 'Delete this')也会通过;(2) Agent(sensitive_data=...) 不配 Browser(allowed_domains=[...]) 是 fail-OPEN(只 warning 不 raise),合规场景必须显式


完整文档: 见 references/seed.yaml (v6.1 schema). 浏览页: https://doramagic.ai/zh/crystal/browser-use-agent

安全使用建议
This skill mixes a browser-automation description with a large, unrelated finance blueprint that instructs the host to run local python checks and read/write host paths. Before installing or invoking it: 1) Ask the publisher/author where this package came from and why finance seed.yaml is bundled with a browser agent. 2) Inspect references/seed.yaml fully (especially preconditions and install_trigger) and confirm you are comfortable with any commands it would make your host run. 3) Do not provide secrets or credentials; avoid running in production or on machines with sensitive data. 4) Prefer skills with a clear homepage/source repo and explicit, minimal install steps; if you must test, run in an isolated environment (VM/container) with no access to personal files. If you want, I can list the exact precondition commands and file paths that the seed.yaml would require the host to run so you can review them line-by-line.
功能分析
Type: OpenClaw Skill Name: browser-use-agent Version: 0.1.0 The skill bundle is a configuration for the 'browser-use' web automation library and the 'zvt' quant trading framework. It contains extensive safety constraints (e.g., preventing destructive actions without confirmation, requiring domain lockdowns for sensitive data) and environment preconditions to ensure the host system is properly configured. While the 'seed.yaml' file appears to be a hybrid of two different domains (web automation and financial trading), likely due to template reuse, there is no evidence of malicious intent, data exfiltration, or harmful prompt injection. The instructions are focused on operational reliability and security hardening for the AI agent.
能力标签
cryptocan-make-purchasesrequires-sensitive-credentials
能力评估
Purpose & Capability
SKILL.md and human_summary advertise a browser-agent (web automation) skill, but the referenced seed.yaml is compiled as 'finance-bp-133' with ZVT/backtesting preconditions and finance-specific blueprints. That financial blueprint content (zvt checks, MACD defaults, finance rules) does not align with a pure browser automation purpose and suggests the package wires in unrelated capabilities.
Instruction Scope
Runtime instructions require the host AI to 'read references/seed.yaml' and follow an execution_protocol that mandates reloading seed.yaml, running preconditions (python import and filesystem checks), and executing install/verification steps. These instructions direct the agent to run local shell/python commands and reference host filesystem paths (e.g., ZVT_HOME, host_workspace), which is broader than a simple knowledge-only browser doc and risks unexpected local actions.
Install Mechanism
The skill is instruction-only (no install spec), which normally is low-risk. However seed.yaml contains an install_trigger and expects 'resources.host_adapter.install_recipes[]' and import verification; this is inconsistent with 'no installation required' in SKILL.md and could prompt the host to perform installs via its own adapter. No explicit external download URLs are present in the skill bundle itself.
Credentials
Declared requirements list no env vars, yet the execution protocol and preconditions reference environment/state (ZVT_HOME, local writable directories) and run Python import checks. The skill also covers use cases that may handle sensitive data (auto-fill forms, checkout, password manager paths & negative terms mention '1password'). Requesting no credentials while instructing the agent to access local paths and run checks is inconsistent and disproportionate.
Persistence & Privilege
always is false and autonomous invocation is allowed (platform default). The seed.yaml's rule 'On any behavioral decision, agents MUST re-read seed.yaml' is aggressive (forces frequent re-loads) but does not equate to always:true or system-wide config modification. This increases runtime footprint but is not itself an explicit privilege escalation.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install browser-use-agent
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /browser-use-agent 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v0.1.0
Browser-Use LLM browser automation skill — 40 constraints / 4 fatal. CDP-driven, 14 watchdogs, 8 prompt templates.
元数据
Slug browser-use-agent
版本 0.1.0
许可证 MIT-0
累计安装 0
当前安装数 0
历史版本数 1
常见问题

Browser Use Agent 是什么?

Browser-Use:把 LLM 变成网页操作员的异步 Python 库(Python 3.11+)。Agent 步循环采集 DOM + 截图 → LLM 一次调用产出 thinking / evaluation / next_goal / action[] → 经 CDP 执行。 Browser-Use: a... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 64 次。

如何安装 Browser Use Agent?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install browser-use-agent」即可一键安装,无需额外配置。

Browser Use Agent 是免费的吗?

是的,Browser Use Agent 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。

Browser Use Agent 支持哪些平台?

Browser Use Agent 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 Browser Use Agent?

由 Tang Weigang(@tangweigang-jpg)开发并维护,当前版本 v0.1.0。

💬 留言讨论