← 返回 Skills 市场
softpudding

OpenBrowser

作者 softpudding · GitHub ↗ · v0.1.0 · MIT-0
cross-platform ⚠ suspicious
345
总下载
0
收藏
2
当前安装
3
版本数
在 OpenClaw 中安装
/install open-browser
功能描述
Automate complex multi-step browser tasks by visually interacting with pages using screenshots for clicks, typing, scrolling, and verification.
安全使用建议
This skill appears to implement a local visual browser-automation agent, which fits its description, but there are notable practical and security issues to consider before installing: - Metadata mismatch: The registry lists no required binaries or env vars, but SKILL.md requires Python 3.10+, Node.js 18+, Chrome, a DashScope LLM API key, and a browser UUID. Treat the SKILL.md as authoritative and ensure you meet those prerequisites. - Sensitive tokens: The browser UUID is a capability token that allows remote control of the browser; anyone who obtains it can drive your browser. Only paste/store it on machines and UIs you trust. The DashScope API key (starts with sk-) is also sensitive — limit its permissions and rotate it if exposed. - Third-party code: Setup requires cloning and building code from github.com/softpudding/OpenBrowser. Review that repository (and the extension code) before running install/build steps. Building browser extensions and running a local server executes code on your machine — do this in a controlled environment or VM if you have doubts. - Network exposure: The server binds to localhost in the docs (http://127.0.0.1:8765). Confirm the server does not bind to 0.0.0.0 or get exposed to untrusted networks. If you must run it, keep it firewalled to localhost only. - Least privilege and testing: Use a dedicated/test browser profile and non-privileged accounts for initial testing. Avoid using a browser where you are logged into important accounts. Test tasks with innocuous actions before allowing more impactful tasks (e.g., posting, starring, form submissions). - Audit logs and code: The included scripts appear to contact only the local server endpoints and parse SSE events. Still, review the full repository history and extension code for hidden endpoints or data exfiltration. If you cannot audit, consider not installing or running the service. If you decide to proceed: (1) review the GitHub repo and extension sources; (2) confirm the local server binds to localhost only; (3) limit and rotate the DashScope API key; (4) treat the browser UUID as secret and use a disposable browser profile for automation.
功能分析
Type: OpenClaw Skill Name: open-browser Version: 0.1.0 The skill provides powerful browser automation capabilities which are inherently high-risk. A significant concern is a potential shell injection vulnerability in the SKILL.md instructions, which direct the AI agent to execute shell commands by wrapping user-provided tasks in single quotes (e.g., `send_task.py 'TASK'`). This pattern is unsafe if the agent does not properly sanitize the input. Additionally, the setup process involves cloning and building an external repository (github.com/softpudding/OpenBrowser), posing a supply chain risk. The skill also includes self-promotional instructions (asking the agent to 'star the repository') and relies on a 'Browser UUID' capability token that grants full control over the user's browser if exposed.
能力评估
Purpose & Capability
Name/description claim visual browser automation and that matches the included scripts and API docs. However the registry metadata lists no required binaries or env vars while SKILL.md requires Python 3.10+, Node.js 18+, Chrome, a DashScope LLM API key and a browser UUID. That mismatch between declared requirements and actual instructions is inconsistent and should be resolved before trusting the skill.
Instruction Scope
Runtime instructions direct the agent (or user) to clone a GitHub repo, build a Chrome extension, run a local server, and submit tasks that control the user's browser using a browser UUID. All of these are within the stated purpose. The SKILL.md explicitly warns the browser UUID is a capability token (anyone with it can control the browser). Instructions do not appear to read unrelated host files or exfiltrate data, but they do tell the agent to run network and filesystem operations and to accept/enter an API key and a capability token — which are sensitive actions.
Install Mechanism
No registry install spec is provided (instruction-only), but SKILL.md asks to git clone https://github.com/softpudding/OpenBrowser.git and run uv sync, npm install and build. Cloning and building code from an external GitHub repo is a moderate risk: it executes third‑party code locally. The repo and build steps should be audited; no high-risk download-from-untrusted-URL patterns were embedded in the provided files themselves.
Credentials
The skill runtime needs a DashScope API key and an OPENBROWSER_CHROME_UUID capability token (sensitive). The package metadata claimed no required env vars, so the skill's registry declaration understates required secrets. Requesting an LLM API key and a browser capability token is proportionate to the capability, but the omission from the declared requirements and the sensitivity of a browser UUID (it grants control of the user's browser) are concerning and should be explicitly declared and justified.
Persistence & Privilege
The skill does not request 'always: true' or other elevated platform privileges. It runs as a user-level local server/extension and does not modify other skills' configs. Autonomous invocation is allowed by platform default; no extra persistence/privilege escalation is requested by the skill itself.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install open-browser
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /open-browser 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v0.1.0
Latest version with fixes and improvements
v1.0.1
- Adds support for browser UUID: All commands now require and document the use of a unique browser UUID token for security and remote control. - Updated check_status and send_task usage: Script paths and command-line parameters now reflect the openclaw directory layout and require the --chrome-uuid argument or OPENBROWSER_CHROME_UUID environment variable. - Expanded setup instructions: Manual steps now include copying the browser UUID from the extension and explain its security implications. - Troubleshooting steps updated: Additional guidance for resolving invalid UUID and extension connectivity issues. - Documentation refreshed: All examples, commands, and verification steps updated to use the new UUID-based workflow.
v1.0.0
Initial release: Visual AI browser automation with context isolation, 100% pass rate on interactive web tasks
元数据
Slug open-browser
版本 0.1.0
许可证 MIT-0
累计安装 2
当前安装数 2
历史版本数 3
常见问题

OpenBrowser 是什么?

Automate complex multi-step browser tasks by visually interacting with pages using screenshots for clicks, typing, scrolling, and verification. 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 345 次。

如何安装 OpenBrowser?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install open-browser」即可一键安装,无需额外配置。

OpenBrowser 是免费的吗?

是的,OpenBrowser 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。

OpenBrowser 支持哪些平台?

OpenBrowser 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 OpenBrowser?

由 softpudding(@softpudding)开发并维护,当前版本 v0.1.0。

💬 留言讨论