← 返回 Skills 市场
Claw Browser
作者
Can Birlik
· GitHub ↗
· v1.0.2
899
总下载
0
收藏
3
当前安装
3
版本数
在 OpenClaw 中安装
/install claw-browser
功能描述
A visual, human-like web browser for OpenClaw agents.Supports reading,screenshots, and visible mode.
使用说明 (SKILL.md)
\r \r
🌐 Claw Browser Skill\r
\r
This skill gives the agent "Eyes" to browse the web using a real\r
Chromium browser.\r
\r
Unlike simple HTTP requests, claw-browser:\r
\r
- Renders JavaScript\r
- Handles dynamic content\r
- Loads SPA applications\r
- Takes real screenshots\r
- Can run in visible (non-headless) mode\r \r ------------------------------------------------------------------------\r \r
🛠️ Installation & Setup (READ FIRST)\r
\r
⚠️ Before using this tool, the agent MUST set up the Python\r
environment and download browser binaries.\r
\r
If venv does not exist, run this setup command block automatically:\r
\r
# 1. Create virtual environment\r
python3 -m venv venv\r
\r
# 2. Activate it\r
source venv/bin/activate\r
\r
# 3. Install dependencies\r
pip install -r requirements.txt\r
\r
# 4. Download Chromium binaries (CRITICAL STEP)\r
playwright install chromium\r
```\r
\r
> ⚠️ The `playwright install chromium` step is **CRITICAL**.\\r
> Without it, the browser will NOT launch.\r
\r
------------------------------------------------------------------------\r
\r
# 🔧 Tools & Usage\r
\r
## `browser.py`\r
\r
### Arguments\r
\r
-----------------------------------------------------------------------\r
Argument Description\r
----------------------------------- -----------------------------------\r
`--url` The website to visit (**Required**)\r
\r
`--action` Action to perform: `read` or\r
`screenshot`\r
\r
`--visible` (Optional) Runs the browser in\r
visible mode (NOT headless). Use if\r
the user wants to *see* the\r
browser.\r
-----------------------------------------------------------------------\r
\r
------------------------------------------------------------------------\r
\r
# 🚀 Execution Examples\r
\r
⚠️ Always activate the virtual environment first.\r
\r
------------------------------------------------------------------------\r
\r
## 1️⃣ Read a Page (Headless - Default)\r
\r
``` bash\r
source venv/bin/activate && python browser.py --url "https://github.com/trending" --action read\r
```\r
\r
------------------------------------------------------------------------\r
\r
## 2️⃣ Take a Screenshot of a News Site\r
\r
``` bash\r
source venv/bin/activate && python browser.py --url "https://news.ycombinator.com" --action screenshot\r
```\r
\r
This saves:\r
\r
evidence.png\r
\r
------------------------------------------------------------------------\r
\r
## 3️⃣ Visible Mode (Shows Browser UI - Great for Demos)\r
\r
``` bash\r
source venv/bin/activate && python browser.py --url "https://google.com" --action read --visible\r
```\r
\r
This will launch a real Chromium window on screen.\r
\r
------------------------------------------------------------------------\r
\r
## 4️⃣ Visible Mode (WSL / Linux Fix) ⭐️\r
\r
**Use this if you get "Missing X server" or "Display not found" errors:**\r
\r
```bash\r
export DISPLAY=:0 && source venv/bin/activate && python browser.py --url "[https://google.com](https://google.com)" --action read --visible\r
```\r
This sets the `DISPLAY` variable so the browser can open on your screen.\r
\r
------------------------------------------------------------------------\r
\r
# 🧠 When Should the Agent Use This Skill?\r
\r
Use `claw-browser` when:\r
\r
- The page requires JavaScript rendering\r
- The site is dynamic (React, Vue, Angular, etc.)\r
- Screenshots are required\r
- The user explicitly asks to "open" or "see" a website\r
- Traditional HTTP requests fail\r
\r
------------------------------------------------------------------------\r
\r
# ⚡ Summary\r
\r
`claw-browser` transforms your OpenClaw agent from a simple API caller\r
into a **real browser-powered assistant** with visual capabilities.\r
\r
It enables:\r
\r
- Dynamic page interaction\r
- Visual verification\r
- Demo-ready browsing\r
- Real-world automation\r
\r
------------------------------------------------------------------------\r
\r
Made with ❤️ by @canbirlik\r
安全使用建议
This skill is coherent and implements a standard Playwright-based browser. Before installing, consider: 1) Playwright will download Chromium binaries from the network — ensure your environment/network policy permits that. 2) The skill will access any URL it's given (including internal IPs, metadata endpoints, or file:// paths) and will print page text and save a screenshot (evidence.png) to the working directory — avoid letting it visit sensitive internal services or files. 3) If you are concerned about autonomous browsing, restrict invocation to user-only or run the skill in a sandboxed/container environment with limited network access. 4) Review and run the included browser.py yourself if you want to validate behavior; it is small and straightforward. If you want extra safety, add URL whitelisting or input validation before allowing agent-driven calls.
功能分析
Type: OpenClaw Skill
Name: claw-browser
Version: 1.0.2
The skill provides a web browsing capability using Playwright, which inherently grants significant power. The `SKILL.md` instructs the agent to execute shell commands (`playwright install chromium`) for setup, involving downloading and installing binaries. The `browser.py` script can visit arbitrary URLs (user-controlled), execute JavaScript on those pages, and save files (screenshots) to the local filesystem. While these capabilities are plausibly needed for a browser tool, they represent high-risk actions (shell execution, network access, file write, JavaScript execution) that could be leveraged for unintended purposes if the agent is compromised or instructed maliciously, without clear evidence of intentional malicious design within the skill itself.
能力评估
Purpose & Capability
The name/description (visual browser) match the included code and SKILL.md. The skill uses Playwright to render pages, handle JS, take screenshots, and support visible/headless modes — exactly what a browser skill needs.
Instruction Scope
Instructions are specific: create a venv, pip install requirements, run 'playwright install chromium', then execute browser.py with --url and --action. The script navigates to the provided URL, extracts document.body.innerText (first ~5000 chars) and title, and optionally saves a screenshot. There is no attempt to read other system files or environment secrets. Caveat: the skill accepts arbitrary URLs (including internal network and file:// URLs) and will return page text/screenshots — this is expected for a browser but is a potential data-exposure vector if the agent is allowed to fetch internal or sensitive endpoints.
Install Mechanism
There is no packaged install spec; installation is done by running the provided shell steps which use pip and 'playwright install chromium'. These are standard, well-known mechanisms; the Playwright step will download browser binaries from Playwright's upstream — expected for browser automation.
Credentials
The skill does not request any secrets or external credentials. It suggests setting DISPLAY for visible mode (a non-secret OS variable). No environment variables or config paths are required, which is proportionate to its purpose.
Persistence & Privilege
always is false and the skill is user-invocable; model invocation is enabled (platform default). The skill does not request persistent system-wide privileges or modify other skills/configs.
如何使用
- 确保已安装 OpenClaw(本地或 Docker 部署)
- 在对话框中输入安装命令:
/install claw-browser - 安装完成后,直接呼叫该 Skill 的名称或使用
/claw-browser触发 - 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.2
- Added a new section for running in visible mode on WSL/Linux, including DISPLAY environment variable guidance.
- No other changes; functionality and examples remain the same.
v1.0.1
- Fixed minor formatting issues in the description field of SKILL.md.
- No changes to functionality or source code.
- Documentation content remains unchanged except for formatting.
v1.0.0
- Initial release of claw-browser skill for OpenClaw agents.
- Enables web browsing via a real Chromium browser, supporting JavaScript, dynamic content, and single-page applications.
- Provides both reading and screenshot capabilities.
- Supports visible (non-headless) mode for live browser interaction.
- Requires separate installation of Playwright and Chromium for setup.
- Works across darwin, linux, and win32 platforms.
元数据
常见问题
Claw Browser 是什么?
A visual, human-like web browser for OpenClaw agents.Supports reading,screenshots, and visible mode. 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 899 次。
如何安装 Claw Browser?
在 OpenClaw 或 Claude Code 对话框中运行命令「/install claw-browser」即可一键安装,无需额外配置。
Claw Browser 是免费的吗?
是的,Claw Browser 完全免费(开源免费),可自由下载、安装和使用。
Claw Browser 支持哪些平台?
Claw Browser 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。
谁开发了 Claw Browser?
由 Can Birlik(@canbirlik)开发并维护,当前版本 v1.0.2。
推荐 Skills