← 返回 Skills 市场
baiyunrei2025

Agent Browser Skill

作者 baiyunrei2025 · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ⚠ suspicious
383
总下载
0
收藏
1
当前安装
1
版本数
在 OpenClaw 中安装
/install agent-browser-skill
功能描述
Automate web browsing with navigation, form filling, clicking, screenshots, data extraction, and testing using Chrome via OpenClaw browser tool.
使用说明 (SKILL.md)

Agent Browser Skill

Description

Enhanced browser automation for OpenClaw agents with advanced navigation, screenshot, and interaction capabilities.

When to Use

Use this skill when:

  • Automating web browsing tasks
  • Taking screenshots of web pages
  • Filling forms and clicking buttons
  • Extracting data from websites
  • Testing web applications
  • Navigating complex web flows

Prerequisites

  • OpenClaw browser tool must be enabled
  • Chrome or Chromium browser installed
  • Internet connection for web access

Examples

Basic Navigation

# Navigate to a website
openclaw browser open --url "https://example.com"

# Take a screenshot
openclaw browser snapshot --url "https://example.com" --output screenshot.png

Form Interaction

# Fill a form
openclaw browser act --url "https://forms.example.com" --kind fill --fields '{"name": "John", "email": "[email protected]"}'

# Click a button
openclaw browser act --url "https://example.com" --kind click --selector "button.submit"

Data Extraction

# Extract page content
openclaw browser snapshot --url "https://news.example.com" --maxChars 5000

# Monitor page changes
openclaw browser act --url "https://status.example.com" --kind wait --textGone "Loading..."

Integration with OpenClaw

This skill enhances the native OpenClaw browser tool with:

  1. Simplified commands - Easier syntax for common tasks
  2. Error handling - Better recovery from failures
  3. Performance optimization - Faster page loads and interactions
  4. Accessibility support - Better element detection

Safety Notes

  • Only automate public websites
  • Respect robots.txt and terms of service
  • Avoid excessive requests to prevent IP blocking
  • Use delays between actions to mimic human behavior

Troubleshooting

Common Issues

  1. Browser not starting: Check if Chrome is installed
  2. Element not found: Try different selectors or wait for page load
  3. Timeout errors: Increase timeout values for slow pages
  4. Permission denied: Ensure OpenClaw has necessary permissions

Debug Tips

# Enable verbose logging
openclaw browser open --url "https://example.com" --verbose

# Check browser status
openclaw browser status

References

安全使用建议
This skill appears to do what it claims: it builds shell commands that call the local OpenClaw browser CLI. Before installing or enabling it, consider: 1) The included script uses child_process.exec and concatenates user-supplied values into shell commands — if the agent is ever given untrusted input that becomes an argument, a malicious string could perform command injection. Prefer running this skill only with trusted inputs or replacing exec with a safer API (spawn/execFile with arg arrays) or sanitizing inputs. 2) The skill will drive a browser and therefore can access any content the browser can — avoid visiting pages with secrets or auto-submitting sensitive forms. 3) Verify the OpenClaw browser tool/CLI on your system is the legitimate implementation you expect. 4) If you will run the skill in a shared environment, sandbox it (network / filesystem restrictions) or review and modify scripts to harden argument handling. If you need, I can suggest concrete code changes to remove shell interpolation and use a safer child_process pattern.
功能分析
Type: OpenClaw Skill Name: agent-browser-skill Version: 1.0.0 The skill bundle contains a significant command injection vulnerability in `scripts/browser-automation.js`. The `executeCommand` function constructs shell commands by concatenating arguments into a string passed to `child_process.exec` without proper sanitization or escaping. While the script's stated purpose is browser automation, this flaw allows for arbitrary code execution if malicious input is provided to the command arguments. No evidence of intentional malice or data exfiltration was found.
能力评估
Purpose & Capability
Name, description, SKILL.md, and the included script all focus on driving the OpenClaw browser tool (open, snapshot, act, etc.). There are no unrelated environment variables, binaries, or config paths requested; the requested capabilities are proportional to a browser automation helper.
Instruction Scope
SKILL.md confines runtime instructions to calling the OpenClaw browser tool and gives examples, safety guidance, and troubleshooting steps. It does not instruct reading unrelated files, harvesting environment variables, or sending data to external endpoints outside the expected browser interactions.
Install Mechanism
There is no install spec (instruction-only). However, the package includes a JavaScript wrapper (scripts/browser-automation.js) which will be written to disk when the skill is installed. The script uses child_process.exec to run CLI commands; no remote downloads or extract steps are present.
Credentials
The skill declares no required environment variables, no primary credential, and no config paths. The code does not read environment variables. This matches the described purpose.
Persistence & Privilege
always is false and model invocation is allowed (platform default). The skill does not request permanent presence or attempt to modify other skills or system-wide configs.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install agent-browser-skill
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /agent-browser-skill 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.0
Agent Browser Skill 1.0.0 introduces advanced browser automation for OpenClaw agents. - Provides simplified commands for navigation, screenshots, form interactions, and data extraction. - Enhances error handling and performance for faster and more reliable web automation. - Adds accessibility improvements for better element detection. - Includes best practices, troubleshooting tips, and usage examples for common tasks. - Requires Chrome or Chromium and an active OpenClaw browser tool.
元数据
Slug agent-browser-skill
版本 1.0.0
许可证 MIT-0
累计安装 1
当前安装数 1
历史版本数 1
常见问题

Agent Browser Skill 是什么?

Automate web browsing with navigation, form filling, clicking, screenshots, data extraction, and testing using Chrome via OpenClaw browser tool. 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 383 次。

如何安装 Agent Browser Skill?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install agent-browser-skill」即可一键安装,无需额外配置。

Agent Browser Skill 是免费的吗?

是的,Agent Browser Skill 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。

Agent Browser Skill 支持哪些平台?

Agent Browser Skill 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 Agent Browser Skill?

由 baiyunrei2025(@baiyunrei2025)开发并维护,当前版本 v1.0.0。

💬 留言讨论