automation browser
Author: handongpu16 · v1.0.0 · MIT-0
469 total downloads · 0 favorites · 1 current install · 1 version
Install in OpenClaw:
/install automation-browser
Description
Controls QQ Browser's X5 kernel for web automation. Supports web navigation, element interaction, page scrolling, file/video downloading, and content extraction.
Usage (SKILL.md)
# QB X5 Use

Built on QQ Browser's X5 kernel, this skill provides comprehensive browser automation capabilities.

## Installation (one-time only)

Install QQ Browser and the x5use Python package.

```bash
bash skills/qb-x5-use/scripts/install_dep.sh
```

## Setup (run before each session)

Start the X5 background service on port 18009. Run this only after Installation has completed. If the service is already running, the script exits immediately without restarting it.

```bash
bash skills/qb-x5-use/scripts/setup.sh
```
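As a quick sanity check after setup, the sketch below tests whether anything is accepting TCP connections on port 18009. It assumes the service listens on localhost, and `is_port_open` is a hypothetical helper name, not part of the skill. An open port only shows that some process is listening there, not that it is the X5 service.

```python
import socket

X5_PORT = 18009  # port named in the Setup section


def is_port_open(port: int, host: str = "127.0.0.1", timeout: float = 1.0) -> bool:
    """Return True if something accepts TCP connections on host:port."""
    try:
        # create_connection raises OSError (refused, timed out, ...) on failure
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


print("Port 18009 accepting connections:", is_port_open(X5_PORT))
```

If this prints `False` right after `setup.sh`, the service may still be starting, so retrying after a short pause is reasonable.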

## Commands

### Navigation
```bash
python3 skills/qb-x5-use/scripts/go_to_url.py <url> # Navigate to URL
python3 skills/qb-x5-use/scripts/go_back.py # Go back
```

### Element interaction
```bash
python3 skills/qb-x5-use/scripts/click_element.py <index> [xpath] # Click element by index
python3 skills/qb-x5-use/scripts/input_text.py <index> <text> [xpath] # Fill input by index
python3 skills/qb-x5-use/scripts/get_dropdown_options.py <index> # Get dropdown options
python3 skills/qb-x5-use/scripts/select_dropdown_option.py <index> <text> # Select dropdown option
```

### Scrolling
```bash
python3 skills/qb-x5-use/scripts/scroll_down.py [amount] # Scroll down
python3 skills/qb-x5-use/scripts/scroll_up.py [amount] # Scroll up
python3 skills/qb-x5-use/scripts/scroll_to_text.py <text> # Scroll to text
python3 skills/qb-x5-use/scripts/scroll_to_top.py # Scroll to top
python3 skills/qb-x5-use/scripts/scroll_to_bottom.py # Scroll to bottom
```

### Download
```bash
python3 skills/qb-x5-use/scripts/download_file.py <index> # Download file by index
python3 skills/qb-x5-use/scripts/download_url.py <url> # Download file by URL
```

### Content
```bash
python3 skills/qb-x5-use/scripts/get_content.py # Get page content as Markdown
```

### Wait
```bash
python3 skills/qb-x5-use/scripts/wait.py [seconds] # Wait specified time
```

## Core workflow

1. **Navigate**: `go_to_url.py <url>`
2. **Read result**: Check the returned interactive elements, which carry index refs such as `[0]`, `[1]`
3. **Interact**: Use an index from the result to click, fill, select, etc.
4. **Re-read result**: After navigation or interaction, check the new interactive elements
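
The loop above can also be driven from Python. This is a minimal sketch, not part of the skill: `build_command` and `run_step` are hypothetical helpers, the working directory is assumed to contain `skills/`, and the page state is assumed to arrive on standard output.

```python
import subprocess
from pathlib import Path

SCRIPTS_DIR = Path("skills/qb-x5-use/scripts")  # path used throughout this document


def build_command(script: str, *args: object) -> list[str]:
    """Build the argv for one skill script, e.g. build_command("go_to_url", url)."""
    script_path = (SCRIPTS_DIR / f"{script}.py").as_posix()
    return ["python3", script_path, *map(str, args)]


def run_step(script: str, *args: object) -> str:
    """Run one script and return its stdout (assumed to carry the page state)."""
    result = subprocess.run(
        build_command(script, *args),
        capture_output=True,
        text=True,
        check=True,  # raise CalledProcessError on a non-zero exit code
    )
    return result.stdout
```

A session would then call `run_step("go_to_url", "https://www.baidu.com/")`, read the returned element list, and pass a chosen index to `run_step("click_element", index)`. Passing arguments as a list avoids shell quoting problems with text input.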

## Return value

Every command returns the current page state, including the action result and the interactive elements.

### Structure

**Action Result**
- Success or Failed status
- Target URL and Content-Type

**Page Content**

| Field | Description |
|-------|-------------|
| Previous page | Title and URL of the previous page |
| Action | Action name and parameters |
| Action Result | Execution result (e.g. `navigation triggered`) |
| Current page | Title and URL of the current page |
| Interactive elements | All interactive elements in the viewport, each formatted as `[index]<tag text/>` |

### Example output

Navigating to Baidu:

```bash
python3 skills/qb-x5-use/scripts/go_to_url.py https://www.baidu.com/
```

```
Action result: Success! Navigated to https://www.baidu.com/, The Content-Type of the url in response headers is 'text/html; charset=utf-8'

>>>>> Page Content
State of current webpage. NOTE that the following is one-time information!
[Start of state]
Previous page: 百度一下,你就知道 (https://www.baidu.com/)
Action: go_to_url ({"url":"https://www.baidu.com/"})
Action Result: navigation triggered.
Current page: [0] 百度一下,你就知道 (https://www.baidu.com/)
Interactive elements from top layer of current page inside the viewport: [Start of page]
[0]<a 新闻/>
[1]<a hao123/>
[2]<a 地图/>
[3]<a 贴吧/>
[5]<a 图片/>
[13]<textarea />
[29]<button 百度一下/>
[12]<a tj_login>登录/>
...
[End of page]
[End of state]
```
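A script that consumes this output will usually want the status and the labelled fields as data. The sketch below is one way to pull them out, assuming the line layout matches the example above. `parse_state` and the `current_url` key are hypothetical names, and the wording of a failed status line is not shown in this document, so anything not starting with `Success` is treated as a failure.

```python
import re

# Labels from the "Page Content" table. Matching is case-sensitive so that
# "Action result:" (status line) and "Action Result:" (state field) stay distinct.
FIELDS = ("Previous page", "Action", "Action Result", "Current page")
URL_AT_END = re.compile(r"\((https?://[^()\s]+)\)\s*$")


def parse_state(output: str) -> dict:
    """Extract the status flag and labelled fields from one command's output."""
    state = {}
    for raw in output.splitlines():
        line = raw.strip()
        if line.startswith("Action result:"):
            status = line.split(":", 1)[1].strip()
            state["success"] = status.startswith("Success")
            continue
        for label in FIELDS:
            prefix = label + ":"
            if line.startswith(prefix):
                state[label] = line[len(prefix):].strip()
                break
    # The URL is the parenthesised part at the end of the "Current page" field.
    match = URL_AT_END.search(state.get("Current page", ""))
    state["current_url"] = match.group(1) if match else None
    return state


SAMPLE = """\
Action result: Success! Navigated to https://www.baidu.com/
[Start of state]
Previous page: 百度一下,你就知道 (https://www.baidu.com/)
Action: go_to_url ({"url":"https://www.baidu.com/"})
Action Result: navigation triggered.
Current page: [0] 百度一下,你就知道 (https://www.baidu.com/)
[End of state]
"""

state = parse_state(SAMPLE)
print(state["success"], state["Action Result"], state["current_url"])
```

Because the output is flagged as one-time information, it is safer to parse each command's result immediately than to reuse an older state.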

### Interactive element format

Each element: `[index]<tag text/>`

| Part | Description | Example |
|------|-------------|---------|
| `[index]` | Element index for `click_element`, `input_text`, etc. | `[13]` |
| `<tag>` | HTML element type (`a`, `button`, `textarea`, `div`, `img`, `span`) | `<textarea>` |
| `text` | Display text (may be empty) | `百度一下` |
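
To avoid hard-coding indices, the element lines can be parsed and searched. The following is a sketch under the assumption that every element sits on its own line in the `[index]<tag text/>` form shown above. `parse_elements` and `find_index` are hypothetical helpers, not functions shipped with the skill.

```python
import re

# [index]<tag text/> where text may be empty, e.g. "[13]<textarea />"
ELEMENT_RE = re.compile(r"^\[(\d+)\]<([A-Za-z][A-Za-z0-9]*)\s*(.*?)\s*/>$")


def parse_elements(output: str) -> list[tuple[int, str, str]]:
    """Return (index, tag, text) for every line that looks like an element."""
    elements = []
    for line in output.splitlines():
        match = ELEMENT_RE.match(line.strip())
        if match:
            elements.append((int(match.group(1)), match.group(2), match.group(3)))
    return elements


def find_index(elements, tag: str, text: str = ""):
    """Index of the first element with this tag whose text contains `text`, else None."""
    for index, el_tag, el_text in elements:
        if el_tag == tag and text in el_text:
            return index
    return None


SAMPLE = """\
[Start of page]
[0]<a 新闻/>
[13]<textarea />
[29]<button 百度一下/>
...
[End of page]
"""

elements = parse_elements(SAMPLE)
print(find_index(elements, "textarea"), find_index(elements, "button", "百度一下"))
```

Looking elements up by tag and text on each fresh result is more robust than reusing an index from an earlier page state, since indices can change after navigation or interaction.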

## Example: Search on Baidu

```bash
# Navigate to Baidu
python3 skills/qb-x5-use/scripts/go_to_url.py https://www.baidu.com/
# Output shows: [13]<textarea />, [29]<button 百度一下/>

# Fill search box
python3 skills/qb-x5-use/scripts/input_text.py 13 "搜索词"

# Click search button
python3 skills/qb-x5-use/scripts/click_element.py 29

# Check result
python3 skills/qb-x5-use/scripts/get_content.py
```

## Example: Scroll and download

```bash
# Navigate to page
python3 skills/qb-x5-use/scripts/go_to_url.py https://example.com/files

# Scroll to find more content
python3 skills/qb-x5-use/scripts/scroll_down.py 500

# Download file by index from interactive elements
python3 skills/qb-x5-use/scripts/download_file.py 5

# Or download by direct URL
python3 skills/qb-x5-use/scripts/download_url.py https://example.com/file.pdf
```

## Troubleshooting
- If an element is not found, use the returned interactive elements list to find the correct index.
- If the page is not fully loaded, add a `wait.py` command after navigation.

Security recommendations
This skill appears to implement what it claims, but the installer downloads and installs native packages and a Python wheel, and starts a background service on your machine. Before you install:
1. Confirm the download URLs and package publisher (the script points at a QQ domain), and prefer signed releases or vendor instructions.
2. Run the installer in a VM or isolated environment first. The script may require root to run apt/dpkg/yum, writes to /usr/local, and creates a background process.
3. Be cautious about pip3 --break-system-packages, which can alter system-managed Python packages.
4. Understand that the x5use binary runs a local SSE server on port 18009; if other services expose that port, it could be reachable.
5. If you need stricter safety, avoid running the install script as root, review the x5use binary source or obtain it from a verifiable upstream, or prefer a sandbox/container deployment.
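Confirming a download can be partly automated when the publisher makes a SHA-256 digest available. The sketch below is an illustration under that assumption, not part of the skill: `sha256_of` and `verify_download` are hypothetical helper names, and the expected digest must come from a source you trust, since this page does not list one.

```python
import hashlib
import hmac


def sha256_of(path: str, chunk_size: int = 1 << 20) -> str:
    """Hash a file in chunks so large packages need not fit in memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(path: str, expected_hex: str) -> bool:
    """Compare the file's SHA-256 with a digest obtained from a trusted source."""
    return hmac.compare_digest(sha256_of(path), expected_hex.strip().lower())
```

A matching checksum only shows that the file is the one the publisher described; it says nothing about whether the package itself is safe to run.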
Functional analysis
Type: OpenClaw Skill
Name: automation-browser
Version: 1.0.0
The skill bundle performs high-risk system-level operations, including downloading and installing Debian/RPM packages and Python wheels from a remote Tencent domain (qq.com) using root privileges. It also starts a background network service (x5use-linux-mcp) on port 18009 to facilitate browser control. While these actions are consistent with the stated goal of browser automation via the X5 kernel, the execution of remote binaries and the requirement for system-wide installation (install_dep.sh, setup.sh) pose significant security risks without further verification of the external artifacts.
Capability assessment
Purpose & Capability
Name/description align with what the files do: Python wrappers call a local MCP (x5use) service to automate a browser. The required binary (python3) and provided scripts are coherent with browser automation.
Instruction Scope
SKILL.md clearly documents installing QQ Browser and the x5use service, starting the service, and running per-action scripts. The runtime instructions do not request unrelated files or credentials and confine actions to the local MCP service and browser automation tasks.
Install Mechanism
Installation uses an included shell script that downloads .deb/.rpm and a .whl from a remote URL and runs dpkg/apt/yum and pip3 (--break-system-packages). Although the download host appears to be a QQ domain, the pattern (curl + dpkg/yum + pip install of a wheel) is a higher-risk supply-chain operation because it installs native binaries and third-party Python code on the host.
Credentials
The skill requests no environment variables or unrelated credentials. It does require the ability to write system directories (creating /usr/local/qb_logs) and to install system packages, which is proportionate to installing a browser and service but does require elevated filesystem and package-manager access.
Persistence & Privilege
The skill starts a persistent background binary service (x5use-linux-mcp) that listens on localhost:18009 and creates a log directory under /usr/local. This is expected for local automation but increases the attack surface (a native binary running as a background process). 'always' is false and agent autonomy is default (no special gating).
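How to use
- Make sure OpenClaw is installed (locally or via Docker).
- Enter the install command in the chat box: /install automation-browser
- After installation, invoke the Skill by name or trigger it with /automation-browser
- Provide the inputs described in the Skill's parameter documentation to get structured output.
Version history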
v1.0.0
- Initial release of automation_browser skill for web automation using QQ Browser's X5 kernel.
- Supports navigation, element interaction (click, input, dropdown), page scrolling, and file/video download.
- Provides get_content to extract page data as Markdown and lists interactive elements for scripted actions.
- Includes setup and installation scripts for smooth environment initialization.
- Each script command returns detailed page state and interactive elements for further automation.
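FAQ
What is automation browser?
It controls QQ Browser's X5 kernel for web automation, supporting web navigation, element interaction, page scrolling, file/video downloading, and content extraction. It is an AI Agent Skill plugin for Claude Code / OpenClaw, with 469 total downloads to date.
How do I install automation browser?
Run the command "/install automation-browser" in the OpenClaw or Claude Code chat box to install it in one step, with no extra configuration.
Is automation browser free?
Yes. automation browser is completely free and is released under the MIT-0 license, so it can be downloaded, installed, and used freely.
Which platforms does automation browser support?
automation browser is cross-platform and can be used in any environment where OpenClaw / Claude Code is deployed.
Who developed automation browser?
It is developed and maintained by handongpu16 (@handongpu16); the current version is v1.0.0.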