← Back to Skills Marketplace
635
Downloads
0
Stars
2
Active Installs
1
Versions
Install in OpenClaw
/install browser-automation-ctrl
Description
通过 Selenium 自动化控制浏览器,支持网页打开、元素操作、标签页管理、截图、JS 执行及代理设置等功能。
README (SKILL.md)
Browser Control Skill
浏览器控制工具 - 通过 Selenium 实现浏览器自动化控制。
功能列表 (35+ 命令)
基础操作
| 命令 | 说明 |
|---|---|
open \x3Curl> |
打开网页 |
screenshot |
网页截图 |
screenshotb64 |
网页截图(base64) |
fullscreen |
全页面截图 |
js \x3Cscript> |
执行 JavaScript |
cookies |
获取 Cookies |
setcookie \x3Cname> \x3Cvalue> |
设置 Cookie |
clearcookies |
清除 Cookies |
source |
获取页面源码 |
title |
获取标题 |
url |
获取当前 URL |
元素操作
| 命令 | 说明 |
|---|---|
find \x3Cselector> |
查找元素 |
click \x3Cselector> |
点击元素 |
fill \x3Cselector> \x3Ctext> |
填写输入框 |
submit \x3Cselector> |
提交表单 |
hover \x3Cselector> |
悬停 |
attr \x3Cselector> \x3Cname> |
获取元素属性 |
导航操作
| 命令 | 说明 |
|---|---|
back |
后退 |
forward |
前进 |
refresh |
刷新 |
wait \x3Cseconds> |
等待 |
scroll [x] [y] |
滚动页面 |
scrollto \x3Cselector> |
滚动到元素 |
waitelem \x3Cselector> |
等待元素出现 |
标签页操作
| 命令 | 说明 |
|---|---|
newtab \x3Curl> |
新建标签页 |
switchtab \x3Cindex> |
切换标签页 |
closetab |
关闭标签页 |
tabs |
获取所有标签页 |
高级功能
| 命令 | 说明 |
|---|---|
links |
获取所有链接 |
images |
获取所有图片 |
size |
获取页面尺寸 |
ua \x3Cuser-agent> |
设置 User-Agent |
proxy \x3Caddress> |
设置代理 |
依赖安装
pip install selenium
需要 Chrome 浏览器和 ChromeDriver。
使用示例
# 打开网页并截图
python scripts/browser_ctrl.py open https://www.baidu.com
python scripts/browser_ctrl.py screenshot
# 查找元素并填写
python scripts/browser_ctrl.py find "#kw"
python scripts/browser_ctrl.py fill "#kw" "搜索内容"
python scripts/browser_ctrl.py click "#su"
# 获取页面所有链接
python scripts/browser_ctrl.py links
# 设置代理
python scripts/browser_ctrl.py proxy "http://127.0.0.1:8080"
# 设置 User-Agent
python scripts/browser_ctrl.py ua "Mozilla/5.0..."
触发关键词
- "打开网页"、"浏览器"
- "网页截图"
- "执行 JS"
- "点击"、"填写"
- "前进"、"后退"
- "获取链接"、"获取图片"
- "设置代理"
注意事项
- 需要安装 Chrome 浏览器
- 需要 chromedriver 与 Chrome 版本匹配
- 无头模式运行(不显示浏览器窗口)
- 截图保存在 ~/Pictures/OpenClaw/
- 支持 CSS 选择器和 XPath
Usage Guidance
This package appears to be a straightforward Selenium browser automation helper. Before installing: (1) ensure you want a tool that can open arbitrary URLs and save screenshots to ~/Pictures/OpenClaw (it will create that directory); (2) be aware it requires a matching Chrome + chromedriver and 'pip install selenium'; (3) avoid giving it untrusted URLs or sensitive credentials — automated browsing can load malicious pages or capture sensitive on‑screen content in screenshots; (4) if you allow autonomous agent invocation, the agent could run browser actions without further prompts, so restrict usage or keep invocation manual if you want tighter control.
Capability Analysis
Type: OpenClaw Skill
Name: browser-automation-ctrl
Version: 1.0.0
The skill bundle provides powerful browser automation capabilities via Selenium, including arbitrary JavaScript execution (`js` command), session cookie extraction (`cookies`), and proxy configuration (`proxy`). While these features are aligned with the stated purpose in `SKILL.md`, they represent a high-risk capability set that could be easily leveraged for data exfiltration or session hijacking if the agent is manipulated. The script `scripts/browser_ctrl.py` includes anti-detection measures (CDP commands to hide the webdriver attribute) and allows the agent to capture and return full page sources and cookies to the caller, which are significant security risks in an agentic environment.
Capability Assessment
Purpose & Capability
Name, description, SKILL.md and the included Python script all describe Selenium-based browser automation (open pages, click, fill, screenshot, proxy, UA). There are no unrelated dependencies, credentials, or config paths requested.
Instruction Scope
Runtime instructions and code focus on browser automation. The SKILL.md tells users to install selenium and run the script; the script implements the documented commands. It does create screenshots under ~/Pictures/OpenClaw and requires Chrome/Chromedriver, which is consistent with its purpose.
Install Mechanism
No install spec is provided (instruction-only), and SKILL.md recommends 'pip install selenium' and installing Chrome/Chromedriver. No remote downloads or obscure install URLs are present in the package.
Credentials
The skill requests no environment variables, no credentials, and no config paths. This is proportional to a local browser automation utility.
Persistence & Privilege
always is false and the skill is user-invocable. It does not request persistent system-wide changes or modify other skills’ configs. Note: disable-model-invocation is false (agent may call it autonomously), which is the platform default.
How to Use
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat:
/install browser-automation-ctrl - After installation, invoke the skill by name or use
/browser-automation-ctrl - Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
- Major update: Rebuilt skill to provide browser control via Selenium automation instead of remote VNC/Google OAuth access.
- Simplified implementation to a single Python script (browser_ctrl.py); removed all shell script management and OAuth setup.
- Added 35+ browser automation commands for navigation, element interaction, screenshots, cookies, tabs, advanced features, and more.
- Updated documentation to list all available commands, usage examples, setup (Python, Selenium, chromedriver), and trigger phrases.
- No longer relies on external tunnel or user’s Google authentication; actions are run directly via local headless Chrome.
Metadata
Frequently Asked Questions
What is Selenium Browser Control?
通过 Selenium 自动化控制浏览器,支持网页打开、元素操作、标签页管理、截图、JS 执行及代理设置等功能。 It is an AI Agent Skill for Claude Code / OpenClaw, with 635 downloads so far.
How do I install Selenium Browser Control?
Run "/install browser-automation-ctrl" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.
Is Selenium Browser Control free?
Yes, Selenium Browser Control is completely free, licensed under MIT-0. You can download, install and use it at no cost.
Which platforms does Selenium Browser Control support?
Selenium Browser Control is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).
Who created Selenium Browser Control?
It is built and maintained by 楊小虎 (@tr0812); the current version is v1.0.0.
More Skills