← 返回 Skills 市场
tr0812

Selenium Browser Control

作者 楊小虎 · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ⚠ suspicious
635
总下载
0
收藏
2
当前安装
1
版本数
在 OpenClaw 中安装
/install browser-automation-ctrl
功能描述
通过 Selenium 自动化控制浏览器,支持网页打开、元素操作、标签页管理、截图、JS 执行及代理设置等功能。
使用说明 (SKILL.md)

Browser Control Skill

浏览器控制工具 - 通过 Selenium 实现浏览器自动化控制。

功能列表 (35+ 命令)

基础操作

命令 说明
open \x3Curl> 打开网页
screenshot 网页截图
screenshotb64 网页截图(base64)
fullscreen 全页面截图
js \x3Cscript> 执行 JavaScript
cookies 获取 Cookies
setcookie \x3Cname> \x3Cvalue> 设置 Cookie
clearcookies 清除 Cookies
source 获取页面源码
title 获取标题
url 获取当前 URL

元素操作

命令 说明
find \x3Cselector> 查找元素
click \x3Cselector> 点击元素
fill \x3Cselector> \x3Ctext> 填写输入框
submit \x3Cselector> 提交表单
hover \x3Cselector> 悬停
attr \x3Cselector> \x3Cname> 获取元素属性

导航操作

命令 说明
back 后退
forward 前进
refresh 刷新
wait \x3Cseconds> 等待
scroll [x] [y] 滚动页面
scrollto \x3Cselector> 滚动到元素
waitelem \x3Cselector> 等待元素出现

标签页操作

命令 说明
newtab \x3Curl> 新建标签页
switchtab \x3Cindex> 切换标签页
closetab 关闭标签页
tabs 获取所有标签页

高级功能

命令 说明
links 获取所有链接
images 获取所有图片
size 获取页面尺寸
ua \x3Cuser-agent> 设置 User-Agent
proxy \x3Caddress> 设置代理

依赖安装

pip install selenium

需要 Chrome 浏览器和 ChromeDriver。

使用示例

# 打开网页并截图
python scripts/browser_ctrl.py open https://www.baidu.com
python scripts/browser_ctrl.py screenshot

# 查找元素并填写
python scripts/browser_ctrl.py find "#kw"
python scripts/browser_ctrl.py fill "#kw" "搜索内容"
python scripts/browser_ctrl.py click "#su"

# 获取页面所有链接
python scripts/browser_ctrl.py links

# 设置代理
python scripts/browser_ctrl.py proxy "http://127.0.0.1:8080"

# 设置 User-Agent
python scripts/browser_ctrl.py ua "Mozilla/5.0..."

触发关键词

  • "打开网页"、"浏览器"
  • "网页截图"
  • "执行 JS"
  • "点击"、"填写"
  • "前进"、"后退"
  • "获取链接"、"获取图片"
  • "设置代理"

注意事项

  • 需要安装 Chrome 浏览器
  • 需要 chromedriver 与 Chrome 版本匹配
  • 无头模式运行(不显示浏览器窗口)
  • 截图保存在 ~/Pictures/OpenClaw/
  • 支持 CSS 选择器和 XPath
安全使用建议
This package appears to be a straightforward Selenium browser automation helper. Before installing: (1) ensure you want a tool that can open arbitrary URLs and save screenshots to ~/Pictures/OpenClaw (it will create that directory); (2) be aware it requires a matching Chrome + chromedriver and 'pip install selenium'; (3) avoid giving it untrusted URLs or sensitive credentials — automated browsing can load malicious pages or capture sensitive on‑screen content in screenshots; (4) if you allow autonomous agent invocation, the agent could run browser actions without further prompts, so restrict usage or keep invocation manual if you want tighter control.
功能分析
Type: OpenClaw Skill Name: browser-automation-ctrl Version: 1.0.0 The skill bundle provides powerful browser automation capabilities via Selenium, including arbitrary JavaScript execution (`js` command), session cookie extraction (`cookies`), and proxy configuration (`proxy`). While these features are aligned with the stated purpose in `SKILL.md`, they represent a high-risk capability set that could be easily leveraged for data exfiltration or session hijacking if the agent is manipulated. The script `scripts/browser_ctrl.py` includes anti-detection measures (CDP commands to hide the webdriver attribute) and allows the agent to capture and return full page sources and cookies to the caller, which are significant security risks in an agentic environment.
能力评估
Purpose & Capability
Name, description, SKILL.md and the included Python script all describe Selenium-based browser automation (open pages, click, fill, screenshot, proxy, UA). There are no unrelated dependencies, credentials, or config paths requested.
Instruction Scope
Runtime instructions and code focus on browser automation. The SKILL.md tells users to install selenium and run the script; the script implements the documented commands. It does create screenshots under ~/Pictures/OpenClaw and requires Chrome/Chromedriver, which is consistent with its purpose.
Install Mechanism
No install spec is provided (instruction-only), and SKILL.md recommends 'pip install selenium' and installing Chrome/Chromedriver. No remote downloads or obscure install URLs are present in the package.
Credentials
The skill requests no environment variables, no credentials, and no config paths. This is proportional to a local browser automation utility.
Persistence & Privilege
always is false and the skill is user-invocable. It does not request persistent system-wide changes or modify other skills’ configs. Note: disable-model-invocation is false (agent may call it autonomously), which is the platform default.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install browser-automation-ctrl
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /browser-automation-ctrl 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.0
- Major update: Rebuilt skill to provide browser control via Selenium automation instead of remote VNC/Google OAuth access. - Simplified implementation to a single Python script (browser_ctrl.py); removed all shell script management and OAuth setup. - Added 35+ browser automation commands for navigation, element interaction, screenshots, cookies, tabs, advanced features, and more. - Updated documentation to list all available commands, usage examples, setup (Python, Selenium, chromedriver), and trigger phrases. - No longer relies on external tunnel or user’s Google authentication; actions are run directly via local headless Chrome.
元数据
Slug browser-automation-ctrl
版本 1.0.0
许可证 MIT-0
累计安装 2
当前安装数 2
历史版本数 1
常见问题

Selenium Browser Control 是什么?

通过 Selenium 自动化控制浏览器,支持网页打开、元素操作、标签页管理、截图、JS 执行及代理设置等功能。 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 635 次。

如何安装 Selenium Browser Control?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install browser-automation-ctrl」即可一键安装,无需额外配置。

Selenium Browser Control 是免费的吗?

是的,Selenium Browser Control 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。

Selenium Browser Control 支持哪些平台?

Selenium Browser Control 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 Selenium Browser Control?

由 楊小虎(@tr0812)开发并维护,当前版本 v1.0.0。

💬 留言讨论