← 返回 Skills 市场
Selenium Browser
作者
Andrei Bespalov
· GitHub ↗
· v1.0.0
667
总下载
0
收藏
8
当前安装
1
版本数
在 OpenClaw 中安装
/install selenium-browser
功能描述
Start a Selenium‑controlled Chrome browser, open a URL, take a screenshot, and report progress. Supports headless mode and optional proxy.
安全使用建议
Do not install or run this skill as-is. Key concerns: (1) The SKILL.md, the script embedded in SKILL.md, and the actual scripts on disk disagree: one version takes and reports a screenshot, another never takes a screenshot and loops forever. (2) SKILL.md says the skill returns JSON, but the embedded script prints a Python dict (not serialized JSON) and the on-disk script prints nothing — the agent will likely fail to parse results. (3) SKILL.md promises a hardcoded save path (/home/main/clawd/diffusion_pdfs/) that isn't implemented; confirm where files will be written and make that path configurable. Actions you should take before using this skill: inspect and reconcile the actual scripts (which file will be executed), require the executable to output valid JSON, add a timeout or controlled termination instead of an infinite loop, ensure screenshots are written to a safe, configurable directory, run the skill in an isolated container or sandbox with limited permissions, and test with non-sensitive URLs. Also prefer a publisher-provided homepage or source repository (currently missing) and ask the author to fix the inconsistencies and provide a clear termination/protocol for the process. If you cannot validate and fix these issues, treat the skill as untrusted.
功能分析
Type: OpenClaw Skill
Name: selenium-browser
Version: 1.0.0
The skill is classified as suspicious due to a significant functional discrepancy between its documentation (SKILL.md) and its actual implementation (scripts/launch_browser.py). While SKILL.md describes taking a screenshot and reporting its path, the Python script instead enters an infinite loop (`while True: time.sleep(1)`) after navigation, keeping the browser alive indefinitely without taking a screenshot or producing the expected JSON output. This leads to unexpected behavior and potential uncontrolled resource consumption. Additionally, the `webdriver.Chrome` constructor in `scripts/launch_browser.py` incorrectly passes `executable_path=chrome_bin` (the browser binary) when a `Service` object (already configured with `chromedriver_path`) is also provided, indicating a bug that could prevent the skill from functioning correctly.
能力评估
Purpose & Capability
The declared purpose (launch Chrome via Selenium, capture a screenshot, return the saved path) is reasonable. However the SKILL.md and the actual scripts disagree: SKILL.md includes a script that saves a screenshot and prints a JSON-like dict, but the file manifest's launch_browser.py opens the page and then loops forever without taking or saving a screenshot. SKILL.md also promises saving to /home/main/clawd/diffusion_pdfs/, which is not used by the bundled scripts. These inconsistencies indicate the package is not internally coherent with its stated purpose.
Instruction Scope
SKILL.md instructs the agent to run the Python script and parse stdout as JSON. The embedded script in SKILL.md prints a Python dict (not serialized JSON) and the actual file on disk does not print or return the expected JSON and instead blocks in an infinite loop. The mismatch could cause the agent to hang, mis-parse output, or never receive a screenshot path. The instructions also reference a fixed save path in SKILL.md that is not present in the real script.
Install Mechanism
There is no automated install spec—this is instruction-only with bundled scripts and a recommendation to pip install selenium. No remote downloads or archive extractions are present in the package itself, so install risk is low, assuming the operator runs the recommended pip command from a trusted environment.
Credentials
The skill requests no credentials and only optionally uses CHROME_BIN and CHROMEDRIVER_PATH env vars (reasonable). It supports an optional proxy. However SKILL.md's hardcoded save path (/home/main/clawd/diffusion_pdfs/) is unexpected and unrelated to the skill's basic function; the bundled scripts do not reliably respect or expose a configurable, safe output path. That fixed-path claim is disproportionate and should be clarified or removed.
Persistence & Privilege
The registry flags are normal (not always:true). But the bundled script in the manifest enters an indefinite while-true sleep loop, relying on an external KeyboardInterrupt or 'terminate' to exit; that behavior can leave the agent waiting and consume resources. This persistence is not expressed in SKILL.md and should be addressed (timeouts or explicit termination protocol).
如何使用
- 确保已安装 OpenClaw(本地或 Docker 部署)
- 在对话框中输入安装命令:
/install selenium-browser - 安装完成后,直接呼叫该 Skill 的名称或使用
/selenium-browser触发 - 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.0
Initial release of selenium-browser skill.
- Launches a Selenium-controlled Chrome (or Chromium) browser.
- Opens a specified URL, takes a screenshot, and saves it.
- Supports headless mode and optional proxy configuration.
- Reports screenshot location or errors back to the chat.
- Includes user instructions, script examples, and troubleshooting notes.
元数据
常见问题
Selenium Browser 是什么?
Start a Selenium‑controlled Chrome browser, open a URL, take a screenshot, and report progress. Supports headless mode and optional proxy. 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 667 次。
如何安装 Selenium Browser?
在 OpenClaw 或 Claude Code 对话框中运行命令「/install selenium-browser」即可一键安装,无需额外配置。
Selenium Browser 是免费的吗?
是的,Selenium Browser 完全免费(开源免费),可自由下载、安装和使用。
Selenium Browser 支持哪些平台?
Selenium Browser 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。
谁开发了 Selenium Browser?
由 Andrei Bespalov(@andreybespalov89)开发并维护,当前版本 v1.0.0。
推荐 Skills