← 返回 Skills 市场
312
总下载
0
收藏
2
当前安装
1
版本数
在 OpenClaw 中安装
/install virtual-desktop-browser
功能描述
Launch Chromium in non-headless mode inside Xvfb virtual display (fixed 1200x720x24) and automate with human-like mouse/keyboard/screenshot operations. Use f...
安全使用建议
This skill is coherent with its stated purpose but has real operational impact: it requires installing OS packages (apt-get) and pip dependencies and will spawn Xvfb and a real Chromium process and then simulate mouse/keyboard input. Before installing/running, consider: 1) install and run inside an isolated environment (container or dedicated VM) so simulated input and browser sessions cannot interact with sensitive local apps; 2) be aware the agent can type/click arbitrarily in that virtual browser—do not pass secrets or session tokens into pages the skill will visit; 3) confirm you are comfortable running apt-get/pip on the host; 4) you may want to review or sandbox the skill code (skill.py) and the templates/images you supply for matching. If you need higher assurance, test in a disposable VM first.
功能分析
Type: OpenClaw Skill
Name: virtual-desktop-browser
Version: 1.0.0
The skill provides powerful GUI automation capabilities by running Chromium in a virtual Xvfb display and controlling it via PyAutoGUI. While the implementation in `skill.py` aligns with the stated purpose of automating bot-resistant websites, it introduces high-risk behaviors such as simulated keyboard/mouse input, screen capture (Base64 exfiltration potential), and window management. Notably, it launches Chromium with the `--no-sandbox` flag, which weakens browser security. These capabilities are risky and could be abused for unauthorized actions if the agent is misdirected, though no explicit malicious intent or backdoors were found.
能力评估
Purpose & Capability
Name/description (virtual desktop browser to simulate human GUI actions) matches the included code and SKILL.md. The Python code spawns Xvfb and chromium-browser and uses PyAutoGUI/OpenCV for clicks, typing, screenshots and template matching — all expected for the stated purpose.
Instruction Scope
SKILL.md instructs installing system packages (apt-get) and pip dependencies, and the code writes a small state file under ~/.cache/virtual-desktop-browser/state.json. The install/run instructions and runtime operations (starting processes, moving mouse, typing, capturing screenshots) are within scope for GUI automation, but they do require OS-level package installs and will perform arbitrary input actions in the launched browser session (i.e., the agent can type/click any text or interact with pages).
Install Mechanism
No automated install spec is bundled (instruction-only install). SKILL.md recommends apt-get and pip commands — these are common for this functionality but require privileged/system changes and network access to package repositories. No downloads from unfamiliar URLs or extract operations are present in the bundle itself.
Credentials
The skill requests no environment variables, no credentials, and no external configuration paths beyond writing its own state in the user's home cache. That is proportionate to its stated function.
Persistence & Privilege
The skill does not request always:true and does not modify other skills or global agent configuration. It persists minimal state in ~/.cache/virtual-desktop-browser/state.json (display and PIDs) which is reasonable for lifecycle management.
如何使用
- 确保已安装 OpenClaw(本地或 Docker 部署)
- 在对话框中输入安装命令:
/install virtual-desktop-browser - 安装完成后,直接呼叫该 Skill 的名称或使用
/virtual-desktop-browser触发 - 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.0
- Initial release of Virtual Desktop Browser Skill.
- Launches Chromium in a non-headless Xvfb virtual display (1200x720x24).
- Provides human-like automation via PyAutoGUI: mouse, keyboard, screenshots.
- Designed for bot-resistant sites requiring GUI simulation (e.g., Xiaohongshu, X/Twitter).
- Includes tools for browser lifecycle, interaction, image search, and window focus.
- Multi-language documentation available.
元数据
常见问题
Virtual Desktop Browser 是什么?
Launch Chromium in non-headless mode inside Xvfb virtual display (fixed 1200x720x24) and automate with human-like mouse/keyboard/screenshot operations. Use f... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 312 次。
如何安装 Virtual Desktop Browser?
在 OpenClaw 或 Claude Code 对话框中运行命令「/install virtual-desktop-browser」即可一键安装,无需额外配置。
Virtual Desktop Browser 是免费的吗?
是的,Virtual Desktop Browser 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。
Virtual Desktop Browser 支持哪些平台?
Virtual Desktop Browser 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。
谁开发了 Virtual Desktop Browser?
由 Allen Niu(@nhzallen)开发并维护,当前版本 v1.0.0。
推荐 Skills