← Back to Skills Marketplace
312
Downloads
0
Stars
2
Active Installs
1
Versions
Install in OpenClaw
/install virtual-desktop-browser
Description
Launch Chromium in non-headless mode inside Xvfb virtual display (fixed 1200x720x24) and automate with human-like mouse/keyboard/screenshot operations. Use f...
Usage Guidance
This skill is coherent with its stated purpose but has real operational impact: it requires installing OS packages (apt-get) and pip dependencies and will spawn Xvfb and a real Chromium process and then simulate mouse/keyboard input. Before installing/running, consider: 1) install and run inside an isolated environment (container or dedicated VM) so simulated input and browser sessions cannot interact with sensitive local apps; 2) be aware the agent can type/click arbitrarily in that virtual browser—do not pass secrets or session tokens into pages the skill will visit; 3) confirm you are comfortable running apt-get/pip on the host; 4) you may want to review or sandbox the skill code (skill.py) and the templates/images you supply for matching. If you need higher assurance, test in a disposable VM first.
Capability Analysis
Type: OpenClaw Skill
Name: virtual-desktop-browser
Version: 1.0.0
The skill provides powerful GUI automation capabilities by running Chromium in a virtual Xvfb display and controlling it via PyAutoGUI. While the implementation in `skill.py` aligns with the stated purpose of automating bot-resistant websites, it introduces high-risk behaviors such as simulated keyboard/mouse input, screen capture (Base64 exfiltration potential), and window management. Notably, it launches Chromium with the `--no-sandbox` flag, which weakens browser security. These capabilities are risky and could be abused for unauthorized actions if the agent is misdirected, though no explicit malicious intent or backdoors were found.
Capability Assessment
Purpose & Capability
Name/description (virtual desktop browser to simulate human GUI actions) matches the included code and SKILL.md. The Python code spawns Xvfb and chromium-browser and uses PyAutoGUI/OpenCV for clicks, typing, screenshots and template matching — all expected for the stated purpose.
Instruction Scope
SKILL.md instructs installing system packages (apt-get) and pip dependencies, and the code writes a small state file under ~/.cache/virtual-desktop-browser/state.json. The install/run instructions and runtime operations (starting processes, moving mouse, typing, capturing screenshots) are within scope for GUI automation, but they do require OS-level package installs and will perform arbitrary input actions in the launched browser session (i.e., the agent can type/click any text or interact with pages).
Install Mechanism
No automated install spec is bundled (instruction-only install). SKILL.md recommends apt-get and pip commands — these are common for this functionality but require privileged/system changes and network access to package repositories. No downloads from unfamiliar URLs or extract operations are present in the bundle itself.
Credentials
The skill requests no environment variables, no credentials, and no external configuration paths beyond writing its own state in the user's home cache. That is proportionate to its stated function.
Persistence & Privilege
The skill does not request always:true and does not modify other skills or global agent configuration. It persists minimal state in ~/.cache/virtual-desktop-browser/state.json (display and PIDs) which is reasonable for lifecycle management.
How to Use
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat:
/install virtual-desktop-browser - After installation, invoke the skill by name or use
/virtual-desktop-browser - Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
- Initial release of Virtual Desktop Browser Skill.
- Launches Chromium in a non-headless Xvfb virtual display (1200x720x24).
- Provides human-like automation via PyAutoGUI: mouse, keyboard, screenshots.
- Designed for bot-resistant sites requiring GUI simulation (e.g., Xiaohongshu, X/Twitter).
- Includes tools for browser lifecycle, interaction, image search, and window focus.
- Multi-language documentation available.
Metadata
Frequently Asked Questions
What is Virtual Desktop Browser?
Launch Chromium in non-headless mode inside Xvfb virtual display (fixed 1200x720x24) and automate with human-like mouse/keyboard/screenshot operations. Use f... It is an AI Agent Skill for Claude Code / OpenClaw, with 312 downloads so far.
How do I install Virtual Desktop Browser?
Run "/install virtual-desktop-browser" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.
Is Virtual Desktop Browser free?
Yes, Virtual Desktop Browser is completely free, licensed under MIT-0. You can download, install and use it at no cost.
Which platforms does Virtual Desktop Browser support?
Virtual Desktop Browser is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).
Who created Virtual Desktop Browser?
It is built and maintained by Allen Niu (@nhzallen); the current version is v1.0.0.
More Skills