← Back to Skills Marketplace
0xcjl

browser-cdp

by Jialin · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ⚠ suspicious
275
Downloads
2
Stars
12
Active Installs
1
Versions
Install in OpenClaw
/install browser-cdp
Description
Real Chrome browser automation via CDP Proxy — access pages with full user login state, bypass anti-bot detection, perform interactive operations (click/fill...
README (SKILL.md)

What is browser-cdp?

browser-cdp connects directly to your local Chrome via Chrome DevTools Protocol (CDP), giving the AI agent:

  • Full login state — your cookies and sessions are carried through
  • Anti-bot bypass — pages that block static fetchers (search results, video platforms)
  • Interactive operations — click, fill forms, scroll, drag, file upload
  • Dynamic content extraction — read JavaScript-rendered DOM
  • Screenshots — capture any page at any point

Architecture

Chrome (remote-debugging-port=9222)
    ↓ CDP WebSocket
CDP Proxy (cdp-proxy.mjs) — HTTP API on localhost:3456
    ↓ HTTP REST
OpenClaw AI Agent

Setup

1. Start Chrome with debugging port

# macOS — must use full binary path (not `open -a`)
pkill -9 "Google Chrome"; sleep 2
"/Applications/Google Chrome.app/Contents/MacOS/Google Chrome" \
  --remote-debugging-port=9222 \
  --user-data-dir=/tmp/chrome-debug-profile \
  --no-first-run &

Verify:

curl -s http://127.0.0.1:9222/json/version

2. Start CDP Proxy

node ~/.openclaw/skills/browser-cdp/scripts/cdp-proxy.mjs &
sleep 3
curl -s http://localhost:3456/health
# {"status":"ok","connected":true,"sessions":0,"chromePort":9222}

API Reference

# List all tabs
curl -s http://localhost:3456/targets

# Open URL in new tab
curl -s "http://localhost:3456/new?url=https://example.com"

# Execute JavaScript
curl -s -X POST "http://localhost:3456/eval?target=TARGET_ID" \
  -d 'document.title'

# JS click (fast, preferred)
curl -s -X POST "http://localhost:3456/click?target=TARGET_ID" \
  -d 'button.submit'

# Real mouse click
curl -s -X POST "http://localhost:3456/clickAt?target=TARGET_ID" \
  -d '.upload-btn'

# Screenshot
curl -s "http://localhost:3456/screenshot?target=TARGET_ID&file=/tmp/shot.png"

# Scroll (lazy loading)
curl -s "http://localhost:3456/scroll?target=TARGET_ID&direction=bottom"

# Navigate
curl -s "http://localhost:3456/navigate?target=TARGET_ID&url=https://..."

# Close tab
curl -s "http://localhost:3456/close?target=TARGET_ID"

Tool Selection: Three-Layer Strategy

Scenario Use Reason
Public pages (GitHub, Wikipedia, blogs) agent-reach Fast, low token, structured
Search results (Bing/Google/YouTube) browser-cdp agent-reach blocked
Login-gated content browser-cdp No cookies in agent-reach
JS-rendered pages browser-cdp Reads rendered DOM
Simple automation, isolated screenshots agent-browser No Chrome setup
Large-scale parallel scraping agent-reach + parallel browser-cdp gets rate-limited

Decision flow:

Public content → agent-reach (fast, cheap)
Search results / blocked → browser-cdp
Still fails → agent-reach fallback + record in site-patterns

Known Limitations

  • Chrome must use a separate profile (/tmp/chrome-debug-profile)
  • Same-site parallel tabs may get rate-limited
  • Node.js 22+ required (native WebSocket)
  • macOS: use full binary path to start Chrome, not open -a

Site Patterns & Usage Log

~/.openclaw/skills/browser-cdp/references/site-patterns/   # per-domain experience
~/.openclaw/skills/browser-cdp/references/usage-log.md    # per-use tracking

Origin

Adapted from eze-is/web-access (MIT) for OpenClaw. A bug in the original (require() in ES module, reported here) is fixed in this version.

Usage Guidance
This skill does what it says: it attaches to your local Chrome to access pages with your cookies and perform clicks/eval/screenshots. That capability is powerful but sensitive. Before installing: 1) Ensure you run it only on trusted machines and use a dedicated Chrome profile (SKILL.md already recommends a separate user-data-dir). 2) Use Node.js 22+ as documented. 3) Confirm the proxy binds only to localhost (not 0.0.0.0) so other hosts cannot access the HTTP API. 4) Be aware that any use of this skill can read content you are logged into (email, social media, paywalled sites) and can execute arbitrary JS in those pages — limit its use and agent permissions accordingly. 5) The registry metadata should be updated to list Node and Chrome requirements; if you need stronger guarantees, review the full cdp-proxy.mjs to ensure there are no additional network exfiltration paths and consider running the proxy inside a confined environment or container.
Capability Analysis
Type: OpenClaw Skill Name: browser-cdp Version: 1.0.0 The skill bundle provides a CDP (Chrome DevTools Protocol) proxy that allows an AI agent to control a local Chrome instance, including access to logged-in user sessions and private data. While aligned with its stated purpose of browser automation, the `cdp-proxy.mjs` script contains high-risk capabilities such as an unvalidated arbitrary file write vulnerability via the `/screenshot?file=` endpoint and a `/setFiles` endpoint that allows the agent to upload local files to remote websites. The script also aggressively attempts to discover and connect to existing Chrome debugging ports across various system paths, which could grant the agent unauthorized access to the user's primary browser profile beyond the intended debug environment.
Capability Assessment
Purpose & Capability
The skill's code and SKILL.md match the declared purpose: it implements a local CDP→HTTP proxy to drive a Chrome instance (cookies, interactive ops, screenshots). However the registry metadata lists no required binaries while the instructions and code require Node.js 22+ and a locally-launched Chrome with --remote-debugging-port. The omission of those runtime requirements in the metadata is an incoherence that should be corrected.
Instruction Scope
The runtime instructions and the included cdp-proxy.mjs explicitly read browser state (DevToolsActivePort discovery), attach to browser pages, execute arbitrary JS in pages (eval endpoint) and can save screenshots to local file paths. That behaviour is necessary for the stated tasks (reading login-gated content, interacting with pages), but it gives the skill full access to any logged-in sites in the user's Chrome profile and lets an HTTP client trigger arbitrary page JS/eval. This is high-sensitivity access — verify you only run it on machines and profiles you trust and that local HTTP endpoints are not exposed to untrusted networks.
Install Mechanism
No install spec or remote downloads are present; the skill is instruction-first with a bundled JS script. That lowers supply-chain risk. The script uses only core Node APIs (with optional fallback to the 'ws' module).
Credentials
The skill declares no required env variables, and the code only optionally reads CDP_PROXY_PORT. However the practical requirement to access the user's Chrome profile (cookies/sessions) is implicit and very sensitive. Requesting access to the browser's debugging port and profile data is proportional to the feature (login-gated automation) but is privacy-critical — treat it like granting access to all logged-in accounts in that browser profile.
Persistence & Privilege
The skill does not request always:true and appears to run only when invoked. It writes/refs files under a per-skill directory (~/.openclaw/skills/browser-cdp/references), which is expected. One implementation detail to verify: the server likely listens on the configured port but the code sample does not force binding to localhost; if it binds to all interfaces it could expose the proxy to the LAN. Confirm server binding is restricted to localhost.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install browser-cdp
  3. After installation, invoke the skill by name or use /browser-cdp
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
Initial release
Metadata
Slug browser-cdp
Version 1.0.0
License MIT-0
All-time Installs 14
Active Installs 12
Total Versions 1
Frequently Asked Questions

What is browser-cdp?

Real Chrome browser automation via CDP Proxy — access pages with full user login state, bypass anti-bot detection, perform interactive operations (click/fill... It is an AI Agent Skill for Claude Code / OpenClaw, with 275 downloads so far.

How do I install browser-cdp?

Run "/install browser-cdp" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is browser-cdp free?

Yes, browser-cdp is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does browser-cdp support?

browser-cdp is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created browser-cdp?

It is built and maintained by Jialin (@0xcjl); the current version is v1.0.0.

💬 Comments