browser-cdp

Name: browser-cdp
Author: 0xcjl

Description

Real Chrome browser automation via CDP Proxy — access pages with full user login state, bypass anti-bot detection, perform interactive operations (click/fill...

README (SKILL.md)

What is browser-cdp?

browser-cdp connects directly to your local Chrome via Chrome DevTools Protocol (CDP), giving the AI agent:

Full login state — your cookies and sessions are carried through
Anti-bot bypass — pages that block static fetchers (search results, video platforms)
Interactive operations — click, fill forms, scroll, drag, file upload
Dynamic content extraction — read JavaScript-rendered DOM
Screenshots — capture any page at any point

Architecture

Chrome (remote-debugging-port=9222)
    ↓ CDP WebSocket
CDP Proxy (cdp-proxy.mjs) — HTTP API on localhost:3456
    ↓ HTTP REST
OpenClaw AI Agent

Setup

1. Start Chrome with debugging port

# macOS — must use full binary path (not `open -a`)
pkill -9 "Google Chrome"; sleep 2
"/Applications/Google Chrome.app/Contents/MacOS/Google Chrome" \
  --remote-debugging-port=9222 \
  --user-data-dir=/tmp/chrome-debug-profile \
  --no-first-run &

Verify:

curl -s http://127.0.0.1:9222/json/version

2. Start CDP Proxy

node ~/.openclaw/skills/browser-cdp/scripts/cdp-proxy.mjs &
sleep 3
curl -s http://localhost:3456/health
# {"status":"ok","connected":true,"sessions":0,"chromePort":9222}

API Reference

# List all tabs
curl -s http://localhost:3456/targets

# Open URL in new tab
curl -s "http://localhost:3456/new?url=https://example.com"

# Execute JavaScript
curl -s -X POST "http://localhost:3456/eval?target=TARGET_ID" \
  -d 'document.title'

# JS click (fast, preferred)
curl -s -X POST "http://localhost:3456/click?target=TARGET_ID" \
  -d 'button.submit'

# Real mouse click
curl -s -X POST "http://localhost:3456/clickAt?target=TARGET_ID" \
  -d '.upload-btn'

# Screenshot
curl -s "http://localhost:3456/screenshot?target=TARGET_ID&file=/tmp/shot.png"

# Scroll (lazy loading)
curl -s "http://localhost:3456/scroll?target=TARGET_ID&direction=bottom"

# Navigate
curl -s "http://localhost:3456/navigate?target=TARGET_ID&url=https://..."

# Close tab
curl -s "http://localhost:3456/close?target=TARGET_ID"

Tool Selection: Three-Layer Strategy

Scenario	Use	Reason
Public pages (GitHub, Wikipedia, blogs)	`agent-reach`	Fast, low token, structured
Search results (Bing/Google/YouTube)	`browser-cdp`	agent-reach blocked
Login-gated content	`browser-cdp`	No cookies in agent-reach
JS-rendered pages	`browser-cdp`	Reads rendered DOM
Simple automation, isolated screenshots	`agent-browser`	No Chrome setup
Large-scale parallel scraping	`agent-reach` + parallel	browser-cdp gets rate-limited

Decision flow:

Public content → agent-reach (fast, cheap)
Search results / blocked → browser-cdp
Still fails → agent-reach fallback + record in site-patterns

Known Limitations

Chrome must use a separate profile (/tmp/chrome-debug-profile)
Same-site parallel tabs may get rate-limited
Node.js 22+ required (native WebSocket)
macOS: use full binary path to start Chrome, not open -a

Site Patterns & Usage Log

~/.openclaw/skills/browser-cdp/references/site-patterns/   # per-domain experience
~/.openclaw/skills/browser-cdp/references/usage-log.md    # per-use tracking

Origin

Adapted from eze-is/web-access (MIT) for OpenClaw. A bug in the original (require() in ES module, reported here) is fixed in this version.

Usage Guidance

This skill does what it says: it attaches to your local Chrome to access pages with your cookies and perform clicks/eval/screenshots. That capability is powerful but sensitive. Before installing: 1) Ensure you run it only on trusted machines and use a dedicated Chrome profile (SKILL.md already recommends a separate user-data-dir). 2) Use Node.js 22+ as documented. 3) Confirm the proxy binds only to localhost (not 0.0.0.0) so other hosts cannot access the HTTP API. 4) Be aware that any use of this skill can read content you are logged into (email, social media, paywalled sites) and can execute arbitrary JS in those pages — limit its use and agent permissions accordingly. 5) The registry metadata should be updated to list Node and Chrome requirements; if you need stronger guarantees, review the full cdp-proxy.mjs to ensure there are no additional network exfiltration paths and consider running the proxy inside a confined environment or container.

Capability Analysis

Type: OpenClaw Skill Name: browser-cdp Version: 1.0.0 The skill bundle provides a CDP (Chrome DevTools Protocol) proxy that allows an AI agent to control a local Chrome instance, including access to logged-in user sessions and private data. While aligned with its stated purpose of browser automation, the `cdp-proxy.mjs` script contains high-risk capabilities such as an unvalidated arbitrary file write vulnerability via the `/screenshot?file=` endpoint and a `/setFiles` endpoint that allows the agent to upload local files to remote websites. The script also aggressively attempts to discover and connect to existing Chrome debugging ports across various system paths, which could grant the agent unauthorized access to the user's primary browser profile beyond the intended debug environment.

Capability Assessment

ℹ Purpose & Capability

The skill's code and SKILL.md match the declared purpose: it implements a local CDP→HTTP proxy to drive a Chrome instance (cookies, interactive ops, screenshots). However the registry metadata lists no required binaries while the instructions and code require Node.js 22+ and a locally-launched Chrome with --remote-debugging-port. The omission of those runtime requirements in the metadata is an incoherence that should be corrected.

⚠ Instruction Scope

The runtime instructions and the included cdp-proxy.mjs explicitly read browser state (DevToolsActivePort discovery), attach to browser pages, execute arbitrary JS in pages (eval endpoint) and can save screenshots to local file paths. That behaviour is necessary for the stated tasks (reading login-gated content, interacting with pages), but it gives the skill full access to any logged-in sites in the user's Chrome profile and lets an HTTP client trigger arbitrary page JS/eval. This is high-sensitivity access — verify you only run it on machines and profiles you trust and that local HTTP endpoints are not exposed to untrusted networks.

✓ Install Mechanism

No install spec or remote downloads are present; the skill is instruction-first with a bundled JS script. That lowers supply-chain risk. The script uses only core Node APIs (with optional fallback to the 'ws' module).

ℹ Credentials

The skill declares no required env variables, and the code only optionally reads CDP_PROXY_PORT. However the practical requirement to access the user's Chrome profile (cookies/sessions) is implicit and very sensitive. Requesting access to the browser's debugging port and profile data is proportional to the feature (login-gated automation) but is privacy-critical — treat it like granting access to all logged-in accounts in that browser profile.

ℹ Persistence & Privilege

The skill does not request always:true and appears to run only when invoked. It writes/refs files under a per-skill directory (~/.openclaw/skills/browser-cdp/references), which is expected. One implementation detail to verify: the server likely listens on the configured port but the code sample does not force binding to localhost; if it binds to all interfaces it could expose the proxy to the LAN. Confirm server binding is restricted to localhost.

Version History

v1.0.0

Initial release

Metadata

Slug browser-cdp

Version 1.0.0

License MIT-0

All-time Installs 14

Active Installs 12

Total Versions 1

Frequently Asked Questions

What is browser-cdp?

Real Chrome browser automation via CDP Proxy — access pages with full user login state, bypass anti-bot detection, perform interactive operations (click/fill... It is an AI Agent Skill for Claude Code / OpenClaw, with 275 downloads so far.

How do I install browser-cdp?

Run "/install browser-cdp" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is browser-cdp free?

Yes, browser-cdp is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does browser-cdp support?

browser-cdp is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created browser-cdp?

It is built and maintained by Jialin (@0xcjl); the current version is v1.0.0.

More Skills

What is browser-cdp?

Architecture

Setup

1. Start Chrome with debugging port

2. Start CDP Proxy

API Reference

Tool Selection: Three-Layer Strategy

Known Limitations

Site Patterns & Usage Log

Origin

What is browser-cdp?

How do I install browser-cdp?

Is browser-cdp free?

Which platforms does browser-cdp support?

Who created browser-cdp?

💬 Comments