← Back to Skills Marketplace

Claw Use — Device Control for AI Agents

Name: Claw Use — Device Control for AI Agents
Author: 4ier

by 傅洋 · GitHub ↗ · v1.0.0 · MIT-0

cross-platform ⚠ suspicious

250

Downloads

Stars

Active Installs

Versions

Install in OpenClaw

/install claw-use

Description

Control physical devices over HTTP with unified commands for screen reading, input actions, app launch, navigation, and audio output using the Claw Use proto...

README (SKILL.md)

Claw Use — Device Control for AI Agents

Give your AI agent eyes, hands, and a voice on real devices.

Claw Use is a protocol + skill for AI agents to control physical devices over HTTP. The cu CLI provides a unified interface — the same commands work across any device that implements the Claw Use API.

Supported Devices

Platform	Implementation	Status
Android	claw-use-android	✅ Available
iOS	claw-use-ios	🔮 Planned
Desktop	claw-use-desktop	🔮 Planned

Prerequisites

cu CLI installed (ships with claw-use-android, or install standalone)
At least one device running a Claw Use implementation
Device and agent on the same network (or connected via Tailscale)

Setup

# Add a device with a friendly name
cu add redmi 192.168.0.105 \x3Ctoken>
cu add pixel 100.80.1.10 \x3Ctoken>

# List devices
cu devices
# ▸ redmi  192.168.0.105  online v1.2.0
#   pixel  100.80.1.10    offline

# Switch default
cu use pixel

# Target a specific device
cu -d redmi screenshot

Core API (all platforms)

Every Claw Use implementation exposes the same HTTP endpoints:

Perception — read the device

cu screen              # UI tree (semantic: element text, bounds, state)
cu screen -c           # compact mode (interactive elements only)
cu screenshot          # visual capture (JPEG, configurable quality)
cu notifications       # system notifications
cu status              # device health dashboard

Action — control the device

cu tap \x3Cx> \x3Cy>         # tap coordinates
cu click \x3Ctext>        # tap by visible text (semantic click)
cu type "text"         # type text (CJK supported)
cu swipe up|down|left|right
cu scroll up|down|left|right
cu back / cu home      # system navigation
cu launch \x3Capp>        # open an application
cu open \x3Curl>          # open URL
cu intent '\x3Cjson>'     # platform-specific intent (Android)

Audio

cu tts "hello"         # speak through device speaker
cu say "你好"          # alias

Device State

cu wake                # wake screen
cu lock / cu unlock    # lock/unlock (PIN required for unlock)

Workflow Patterns

Navigate and interact

cu launch org.telegram.messenger
cu screen -c                        # see what's on screen
cu click "Search"
cu type "John"
cu click "John, last seen recently"
cu type "Hey!"
cu click "Send"

Visual + semantic dual-channel

cu screen -c                         # semantic: what elements exist
cu screenshot 50 720 /tmp/look.jpg   # visual: what it actually looks like

Multi-device orchestration

cu -d phone1 launch com.whatsapp
cu -d phone2 screenshot
cu -d tablet open "https://example.com"

For Agent Developers

Claw Use is designed as a protocol, not just an app. To add support for a new platform:

Implement the Claw Use HTTP API spec
Expose endpoints on a configurable port (default: 7333)
Support token auth via X-Bridge-Token header
Return JSON responses matching the documented schemas

The cu CLI and this skill work automatically with any compliant implementation.

Tips

cu screen -c is the primary perception tool — compact mode filters noise
cu click by text is more reliable than cu tap when text is visible
cu screenshot when you need visual context the UI tree can't capture
Auto-unlock is transparent: locked devices auto-unlock before any command
Combine with Tailscale for remote access from anywhere

Usage Guidance

This skill appears to be a genuine device-control protocol, but a few gaps warrant caution. Before installing or enabling it: 1) Confirm how authentication tokens are provided and stored — the skill mentions X-Bridge-Token but declares no required credentials; avoid giving long-lived tokens unless you trust the implementation. 2) Clarify the unlock behavior — does the agent really auto-unlock devices, and under what conditions? Auto-unlock can let the agent access locked device contents. 3) Restrict autonomous invocation or require explicit user confirmation for sensitive actions (screenshots, reading notifications, unlocking, sending intents). 4) Only use implementations you trust (verify the cu CLI source or the device-side implementation), run device control on isolated networks when possible, and avoid exposing control ports to the public internet. 5) If the publisher can provide updated metadata (declared required credentials, install instructions, and a security/privacy page), that would reduce uncertainty and could change this assessment to benign.

Capability Analysis

Type: OpenClaw Skill Name: claw-use Version: 1.0.0 The skill bundle provides instructions for an AI agent to use the 'cu' CLI tool for remote device control. While the functionality is transparently documented and aligned with its stated purpose, it introduces high-risk capabilities such as visual screen capture (cu screenshot), reading system notifications (cu notifications), and simulating user input or keystrokes (cu type, cu tap) on connected devices. These features represent a significant attack surface if the agent is manipulated, though no evidence of intentional malice, backdoors, or unauthorized data exfiltration was found in SKILL.md or _meta.json.

Capability Assessment

ℹ Purpose & Capability

The SKILL.md describes a coherent device-control protocol (HTTP API, cu CLI, endpoints for screen, tap, screenshot, TTS, etc.), which matches the skill name and description. However, the doc requires token-based auth (X-Bridge-Token) yet the registry metadata declares no required credentials or primaryEnv. That omission is notable: a real integration would usually declare how tokens are supplied.

⚠ Instruction Scope

Runtime instructions permit reading UI trees, screenshots, notifications, and performing input/launch actions — all expected for device control but high-impact. SKILL.md also contains a contradiction: Device State says 'unlock (PIN required for unlock)' but Tips state 'Auto-unlock is transparent: locked devices auto-unlock before any command.' That discrepancy affects security assumptions (automatic unlocking vs requiring PIN). The instructions do not limit or describe how sensitive data (screenshots, notifications) should be handled or transmitted.

✓ Install Mechanism

This is instruction-only with no install spec or bundled code, which is low-risk from a supply chain perspective. The skill expects the 'cu' CLI to be installed but doesn't provide an install mechanism — acceptable but the absence of an official install source or guidance in metadata reduces transparency.

⚠ Credentials

The protocol requires token auth via X-Bridge-Token per the docs, but the registry metadata lists no required environment variables or primary credential. The skill also suggests optional Tailscale use for remote access (which would require separate credentials/config). Requesting no declared credentials while describing token-based auth is inconsistent and leaves unclear how secrets are expected to be supplied or stored.

ℹ Persistence & Privilege

always:false (no forced persistence). The skill can be invoked autonomously by the agent (platform default). Given this skill allows remote control of physical devices (taps, unlocks, app launches, screenshots), autonomous invocation increases risk; consider restricting autonomous use or requiring explicit user approval for commands that affect devices or capture content.

How to Use

Make sure OpenClaw is installed (local or Docker)
Run the install command in chat: /install claw-use
After installation, invoke the skill by name or use /claw-use
Provide required inputs per the skill's parameter spec and get structured output

Version History

v1.0.0

Initial release: cu CLI + skill for controlling Android phones (and future devices) via HTTP API. 25 endpoints covering screen reading, screenshots, tap/click/type/swipe, TTS, intents, notifications, and device management. Multi-device support with named aliases.

Metadata

Slug claw-use

Version 1.0.0

License MIT-0

All-time Installs 0

Active Installs 0

Total Versions 1

Frequently Asked Questions

What is Claw Use — Device Control for AI Agents?

Control physical devices over HTTP with unified commands for screen reading, input actions, app launch, navigation, and audio output using the Claw Use proto... It is an AI Agent Skill for Claude Code / OpenClaw, with 250 downloads so far.

How do I install Claw Use — Device Control for AI Agents?

Run "/install claw-use" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Claw Use — Device Control for AI Agents free?

Yes, Claw Use — Device Control for AI Agents is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Claw Use — Device Control for AI Agents support?

Claw Use — Device Control for AI Agents is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Claw Use — Device Control for AI Agents?

It is built and maintained by 傅洋 (@4ier); the current version is v1.0.0.

More Skills

Claw Use — Device Control for AI Agents

Claw Use — Device Control for AI Agents

Supported Devices

Prerequisites

Setup

Core API (all platforms)

Perception — read the device

Action — control the device

Audio

Device State

Workflow Patterns

Navigate and interact

Visual + semantic dual-channel

Multi-device orchestration

For Agent Developers

Tips

What is Claw Use — Device Control for AI Agents?

How do I install Claw Use — Device Control for AI Agents?

Is Claw Use — Device Control for AI Agents free?

Which platforms does Claw Use — Device Control for AI Agents support?

Who created Claw Use — Device Control for AI Agents?

💬 Comments