← Back to Skills Marketplace
406
Downloads
1
Stars
1
Active Installs
1
Versions
Install in OpenClaw
/install clawphone-phone-control
Description
使用手机控制 MCP 完成手机界面感知与操作。适用于读取当前手机状态、打开 App、处理弹窗、点击控件、输入文本、排查手机自动化失败等场景。执行时优先读取界面状态,涉及坐标点击时必须基于当前截图临时判定,禁止把历史坐标当成通用规则。
README (SKILL.md)
\r \r
ClawPhone Phone Control\r
\r
快速流程\r
\r
- 先感知,再操作。\r
- 优先用文字/节点能力,坐标点击只在当前截图下临时使用。\r
- 脆弱流程必须一步一验,不要把多步盲打成“理应成功”。\r \r
感知优先级\r
\r
- 先用
get_screen_info判断前台应用、分辨率、可见文字。\r - 需要精确定位时,再用
capture_screen。\r - 需要找标准控件时,优先
click_by_text或find_node。\r \r
操作原则\r
\r
- 输入前先确认输入框真的已获焦。\r
- 启动 App 后先确认前台应用真的切换成功。\r
- 若工具返回“已点击”“已启动”“已输入”,仍要以界面复核为准。\r
- 发送前先确认文本真的已经进入输入框。\r
- 点击发送前先确认发送控件真实可见。\r
- 发送后必须再次确认界面已变化,不要仅凭工具返回值认定成功。\r \r
启动 App\r
\r
- 可先调用
launch_app(...)。\r - 立即用
get_current_app或get_screen_info确认前台应用是否真的切换。\r - 若仍停留在桌面或其他 App,不要在同一路径上重复盲开。\r
- 先读取当前屏幕状态,再决定是否改用桌面图标点击、通知入口或其他页面内入口。\r \r
输入兜底\r
\r
- 先确认输入框已聚焦。\r
- 优先尝试
type_text(...)。\r - 若
type_text(...)失败,或文本没有真正进入输入框,立即切到兜底链路:\rset_clipboard(...)\r- 长按当前截图里的输入区域\r
- 截图确认菜单\r
- 基于当前截图临时点击
粘贴\r
- 粘贴后再次确认文本真的进入输入框,再继续后续操作。\r \r
坐标原则\r
\r
- 坐标只对应“当前设备、当前页面、当前截图”。\r
- 不要把一次成功的坐标写成固定流程。\r
- 如果页面一变、键盘弹出、工具栏切换,之前的坐标立即失效,应重新截图。\r \r
失败排查\r
\r 按下面顺序定位失败点:\r \r
- 是否在正确页面。\r
- 是否点中了真实控件,而不是附近空白区域。\r
- 输入框是否真的聚焦。\r
- 文字是否真的进入输入框。\r
- 发送按钮是否真的出现且可点击。\r
- 操作后界面是否真的变化。\r
- 某条路径失败后,是否及时切换到更合适的兜底路径,而不是重复试错。\r \r
微信等脆弱场景\r
\r
- 聊天发送属于脆弱流程,默认一步一验。\r
press_enter只能作为兜底,不要把它当作“必然发送成功”的主路径。\r- 微信自定义弹窗通常不在无障碍树中,遇到菜单项时应截图后临时取坐标点击。\r \r
参考\r
\r
- 需要工具说明时,读
tools-reference.md。\r
Usage Guidance
This skill appears coherent and limited to controlling a phone via MCP primitives. Before installing, ensure the MCP toolset it expects actually exists and is trusted in your environment. Be aware the skill may use the device clipboard as a fallback — avoid sending highly sensitive secrets through clipboard/paste flows. Because the skill source/homepage is unknown, prefer installing only in environments where you trust the underlying MCP provider and have control over which device actions are permitted.
Capability Analysis
Type: OpenClaw Skill
Name: clawphone-phone-control
Version: 1.0.0
The skill bundle provides a comprehensive set of tools and instructions for mobile device automation and interface interaction, including screen capture, text recognition, and input simulation. The instructions in SKILL.md and tools-reference.md focus on operational reliability and verification (e.g., confirming app launches and text entry) rather than malicious activities. No evidence of data exfiltration, unauthorized remote execution, or prompt injection attacks was found; the capabilities are consistent with the stated purpose of phone control.
Capability Assessment
Purpose & Capability
Name/description say: control phone UI and perform screen-aware operations. The SKILL.md only references MCP perception and action primitives (get_screen_info, capture_screen, tap, type_text, launch_app, etc.), which are exactly what such a skill needs. No unrelated binaries, credentials, or config paths are requested.
Instruction Scope
Instructions consistently limit actions to reading the current screen and performing UI interactions on the device; they emphasize verification (one-step-one-check) and warn against reusing historical coordinates. No instructions to read host files, network‑exfiltrate data, or access unrelated system state are present.
Install Mechanism
No install spec and no code files — instruction-only. This is the lowest-risk install model and appropriate for a skill that delegates to existing MCP tooling.
Credentials
No environment variables, credentials, or external API keys are requested. The only sensitive operation is setting/using the device clipboard (documented in tools-reference), which is proportional to the described input/paste fallback.
Persistence & Privilege
always:false and user-invocable: true. The skill does not request permanent presence or to modify other skills/settings. Autonomous invocation is allowed by platform default but not a special privilege here.
How to Use
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat:
/install clawphone-phone-control - After installation, invoke the skill by name or use
/clawphone-phone-control - Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
Initial public release from App-Control project.
Metadata
Frequently Asked Questions
What is ClawPhone Phone Control?
使用手机控制 MCP 完成手机界面感知与操作。适用于读取当前手机状态、打开 App、处理弹窗、点击控件、输入文本、排查手机自动化失败等场景。执行时优先读取界面状态,涉及坐标点击时必须基于当前截图临时判定,禁止把历史坐标当成通用规则。 It is an AI Agent Skill for Claude Code / OpenClaw, with 406 downloads so far.
How do I install ClawPhone Phone Control?
Run "/install clawphone-phone-control" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.
Is ClawPhone Phone Control free?
Yes, ClawPhone Phone Control is completely free, licensed under MIT-0. You can download, install and use it at no cost.
Which platforms does ClawPhone Phone Control support?
ClawPhone Phone Control is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).
Who created ClawPhone Phone Control?
It is built and maintained by CLOUD BOY (@be1human); the current version is v1.0.0.
More Skills