hzj520lyx

PyAutoGUI Controller

Author: hzj520lyx · GitHub ↗ · v1.0.1 · MIT-0
cross-platform ⚠ suspicious
Total downloads: 362
Favorites: 0
Current installs: 1
Versions: 2
Install in OpenClaw
/install pyautogui-controller
Description
Control Windows desktop with natural-language commands for screenshots, app launching, typing, clicking, and multi-step UI automation using a local Python co...
Security recommendations
This skill appears to be what it says: a local Windows desktop automation controller using pyautogui and an optional Playwright DOM bridge. Before installing or invoking it, consider the following:
- It will control your screen, move and click the mouse, type (including via the clipboard), take screenshots, and launch arbitrary executables found on the machine; only run it on a machine and account you trust. Treat it like a remote-control tool.
- The code writes logs and state to runtime/ and saves learned configs under your home directory (~/.pyautogui-controller). Expect persistent state and profile directories (e.g., runtime/playwright_profile).
- The Playwright bridge can navigate to web URLs and may connect to a CDP endpoint if DOM_BRIDGE_CDP_URL is set; that can create network interactions. If you do not need DOM-level automation, ensure PYAUTOGUI_CONTROLLER_USE_DOM is set to disable it, or avoid providing bridge env vars.
- There are minor metadata mismatches: SKILL.md declares python/python3 as required but the registry entry lists no required binaries, and the code reads multiple env vars that are not declared in registry metadata. Verify and set expected environment variables intentionally rather than relying on defaults.
- Recommended precautions: run in a sandbox or throwaway VM to test, inspect runtime/playwright_profile and runtime/logs after running, and avoid providing elevated privileges or network-facing CDP URLs unless necessary. If you need to grant less privilege, consider running the skill with a limited user account and without the DOM bridge enabled.
Functional analysis
Type: OpenClaw Skill
Name: pyautogui-controller
Version: 1.0.1

The bundle provides a powerful desktop and browser automation framework using PyAutoGUI and Playwright. It contains high-risk capabilities, including the ability to execute arbitrary local binaries via os.startfile and subprocess.Popen in action/desktop_backend.py, and the ability to attach to existing browser sessions via CDP in bridges/playwright_dom_bridge.py. While these features align with the stated goal of 'advanced automation,' the lack of robust input sanitization in the NLU parser (nlu/parser.py) poses a risk of command injection if the agent is prompted with malicious input. The inclusion of security tools like 'Burp Suite' in the application mapping and the use of hardcoded local paths (C:\Users\dev\Desktop\昱昱\...) suggest a highly specialized and potentially intrusive automation environment.
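One common way to reduce the command-injection risk described above is to resolve requested application names against a fixed allowlist before anything reaches subprocess.Popen. The sketch below is illustrative only and is not taken from the skill's code: ALLOWED_APPS, resolve_app, and launch_app are hypothetical names, and the two executable paths are assumed examples for a default Windows install.

```python
import subprocess

# Hypothetical allowlist: friendly name -> absolute executable path.
# These entries are illustrative assumptions, not the skill's real app mapping.
ALLOWED_APPS = {
    "notepad": r"C:\Windows\System32\notepad.exe",
    "calculator": r"C:\Windows\System32\calc.exe",
}


def resolve_app(requested, allowed=ALLOWED_APPS):
    """Map a user-supplied app name to a vetted path, or raise ValueError."""
    key = requested.strip().lower()
    if key not in allowed:
        raise ValueError(f"application not on the allowlist: {requested!r}")
    return allowed[key]


def launch_app(requested, allowed=ALLOWED_APPS):
    """Launch an allowlisted application without going through a shell."""
    path = resolve_app(requested, allowed)
    # Passing a list with the default shell=False means the requested name
    # is never interpreted by a shell; only the vetted path is executed.
    return subprocess.Popen([path])
```

With this shape, a parsed request such as "notepad & del C:\x" is rejected by resolve_app instead of being forwarded to the launcher.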
Capability assessment
Purpose & Capability
The repository implements local desktop automation (pyautogui, keyboard/mouse controllers, window manager, OCR, Playwright DOM bridge) which aligns with the skill description: screenshots, app launching, typing, clicking and multi-step UI automation.
Instruction Scope
SKILL.md tells the agent to run the local controller script in a specific path and use it for screen-based automation. The code is consistent with those instructions but contains actions that go beyond simple 'click and type' (searches Program Files / desktop for installed apps, launches arbitrary executables, reads/writes configs and logs, takes screenshots, and may attach to browser CDP sessions via the bridge). Those behaviors are expected for this purpose but are powerful — the SKILL.md does not explicitly enumerate all filesystem locations scanned or the logging/state files created.
Install Mechanism
No install spec is provided (instruction-only wrapper). All code is included in the bundle; nothing is downloaded at install time. That lowers risk compared with remote downloads.
Credentials
The registry declared no required env vars, but the code reads several environment variables (e.g., BROWSER_DOM_BRIDGE_CMD, BROWSER_DOM_DEFAULT_URL, DOM_BRIDGE_CDP_URL, DOM_BRIDGE_* envs, PYAUTOGUI_CONTROLLER_USE_DOM). SKILL.md metadata lists python/python3 as required, but registry metadata did not — a small mismatch. The env usage is plausible for the bridge functionality but these vars can change runtime behavior (e.g., connecting to a CDP URL).
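Because these variables can change runtime behavior without being declared in the registry metadata, it may help to check which of them are set before invoking the skill. The following is a minimal sketch: the variable names come from the assessment above, the DOM_BRIDGE_ prefix stands in for the "DOM_BRIDGE_*" family, and audit_bridge_env is a hypothetical helper, not part of the skill.

```python
import os

# Variable names as listed in the assessment above.
NAMED_VARS = (
    "BROWSER_DOM_BRIDGE_CMD",
    "BROWSER_DOM_DEFAULT_URL",
    "DOM_BRIDGE_CDP_URL",
    "PYAUTOGUI_CONTROLLER_USE_DOM",
)
# Prefix standing in for the "DOM_BRIDGE_*" family (exact names are not listed).
PREFIXES = ("DOM_BRIDGE_",)


def audit_bridge_env(environ=None):
    """Return the subset of `environ` that the skill is reported to read."""
    env = os.environ if environ is None else environ
    found = {}
    for name, value in env.items():
        if name in NAMED_VARS or name.startswith(PREFIXES):
            found[name] = value
    return found


if __name__ == "__main__":
    # Print every bridge-related variable currently set in this process.
    for name, value in sorted(audit_bridge_env().items()):
        print(f"{name}={value}")
```

An empty result means none of the listed variables are set in the current environment; how the skill behaves in that case depends on its own defaults, which this page does not document.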
Persistence & Privilege
The skill writes state, logs, profiles and learned site configs under runtime/ and the user's home directory (~/.pyautogui-controller). It also can start arbitrary local executables and navigate browsers via Playwright/CDP. These are expected for a desktop automation tool but give it broad local privileges (screenshots, launching apps, clipboard access).
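To see what the skill has actually left on disk after a run, the two locations named above can be walked and listed. This is a rough sketch under stated assumptions: list_state_files is a hypothetical helper, and runtime/ is assumed to be relative to the directory the skill was run from.

```python
from pathlib import Path


def list_state_files(roots):
    """Return sorted paths of all regular files under the given directories."""
    files = []
    for root in roots:
        root = Path(root).expanduser()
        if not root.is_dir():
            continue  # location has not been created (yet)
        files.extend(p for p in root.rglob("*") if p.is_file())
    return sorted(files)


if __name__ == "__main__":
    # Assumption: runtime/ sits in the current working directory.
    for path in list_state_files(["runtime", "~/.pyautogui-controller"]):
        print(path)
```

Running it before and after a test session gives a simple before/after comparison of logs, profiles, and learned configs.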
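How to use
  1. Make sure OpenClaw is installed (locally or via Docker).
  2. Enter the install command in the chat box: /install pyautogui-controller
  3. Once installation finishes, invoke the Skill by name or trigger it with /pyautogui-controller
  4. Provide the inputs described in the Skill's parameter documentation to receive structured output.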
Version history
v1.0.1
Public upload from desktop skill folder
v1.0.0
Initial public release
Metadata
Slug pyautogui-controller
Version 1.0.1
License MIT-0
Total installs 1
Current installs 1
Versions 2
FAQ

What is PyAutoGUI Controller?

Control Windows desktop with natural-language commands for screenshots, app launching, typing, clicking, and multi-step UI automation using a local Python co... It is an AI Agent Skill plugin for Claude Code / OpenClaw, with 362 total downloads to date.

How do I install PyAutoGUI Controller?

Run the command "/install pyautogui-controller" in the OpenClaw or Claude Code chat box for a one-step install; no additional configuration is required.

Is PyAutoGUI Controller free?

Yes. PyAutoGUI Controller is completely free and released under the MIT-0 license, so it can be downloaded, installed, and used freely.

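Which platforms does PyAutoGUI Controller support?

PyAutoGUI Controller is tagged cross-platform and can be used in any environment where OpenClaw / Claude Code is deployed. Note, however, that the description and analysis on this page describe it as a Windows desktop controller.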

Who develops PyAutoGUI Controller?

It is developed and maintained by hzj520lyx (@hzj520lyx). The current version is v1.0.1.
