jackyun1024

Screen Vision

Author: Jack Yun · GitHub · v1.2.0 · MIT-0
cross-platform ⚠ suspicious
Total downloads: 656
Favorites: 0
Current installs: 5
Versions: 3
Install in OpenClaw
/install screen-vision
Description
macOS screen OCR & click automation via Apple Vision + ScreenCaptureKit. Capture any window or screen region, extract text with coordinates, find text, and c...
Safe usage advice
This skill appears to do what it says: it installs a CLI that captures screen contents and can simulate clicks. Before installing, review the upstream GitHub repository and release you will download (setup.sh references the project's GitHub releases). Prefer the Homebrew path when possible, or build from source yourself if you want maximum assurance. Be aware you will need to grant Screen Recording permission to your terminal; that permission allows the tool to capture any visible screen content, so avoid running it when sensitive information is displayed. Also note the setup script extracts a tarball into /usr/local/bin (may require elevated rights or fail depending on permissions) and the script references release v1.0.0 while the registry metadata is v1.2.0 — verify you are installing the intended version. If you are uncomfortable granting screen-recording access or installing binaries from an external release, do not run setup.sh and instead inspect or build the code locally first.
Functional analysis
Type: OpenClaw Skill
Name: screen-vision
Version: 1.2.0
The skill provides macOS screen OCR and click automation, which are high-risk capabilities requiring Screen Recording permissions. The 'setup.sh' script automatically installs dependencies by downloading a pre-built binary from a personal GitHub repository (github.com/jackyun1024/mac-screen-vision) or building from source. While the logic appears aligned with the stated purpose, the automated installation of external binaries and the potential for screen data misuse or unauthorized UI interaction make this bundle suspicious.
Capability assessment
Purpose & Capability
Name/description (macOS screen OCR + click) match the included instructions and setup script: the script installs a 'screen-vision' binary (Homebrew, GitHub release, or source build) and 'cliclick' for automation. No unrelated services, credentials, or config paths are requested.
Instruction Scope
SKILL.md limits actions to running the CLI and parsing its output (list, ocr, find, tap, wait). It explicitly requires macOS 14+ and Screen Recording permission. There are no instructions to read unrelated files, exfiltrate data, or contact unexpected endpoints.
Install Mechanism
Install is handled by the included setup.sh (no separate install spec). The script uses Homebrew where available, otherwise downloads a tarball from the project's GitHub releases or clones/builds the repo via git/swift. Those are typical approaches, but the curl|tar extraction into /usr/local/bin and building from remote source are operations that write binaries to disk and should be reviewed before running.
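As a rough sketch of the kind of review suggested here, the shell helpers below check whether an install directory is writable and list an archive's contents before anything is extracted. The function names can_write_dir and list_tarball are illustrative and are not part of the skill or its setup.sh; the archive path is whatever file you downloaded yourself. Only standard test and tar options are used.

```shell
#!/bin/sh
# Report whether the current user can write to an install directory.
# The review names /usr/local/bin as the extraction target.
can_write_dir() {
    if [ -d "$1" ] && [ -w "$1" ]; then
        echo "writable"
    else
        echo "not-writable"
    fi
}

# List the contents of a gzip-compressed tarball without extracting it.
# "$1" is a hypothetical local path to an archive you fetched yourself.
list_tarball() {
    tar -tzf "$1"
}

# Example: check the directory mentioned in the review.
can_write_dir /usr/local/bin
```

If the directory is reported as not writable, extraction would need elevated rights or would fail, which is a good point to stop and inspect the archive listing before going further.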
Credentials
The skill declares no environment variables, no credentials, and no config paths. The setup script does not attempt to read or require unrelated secrets or environment variables.
Persistence & Privilege
The skill is not forced-always and does not modify other skills. The setup script installs binaries into /usr/local/bin (write to system path) and instructs the user to grant Screen Recording permission to the terminal app — both are expected for a screen-capture tool but are elevated actions that require user consent and attention.
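How to use
  1. Make sure OpenClaw is installed (locally or via Docker)
  2. Enter the install command in the chat box: /install screen-vision
  3. Once installation finishes, invoke the Skill by name or trigger it with /screen-vision
  4. Provide the inputs described in the Skill's parameter notes to get structured output
Version history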
v1.2.0
setup.sh now downloads pre-built binary on Apple Silicon (no Xcode/brew required)
v1.1.0
Add setup.sh for auto-install of CLI binary on first use
v1.0.0
Initial release of the screen-vision skill.
- Adds macOS screen OCR and click automation via Apple Vision + ScreenCaptureKit.
- Supports OCR, text search, click automation, and wait-for-text features from the terminal.
- Allows targeting specific apps, screen regions, or the full screen.
- Outputs results as JSON or human-readable text with coordinates.
- Provides command-line usage patterns and argument handling for common automation scenarios.
Metadata
Slug screen-vision
Version 1.2.0
License MIT-0
Total installs 5
Current installs 5
Versions 3
FAQ

What is Screen Vision?

macOS screen OCR & click automation via Apple Vision + ScreenCaptureKit. Capture any window or screen region, extract text with coordinates, find text, and c... It is an AI Agent Skill plugin for Claude Code / OpenClaw, with 656 total downloads to date.

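How do I install Screen Vision?

Run the command "/install screen-vision" in the OpenClaw or Claude Code chat box to install the skill in one step. The skill's setup.sh then installs the CLI binary on first use, and the terminal app must be granted Screen Recording permission before the tool can capture the screen.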

Is Screen Vision free?

Yes. Screen Vision is completely free and is released under the MIT-0 license, so it can be freely downloaded, installed, and used.

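Which platforms does Screen Vision support?

The registry tags Screen Vision as cross-platform, but the skill itself requires macOS 14+ because it relies on Apple Vision and ScreenCaptureKit. In practice it can be used only where OpenClaw / Claude Code runs on a Mac.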

Who developed Screen Vision?

It is developed and maintained by Jack Yun (@jackyun1024). The current version is v1.2.0.
