← 返回 Skills 市场
wolverin0

Give eyes to your openclaw

作者 wolverin0 · GitHub ↗ · v1.0.1
cross-platform ⚠ suspicious
551
总下载
0
收藏
3
当前安装
2
版本数
在 OpenClaw 中安装
/install eye2byte
功能描述
Give your agent eyes — capture screenshots, voice, and annotations from any screen, monitor, or device via MCP.
使用说明 (SKILL.md)

\r \r

Eye2byte — Screen Context for Your Agent\r

\r Eye2byte is an open-source MCP server (GitHub, PyPI) that lets you see the user's screen. Use these MCP tools only when the user explicitly asks you to look at something, debug a visual issue, or capture their screen.\r \r

Privacy & Data Storage\r

\r

  • All data stays local. Captures are stored in ~/.eye2byte/output/ on the user's machine. Nothing is sent to external servers (except the vision model API the user configured).\r
  • Auto-cleanup: Captures are deleted after N days (default: 7, configurable in settings). Set to 0 to disable.\r
  • MCP token: When using SSE remote transport, the --token flag sets a bearer token stored only in the user's openclaw.json. Treat it like any API secret. The token is never logged or transmitted beyond the Authorization header.\r
  • No telemetry. Eye2byte collects zero analytics or usage data.\r \r

Available MCP Tools\r

\r

capture_and_summarize\r

Screenshot the user's screen and get a structured analysis.\r \r Parameters:\r

  • mode"full" (default), "window", or "region"\r
  • monitor0 = active monitor (default), 1/2/3 = specific monitor, -1 = ALL monitors at once\r
  • delay — seconds to wait before capturing (useful for menus/tooltips)\r
  • window_name — capture a specific app window by name (e.g., "chrome", "code")\r \r Use this when the user says things like "look at my screen", "what do you see", "debug this", or "what's wrong here".\r \r

capture_with_voice\r

Screenshot + voice recording + transcription. Returns both visual analysis and what the user said.\r \r Use when the user wants to describe something verbally while showing their screen.\r \r

record_clip_and_summarize\r

Record a short screen clip, extract keyframes, and analyze the sequence.\r \r Use when the user wants to show you something that changes over time (animations, workflows, step sequences).\r \r

summarize_screenshot\r

Analyze an existing image file. Pass a file path to get a structured analysis.\r \r

transcribe_audio\r

Local Whisper transcription of any audio file.\r \r

get_recent_context\r

Retrieve recent Context Pack summaries from previous captures.\r \r Use this to recall what you've seen recently without re-capturing.\r \r

What You Get Back\r

\r Every capture returns a structured Context Pack:\r \r

Goal           — what the user appears to be doing\r
Environment    — OS, editor, repo, branch, language\r
Screen State   — visible panels, files, terminal output\r
Signals        — verbatim errors, stack traces, warnings\r
Likely Situation — what's probably happening\r
Suggested Next Info — what you should ask or do next\r
```\r
\r
## When to Use Eye2byte\r
\r
- User mentions something visual ("this button is broken", "the layout is wrong")\r
- User asks you to "look at" or "check" something on their screen\r
- You need to see error dialogs, UI bugs, or terminal output the user can't easily copy\r
- User is debugging and visual context would help your diagnosis\r
- User asks you to monitor or watch something\r
- You want to verify your changes had the intended visual effect\r
\r
## Multi-Monitor Tips\r
\r
- `monitor=-1` captures ALL monitors stitched together — useful for seeing the full workspace\r
- `monitor=1`, `2`, `3` for targeting specific displays\r
- Default (`monitor=0`) captures whichever monitor has the active window\r
\r
## Setup\r
\r
Eye2byte must be running on the machine whose screen you want to capture:\r
\r
**Local (same machine):** Already configured if this skill loaded.\r
\r
**Remote (different machine):** The user runs `eye2byte-mcp --sse --token \x3Csecret>` on their local machine, and configures the MCP connection URL in openclaw.json.\r
安全使用建议
This skill appears to do what it advertises, but check these things before installing: (1) Confirm the 'eye2byte' package source on PyPI/GitHub matches the SKILL.md links and review the repo/readme for maintainership and recent releases. (2) Be aware that although captures are claimed to be local, configuring a remote SSE transport or a third‑party vision API will send images/audio off the machine — only enable those if you trust the destination. (3) Treat EYE2BYTE_MCP_TOKEN like any secret; check where it's stored (openclaw.json) and rotate it if you stop using the service. (4) Because the installer creates a binary, consider installing in a controlled environment first (or review the package contents) if you run this on sensitive systems.
功能分析
Type: OpenClaw Skill Name: eye2byte Version: 1.0.1 The skill bundle enables high-risk capabilities such as screen capture, voice recording, and screen recording. While the `SKILL.md` explicitly states that data stays local, has no telemetry, and includes strong agent invocation restrictions (`disable-model-invocation: true`, 'only when the user explicitly asks'), it also notes an exception: 'Nothing is sent to external servers (except the vision model API the user configured)'. This exception highlights a potential data exfiltration vector if a user configures an external or compromised vision model, making the skill's output (sensitive screen/voice data) susceptible to leaving the local machine. This constitutes a risky capability without clear malicious intent from the skill itself, thus classifying it as suspicious.
能力评估
Purpose & Capability
Name, description, and declared requirements (python, ffmpeg, EYE2BYTE_MCP_TOKEN) align with a screen-capture/recording tool that runs locally and exposes an MCP endpoint. The primary credential (MCP token) is appropriate for remote SSE transport.
Instruction Scope
SKILL.md instructs local captures and local storage (~/.eye2byte/output/) which fits the stated purpose. However it explicitly allows using an external 'vision model API the user configured' and remote SSE transport; those configurations would cause captures and transcriptions to be sent off-machine. The file also states the token is stored in openclaw.json — reading/writing agent config is expected but worth noting.
Install Mechanism
The install spec uses a 'uv' package named eye2byte that creates an 'eye2byte' binary. SKILL.md references a GitHub repo and PyPI project, so installing a package is consistent, but any install that extracts or installs binaries writes code to disk — verify the package source (PyPI/GitHub) and integrity before installing.
Credentials
Only one required environment variable (EYE2BYTE_MCP_TOKEN) is listed and it matches the described remote transport use-case. No unrelated secrets or config paths are requested.
Persistence & Privilege
always is false and model invocation is disabled (skill is user-invocable only), so the skill cannot autonomously run. It stores its own outputs under a user path and the MCP token in openclaw.json as described — this is within expected behavior.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install eye2byte
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /eye2byte 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.1
- Added privacy and security section outlining data storage, token usage, and cleanup policies. - Marked the skill as user-invocable only and disabled model invocation. - Specified new environment variable requirements for token authentication. - Clarified usage instructions: only capture when the user explicitly asks. - Minor improvements to documentation and setup details.
v1.0.0
- Initial release of Eye2byte skill. - Enables agents to capture screenshots, voice, and annotations from any screen, monitor, or device via MCP. - Provides tools for capturing and analyzing screen content, recording short clips, transcribing audio, and retrieving recent context. - Supports multi-monitor setups with options to select specific monitors or all at once. - Designed to help when visual context is needed for debugging, verification, or user assistance.
元数据
Slug eye2byte
版本 1.0.1
许可证
累计安装 3
当前安装数 3
历史版本数 2
常见问题

Give eyes to your openclaw 是什么?

Give your agent eyes — capture screenshots, voice, and annotations from any screen, monitor, or device via MCP. 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 551 次。

如何安装 Give eyes to your openclaw?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install eye2byte」即可一键安装,无需额外配置。

Give eyes to your openclaw 是免费的吗?

是的,Give eyes to your openclaw 完全免费(开源免费),可自由下载、安装和使用。

Give eyes to your openclaw 支持哪些平台?

Give eyes to your openclaw 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 Give eyes to your openclaw?

由 wolverin0(@wolverin0)开发并维护,当前版本 v1.0.1。

💬 留言讨论