datappt8

A powerful skill package for screen viewing, OCR, and screen analysis, designed for AI assistants

by datappt8 · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ⚠ suspicious
Downloads: 591
Stars: 0
Active Installs: 1
Versions: 1
Install in OpenClaw
/install screen-capture-hub
Description
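Use this skill for any task that requires capturing screenshots, analyzing screen content, or processing screen images. This includes: capturing the full screen or a specific region with the Python PIL library; saving screenshots to a specified location; analyzing text in screenshots (OCR for Chinese and English); and basic image processing of screenshots (crop, rotate, resize). When the user mentions "屏幕截图" (screenshot), "截屏" (screen capture), "查看屏幕" (view the screen), or needs to analyze what is on the screen...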
Usage Guidance
This package appears to do what it says: take screenshots and run OCR/analysis. Before installing or running it, consider the following:
  - Screenshots capture all visible data on your screen (including passwords, 2FA codes, private messages). Only run these scripts in a trusted environment and be mindful of privacy of captured images and output files.
  - The Windows installer script (scripts/install_tesseract.py) downloads a .exe from the network and executes it with silent install flags. Verify the download URLs and, if possible, the file checksum/signature before running. Prefer to install Tesseract manually from the official project or your distribution's package manager if you are uncomfortable with automated installers.
  - Inspect scripts/install_tesseract.py and scripts/setup.py (and any other scripts) before running them to ensure there is no unwanted network behavior or commands. Consider running the package in a disposable VM/container first.
  - The package does not request credentials, but it will write files (images, text, JSON) to the working directory. Review where output is stored and ensure sensitive captures are handled securely.
  - If you plan to allow an agent to invoke this skill autonomously, restrict autonomous execution or monitor runs; automated screenshotting has high privacy impact. Disable or limit autonomous invocation if you cannot review each action.
  - Good checks to perform: run scripts/dependency_check.py to see what is missing, run the examples in a safe environment, and prefer manual installation of Tesseract (brew/apt or official Windows installer you obtained yourself).
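The checksum advice above can be sketched in a few lines of Python. This is a minimal illustration, not code from the package: the helper names sha256_of_file and matches_expected are hypothetical, and the expected digest has to come from a source you already trust (for example, a value published by the Tesseract build maintainers), since the listing itself provides none.

```python
import hashlib
import hmac
from pathlib import Path


def sha256_of_file(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the hex SHA-256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path: Path, expected_hex: str) -> bool:
    """Compare the file digest with a checksum obtained from a trusted source.

    expected_hex is an assumption of this sketch: the caller must supply it.
    """
    actual = sha256_of_file(path)
    return hmac.compare_digest(actual, expected_hex.strip().lower())
```

A downloaded installer would only be executed if matches_expected returns True; otherwise the file should be deleted and the download source re-checked.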
Capability Analysis
Type: OpenClaw Skill
Name: screen-capture-hub
Version: 1.0.0
The skill bundle provides extensive screen capture, OCR, and analysis capabilities, which are inherently high-risk. Most notably, 'scripts/install_tesseract.py' automatically downloads and executes a binary installer from external sources (digi.bib.uni-mannheim.de and github.com) without hash verification, posing a significant supply chain risk. Additionally, 'scripts/setup.py' uses 'exec()' on a hardcoded string for environment verification, and the overall toolset allows for silent monitoring of user screens and text extraction. While these features align with the stated purpose, the automated execution of remote binaries and the sensitive nature of screen access warrant a suspicious classification.
Capability Assessment
Purpose & Capability
Name/description, SKILL.md, README and included scripts consistently implement screen capture, OCR and basic image analysis. Requested Python libraries (Pillow, pyautogui, pytesseract, OpenCV, numpy) and reference to Tesseract OCR are appropriate and expected for the described functionality.
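To see which of the libraries listed above are present without importing (and therefore executing) any of them, a check along the following lines may help. It is a sketch only and is not the package's scripts/dependency_check.py; the mapping from package names to import names (Pillow to PIL, OpenCV to cv2) and the function name missing_packages are assumptions of this example.

```python
import importlib.util

# Assumed mapping from the package names listed above to their import names.
REQUIRED_MODULES = {
    "Pillow": "PIL",
    "pyautogui": "pyautogui",
    "pytesseract": "pytesseract",
    "OpenCV": "cv2",
    "numpy": "numpy",
}


def missing_packages(required: dict[str, str]) -> list[str]:
    """Return the package names whose import module cannot be located."""
    return [
        package
        for package, module in required.items()
        if importlib.util.find_spec(module) is None
    ]


if __name__ == "__main__":
    missing = missing_packages(REQUIRED_MODULES)
    print("Missing:", ", ".join(missing) if missing else "none")
```

Note that this only covers the Python libraries; the Tesseract OCR binary is a separate program and has to be checked separately.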
Instruction Scope
SKILL.md instructs the agent to run local Python scripts to capture the screen, save images, and perform OCR/analysis — these are within scope. However the runtime instructions also call out a one‑click installer (python scripts/setup.py) and an install script that will download and run a Tesseract installer; these steps cause network activity and execution of an external binary beyond simple local script execution.
Install Mechanism
There is no platform install spec in the package manifest, but scripts/install_tesseract.py programmatically downloads a Windows installer EXE (primary URL: digi.bib.uni-mannheim.de, fallback to a GitHub releases download URL) and runs it silently. Downloading and executing an external installer is a high-risk action — even if the URLs appear related to UB‑Mannheim Tesseract builds, it requires the user to trust external binaries and grants the script the ability to run privileged installers on the host.
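One way to review a download URL before fetching anything is to confirm that it uses HTTPS and that its host is one of the two hosts named above. The sketch below assumes nothing about the actual URLs in scripts/install_tesseract.py beyond those host names; TRUSTED_HOSTS and is_trusted_download are hypothetical names, and the check covers only the initial URL, not any redirects the server may issue.

```python
from urllib.parse import urlsplit

# Hosts named in the review above; extend or narrow this set as you see fit.
TRUSTED_HOSTS = {"digi.bib.uni-mannheim.de", "github.com"}


def is_trusted_download(url: str) -> bool:
    """Accept only HTTPS URLs whose host exactly matches a trusted host."""
    parts = urlsplit(url)
    return parts.scheme == "https" and parts.hostname in TRUSTED_HOSTS
```

An exact host match is used deliberately: a substring test would accept look-alike hosts such as github.com.example.org. Passing this check does not make the binary safe; it only narrows where the binary may come from, so it complements rather than replaces a checksum or signature check.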
Credentials
The skill does not request environment variables, credentials, or config paths unrelated to its purpose. Dependency checks inspect common Program Files paths and call tesseract commands; this is proportional to locating a locally installed Tesseract binary. No secrets or unrelated tokens are requested.
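Locating an existing Tesseract binary, as described above, can be done without installing anything. The following is a rough sketch under stated assumptions rather than the package's own logic: the Tesseract-OCR folder name under Program Files is assumed to be the installer's default location, and find_tesseract and DEFAULT_CANDIDATES are hypothetical names.

```python
import os
import shutil
from pathlib import Path
from typing import Iterable, Optional

# Assumed default install locations on Windows; adjust for your system.
DEFAULT_CANDIDATES = (
    Path(os.environ.get("ProgramFiles", r"C:\Program Files"))
    / "Tesseract-OCR" / "tesseract.exe",
    Path(os.environ.get("ProgramFiles(x86)", r"C:\Program Files (x86)"))
    / "Tesseract-OCR" / "tesseract.exe",
)


def find_tesseract(candidates: Iterable[Path] = DEFAULT_CANDIDATES) -> Optional[str]:
    """Return the path of a Tesseract executable, or None if none is found."""
    # Prefer whatever is already on PATH (covers brew/apt installs too).
    on_path = shutil.which("tesseract")
    if on_path:
        return on_path
    # Fall back to the assumed well-known locations.
    for candidate in candidates:
        if candidate.is_file():
            return str(candidate)
    return None
```

If the function returns None, installing Tesseract manually from a source you trust avoids running the automated installer at all.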
Persistence & Privilege
Skill flags show no forced persistence (always: false) and normal autonomous invocation is allowed. The only persistent effect the package may request is installing Tesseract system‑wide (writes to Program Files), which requires administrative privileges on Windows. The skill does not modify other skills or system agent configs.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install screen-capture-hub
  3. After installation, invoke the skill by name or use /screen-capture-hub
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
openclaw-screen-viewer 1.0.0 – Initial release
  - Adds Python-based screen capture and image processing capabilities (full screen or region).
  - Supports screenshot saving, text extraction via OCR (Chinese and English), and basic image editing (crop, rotate, resize).
  - Provides scripts for quick capture, OCR, region selection, and screen content analysis.
  - Includes detailed usage, installation, and troubleshooting instructions for multiple platforms.
Metadata
Slug screen-capture-hub
Version 1.0.0
License MIT-0
All-time Installs 1
Active Installs 1
Total Versions 1
Frequently Asked Questions

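What is screen-capture-hub ("A powerful skill package for screen viewing, OCR, and screen analysis, designed for AI assistants")?

Use this skill for any task that requires capturing screenshots, analyzing screen content, or processing screen images. This includes: capturing the full screen or a specific region with the Python PIL library; saving screenshots to a specified location; analyzing text in screenshots (OCR for Chinese and English); and basic image processing of screenshots (crop, rotate, resize). When the user mentions "屏幕截图" (screenshot), "截屏" (screen capture), "查看屏幕" (view the screen), or needs to analyze what is on the screen... It is an AI Agent Skill for Claude Code / OpenClaw, with 591 downloads so far.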

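How do I install screen-capture-hub?

Run "/install screen-capture-hub" in the OpenClaw or Claude Code chat to install the skill in one step. The OCR features additionally depend on Tesseract OCR being installed on the host; see Usage Guidance before running the bundled installer scripts.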

Is screen-capture-hub free?

Yes, screen-capture-hub is free and licensed under MIT-0. You can download, install, and use it at no cost.

Which platforms does screen-capture-hub support?

screen-capture-hub is listed as cross-platform and runs wherever OpenClaw or Claude Code is available.

Who created screen-capture-hub?

It is built and maintained by datappt8 (@datappt8); the current version is v1.0.0.
