← Back to Skills Marketplace
屏幕截图OCR工具
by
flyingants
· GitHub ↗
· v1.0.1
· MIT-0
256
Downloads
0
Stars
0
Active Installs
2
Versions
Install in OpenClaw
/install screenshot-ocr-tool
Description
从屏幕截图中提取文字,支持纯文本、结构化及问答分离格式,适合中医考试题识别。
Usage Guidance
This skill appears coherent and local-only, but before installing: 1) ensure you have Tesseract OCR installed (and Chinese traineddata 'chi_sim') and Python packages Pillow and pytesseract — the skill does not declare or install them; 2) test it in a safe/isolated environment to confirm it behaves as expected; 3) be aware that image contents are processed locally by the script — do not feed sensitive images unless you are comfortable with local processing; 4) note minor metadata issues (SKILL.json version vs registry, duplicate function in script) that suggest light maintenance issues but not malicious intent. If you need automatic dependency installation or stronger provenance (homepage/source), request those from the author before trusting in production.
Capability Analysis
Type: OpenClaw Skill
Name: screenshot-ocr-tool
Version: 1.0.1
The skill bundle is a legitimate OCR tool designed for extracting and structuring text from screenshots, specifically tailored for exam questions. The implementation in `scripts/screenshot_ocr.py` uses standard libraries like `pytesseract` and `PIL` for image processing and regular expressions for text parsing. No evidence of data exfiltration, malicious execution, or prompt injection was found.
Capability Assessment
Purpose & Capability
Code implements OCR + structure/QA parsing consistent with the skill description (TCM exam screenshots). However, the skill uses Python packages (pytesseract, Pillow) and the Tesseract engine (system binary and traineddata 'chi_sim') but does not declare these runtime requirements in the metadata — this is an alignment/packaging omission.
Instruction Scope
SKILL.md points the agent to ./scripts/screenshot_ocr.py and the script only opens the provided image path, runs local OCR, and parses text. The instructions and implementation do not access unrelated files, environment variables, or network endpoints.
Install Mechanism
No install spec (instruction-only) — lowest installation risk. Code files are included but no automated installer. Because the code depends on Python packages and the external tesseract binary, the skill will fail without those installed; the skill does not provide an installation step or list these dependencies.
Credentials
The skill requests no environment variables, no credentials, and references no config paths. This is proportionate to its stated purpose (local OCR).
Persistence & Privilege
always is false and the skill does not request persistent or elevated agent privileges. It does not modify other skills or system-wide settings.
How to Use
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat:
/install screenshot-ocr-tool - After installation, invoke the skill by name or use
/screenshot-ocr-tool - Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.1
v1.0.1 - Bump version
v1.0.0
v1.0.0 - First release
Metadata
Frequently Asked Questions
What is 屏幕截图OCR工具?
从屏幕截图中提取文字,支持纯文本、结构化及问答分离格式,适合中医考试题识别。 It is an AI Agent Skill for Claude Code / OpenClaw, with 256 downloads so far.
How do I install 屏幕截图OCR工具?
Run "/install screenshot-ocr-tool" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.
Is 屏幕截图OCR工具 free?
Yes, 屏幕截图OCR工具 is completely free, licensed under MIT-0. You can download, install and use it at no cost.
Which platforms does 屏幕截图OCR工具 support?
屏幕截图OCR工具 is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).
Who created 屏幕截图OCR工具?
It is built and maintained by flyingants (@tom859174-sketch); the current version is v1.0.1.
More Skills