Image OCR Reader

by igetmm · GitHub ↗ · v1.0.0
cross-platform ✓ Security Clean
Downloads 714
Stars 0
Active Installs 9
Versions 1
Install in OpenClaw
/install image-ocr-reader
Description
Extract text from images using OCR with support for Chinese and English in common formats like jpg, png, and jpeg.
README (SKILL.md)

Image OCR Reader

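A skill that extracts text from images, with support for Chinese and English recognition.

Features

  • Extracts text from image files (OCR)
  • Recognizes mixed Chinese and English text
  • Uses the Tesseract OCR engine
  • Works with common image formats such as jpg, png, and jpeg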
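
As a rough illustration of the features listed above, the sketch below shows how a format check and a mixed-language setting might be expressed. The names `SUPPORTED_EXTENSIONS`, `is_supported_image`, and `build_lang_string` are hypothetical and not taken from the skill's code; the extension set is an assumption based on the formats the README names. Joining language codes with `+` is how Tesseract accepts more than one language.

```python
from pathlib import Path

# Assumed set of extensions, based only on the formats the README names.
SUPPORTED_EXTENSIONS = {".jpg", ".jpeg", ".png"}


def is_supported_image(path: str) -> bool:
    """Return True if the file extension is in the assumed supported set."""
    return Path(path).suffix.lower() in SUPPORTED_EXTENSIONS


def build_lang_string(languages: list[str]) -> str:
    """Join Tesseract language codes with '+', e.g. for mixed-language pages."""
    return "+".join(languages)


print(is_supported_image("scan.JPG"))          # True
print(build_lang_string(["chi_sim", "eng"]))   # chi_sim+eng
```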

Dependencies

  • Python 3
  • tesseract-ocr
  • pytesseract
  • Pillow
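
Before running the skill, it can help to confirm that the dependencies above are visible from the current environment. The following is a minimal sketch, not part of the skill itself; `check_dependencies` is a hypothetical helper that uses only the Python standard library.

```python
import importlib.util
import shutil


def check_dependencies() -> dict[str, bool]:
    """Report which OCR dependencies are visible from this interpreter."""
    return {
        # The Tesseract engine is a system binary looked up on PATH.
        "tesseract": shutil.which("tesseract") is not None,
        # Python packages are detected without importing them.
        "pytesseract": importlib.util.find_spec("pytesseract") is not None,
        # Pillow is imported under the package name "PIL".
        "Pillow": importlib.util.find_spec("PIL") is not None,
    }


for name, available in check_dependencies().items():
    print(f"{name}: {'found' if available else 'missing'}")
```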

Installation

1. Install system dependencies

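# Ubuntu/Debian (tesseract-ocr-chi-sim provides the Chinese language data)
sudo apt-get install tesseract-ocr tesseract-ocr-chi-sim

# macOS (tesseract-lang provides additional language data, including chi_sim)
brew install tesseract tesseract-lang

# CentOS/RHEL
sudo yum install tesseract tesseract-langpack-chi_sim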

2. Install Python dependencies

pip install pytesseract Pillow

Usage

Command line

python3 image_ocr_reader.py --file /path/to/image.jpg
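
The command above could be backed by an argument parser along the following lines. This is a sketch under assumptions, not the skill's actual code: `--file` comes from the README, `--output` is mentioned in the listing's usage guidance, and `--detailed` is a hypothetical flag name for the "detailed mode" that guidance refers to.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Build a parser resembling the CLI described in this listing (assumed shape)."""
    parser = argparse.ArgumentParser(description="Extract text from an image with OCR.")
    parser.add_argument("--file", required=True, help="path to the input image")
    # --output is mentioned in the usage guidance; its exact semantics are assumed.
    parser.add_argument("--output", default=None, help="optional path to write the text to")
    # Hypothetical flag name for the "detailed mode" the guidance refers to.
    parser.add_argument("--detailed", action="store_true", help="include per-word confidence")
    return parser


args = build_parser().parse_args(["--file", "/path/to/image.jpg"])
print(args.file, args.output, args.detailed)
```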

Python API

from image_ocr_reader import extract_text

text = extract_text("/path/to/image.jpg")
print(text)
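
For readers curious what a function like `extract_text` might do internally, here is a minimal sketch built on pytesseract and Pillow. It is an illustrative stand-in, not the author's implementation; the `lang` parameter and its `chi_sim+eng` default are assumptions, and running it requires the Tesseract binary with the matching language data installed.

```python
from pathlib import Path


def extract_text(image_path: str, lang: str = "chi_sim+eng") -> str:
    """Hypothetical stand-in for the skill's extract_text; not the author's code."""
    path = Path(image_path)
    if not path.is_file():
        raise FileNotFoundError(f"No such image: {image_path}")

    # Imported lazily so the path check above works without the OCR stack installed.
    import pytesseract
    from PIL import Image

    with Image.open(path) as image:
        # image_to_string runs the tesseract binary on the image and returns its text.
        return pytesseract.image_to_string(image, lang=lang).strip()
```

Checking the path before importing the OCR libraries keeps the error for a mistyped filename clear, even on a machine where the dependencies are not installed yet.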

Example output

Given an image that contains text, the skill outputs the extracted text.

Credits

  • Version: 1.0.0
  • Author: OpenClaw
  • License: MIT
Usage Guidance
This skill appears to perform only local OCR with Tesseract and Pillow. Before installing or running it:
  1. Manually install the tesseract binary and the appropriate language data (chi_sim), as the SKILL.md directs; without the Chinese traineddata, Chinese OCR will fail.
  2. Install the Python packages (pytesseract, Pillow) in a controlled environment such as a virtualenv.
  3. Note that --output writes to whatever path you provide; avoid passing paths you don't trust.
  4. The code has no networking or secret access, so it won't exfiltrate data, but treat untrusted images cautiously (Pillow vulnerabilities are possible in edge cases).
  5. If you plan to use the detailed mode, test it first: the code returns confidence values as strings, which may cause minor formatting issues when printing. This is a quality issue, not a malicious one.
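
The caveat about confidence values arriving as strings can be handled on the caller's side. The sketch below is one possible approach, assuming the values look like Tesseract's usual confidence field; `parse_confidence` is a hypothetical helper and not part of the skill.

```python
from typing import Optional, Union


def parse_confidence(value: Union[str, int, float]) -> Optional[float]:
    """Convert an OCR confidence value to float; None when no confidence applies."""
    try:
        conf = float(value)
    except (TypeError, ValueError):
        return None
    # Tesseract reports -1 for rows that are not recognized words (blocks, lines, ...).
    return None if conf < 0 else conf


raw = ["96", "-1", "87.5", 42, "n/a"]
print([parse_confidence(v) for v in raw])  # [96.0, None, 87.5, 42.0, None]
```

Normalizing to `float` once, right after OCR, means later formatting such as `f"{conf:.1f}"` behaves the same regardless of how the values were originally typed.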
Capability Analysis
Type: OpenClaw Skill
Name: image-ocr-reader
Version: 1.0.0
The skill bundle is classified as benign. All files, including `SKILL.md` and `image_ocr_reader.py`, align with the stated purpose of an 'Image OCR Reader'. The `SKILL.md` provides standard installation instructions for system dependencies (tesseract-ocr) and Python libraries, without any prompt injection attempts. The Python code in `image_ocr_reader.py` uses standard libraries for image processing and OCR, reads user-specified image files, and optionally writes extracted text to a user-specified output file, all of which are legitimate operations for this type of tool. There is no evidence of data exfiltration, unauthorized network activity, persistence mechanisms, or other malicious intent.
Capability Assessment
Purpose & Capability
Name/description, SKILL.md, _meta.json, and the included Python code all implement local OCR using Tesseract (pytesseract + Pillow). Required binaries (tesseract) and Python packages match the stated purpose.
Instruction Scope
Runtime instructions and code only read the user-provided image file and optionally write a user-specified output file. There are no references to external endpoints, environment secrets, or unrelated system paths in the SKILL.md or code.
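
Because the skill writes to whatever output path it is given, a cautious caller might wrap the write step in a guard. The following is a sketch of that idea only; `write_text_output` and its `overwrite` parameter are hypothetical and do not describe how the skill itself writes files.

```python
from pathlib import Path


def write_text_output(text: str, output_path: str, overwrite: bool = False) -> Path:
    """Write OCR text to output_path, refusing to clobber an existing file by default."""
    path = Path(output_path).expanduser().resolve()
    if path.exists() and not overwrite:
        raise FileExistsError(f"Refusing to overwrite existing file: {path}")
    # Create missing parent directories so a fresh output location works.
    path.parent.mkdir(parents=True, exist_ok=True)
    # UTF-8 keeps mixed Chinese and English text intact.
    path.write_text(text, encoding="utf-8")
    return path
```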
Install Mechanism
There is no automatic install spec (instruction-only for system/package installs). The SKILL.md instructs the user to install system tesseract and pip packages manually. This is low-risk but means the user must install external binaries (and Tesseract language data for Chinese) themselves.
Credentials
The skill requests no environment variables, credentials, or config paths. The needed components (tesseract binary, pytesseract, Pillow) are directly related to OCR functionality.
Persistence & Privilege
The skill does not request persistent presence (always:false) and does not alter other skills or global agent configuration. It only writes to a user-specified output file if requested.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install image-ocr-reader
  3. After installation, invoke the skill by name or use /image-ocr-reader
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
Initial release
Metadata
Slug image-ocr-reader
Version 1.0.0
License MIT
All-time Installs 9
Active Installs 9
Total Versions 1
Frequently Asked Questions

What is Image OCR Reader?

Image OCR Reader extracts text from images using OCR, with support for Chinese and English, in common formats such as jpg, png, and jpeg. It is an AI Agent Skill for Claude Code / OpenClaw, with 714 downloads so far.

How do I install Image OCR Reader?

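Run "/install image-ocr-reader" in the OpenClaw or Claude Code chat to install the skill. The Tesseract binary, its Chinese language data, and the Python packages (pytesseract, Pillow) are not installed automatically and must be set up manually, as noted under Install Mechanism.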

Is Image OCR Reader free?

Yes. Image OCR Reader is free and open source; its README lists an MIT license. You can download, install, and use it at no cost.

Which platforms does Image OCR Reader support?

Image OCR Reader is cross-platform and runs anywhere OpenClaw / Claude Code is available.

Who created Image OCR Reader?

It is built and maintained by igetmm (@igetmm); the current version is v1.0.0.
