
Image OCR Reader

Name: Image OCR Reader
Author: igetmm

by igetmm · GitHub ↗ · v1.0.0

cross-platform ✓ Security Clean

714 Downloads

Install in OpenClaw

/install image-ocr-reader

Description

Extract text from images using OCR with support for Chinese and English in common formats like jpg, png, and jpeg.

README (SKILL.md)

Image OCR Reader

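A skill that extracts text from images, with support for Chinese and English recognition.

Features

Extracts text from image files (OCR)
Recognizes mixed Chinese and English text
Uses the Tesseract OCR engine
Works with common image formats such as jpg, png, and jpeg

Dependencies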

Python 3
tesseract-ocr
pytesseract
Pillow

Installation

1. Install system dependencies

# Ubuntu/Debian
sudo apt-get install tesseract-ocr

# macOS
brew install tesseract

# CentOS/RHEL
sudo yum install tesseract

2. Install Python dependencies

pip install pytesseract Pillow
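Because the system binary and the Python packages are installed separately, it can help to confirm that all three are visible before running the skill. The following is a minimal sketch using only the standard library; the function name `check_dependencies` is hypothetical and not part of the skill.

```python
import importlib.util
import shutil


def check_dependencies():
    """Report which of the README's dependencies are visible from this interpreter."""
    return {
        # The Tesseract engine is a system binary that pytesseract shells out to.
        "tesseract": shutil.which("tesseract") is not None,
        # find_spec looks a module up without importing it.
        "pytesseract": importlib.util.find_spec("pytesseract") is not None,
        # Pillow is imported under the module name "PIL".
        "Pillow": importlib.util.find_spec("PIL") is not None,
    }


if __name__ == "__main__":
    for name, found in check_dependencies().items():
        print(f"{name}: {'found' if found else 'MISSING'}")
```

Any entry reported as missing points back to the matching install step above.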

Usage

Command line

python3 image_ocr_reader.py --file /path/to/image.jpg
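The source of `image_ocr_reader.py` is not reproduced on this page, so the sketch below only illustrates how a command line with the two flags mentioned here (`--file` in the command above, `--output` in the usage guidance) could be parsed. The function name `build_parser`, the help strings, and the default values are assumptions.

```python
import argparse


def build_parser():
    """Hypothetical parser; only --file and --output are named on this page."""
    parser = argparse.ArgumentParser(description="Extract text from an image with OCR.")
    parser.add_argument("--file", required=True, help="path to the input image")
    parser.add_argument("--output", default=None, help="optional path for the extracted text")
    return parser


# Parse an explicit argument list so the example runs without a real command line.
args = build_parser().parse_args(["--file", "/path/to/image.jpg"])
print(args.file, args.output)
```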

Python API

from image_ocr_reader import extract_text

text = extract_text("/path/to/image.jpg")
print(text)
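For readers curious what such a function might do internally, here is a minimal sketch built on `pytesseract.image_to_string` and Pillow's `Image.open`. It is not the skill's actual source: the name `extract_text_sketch` and the default language string `chi_sim+eng` are assumptions chosen to match the Chinese and English support described above.

```python
from pathlib import Path


def extract_text_sketch(image_path, lang="chi_sim+eng"):
    """Illustrative stand-in for the skill's extract_text; not its actual source."""
    path = Path(image_path)
    if not path.is_file():
        raise FileNotFoundError(f"image not found: {path}")

    # Imported lazily so the path check above works without the OCR stack installed.
    import pytesseract
    from PIL import Image

    with Image.open(path) as image:
        # "chi_sim+eng" asks Tesseract to combine the Simplified Chinese and English models.
        return pytesseract.image_to_string(image, lang=lang).strip()
```

Checking the path first gives a clear error for a mistyped file name instead of a failure deep inside the OCR call.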

Example output

Given an image that contains text, the skill outputs the extracted text.

Credits

Version: 1.0.0
Author: OpenClaw
License: MIT

Usage Guidance

This skill appears to do only local OCR with Tesseract and Pillow. Before installing or running it:

1) Manually install the tesseract binary and the appropriate language data (chi_sim) as the SKILL.md directs; without the Chinese traineddata, Chinese OCR will fail.
2) Install the Python packages (pytesseract, Pillow) in a controlled environment (virtualenv).
3) Note that --output will write to whatever path you provide; avoid running with paths you don't trust.
4) The code has no networking or secret access, so it won't exfiltrate data, but treat untrusted images cautiously (Pillow vulnerabilities are possible in edge cases).
5) If you plan to use the detailed mode, test it: the code returns confidence values as strings, which may cause minor formatting issues when printing. This is a quality issue, not malicious.
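The string-valued confidences mentioned in the last point can be normalized before printing. The sketch below assumes the detailed mode exposes parallel lists of words and confidences, shaped like the `text` and `conf` columns of `pytesseract.image_to_data`; the helper name `normalize_confidences` is hypothetical and the skill's real output format may differ.

```python
def normalize_confidences(words, confidences):
    """Pair each recognized word with a float confidence, dropping empty boxes."""
    results = []
    for word, conf in zip(words, confidences):
        value = float(conf)  # accepts "96", "96.5", 96, or 96.5
        # Tesseract reports -1 for layout boxes that carry no recognized word.
        if value < 0 or not word.strip():
            continue
        results.append((word, value))
    return results


# Sample data shaped like the "text" and "conf" columns of pytesseract.image_to_data.
sample = normalize_confidences(["", "Hello", "世界"], ["-1", "96", 91.5])
for word, conf in sample:
    print(f"{word}\t{conf:.1f}")
```

Converting to `float` once, at the boundary, lets later formatting such as `{conf:.1f}` work regardless of whether the values arrived as strings or numbers.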

Capability Analysis

Type: OpenClaw Skill
Name: image-ocr-reader
Version: 1.0.0

The skill bundle is classified as benign. All files, including `SKILL.md` and `image_ocr_reader.py`, align with the stated purpose of an 'Image OCR Reader'. The `SKILL.md` provides standard installation instructions for system dependencies (tesseract-ocr) and Python libraries, without any prompt injection attempts. The Python code in `image_ocr_reader.py` uses standard libraries for image processing and OCR, reads user-specified image files, and optionally writes extracted text to a user-specified output file, all of which are legitimate operations for this type of tool. There is no evidence of data exfiltration, unauthorized network activity, persistence mechanisms, or other malicious intent.

Capability Assessment

✓ Purpose & Capability

Name/description, SKILL.md, _meta.json, and the included Python code all implement local OCR using Tesseract (pytesseract + Pillow). Required binaries (tesseract) and Python packages match the stated purpose.

✓ Instruction Scope

Runtime instructions and code only read the user-provided image file and optionally write a user-specified output file. There are no references to external endpoints, environment secrets, or unrelated system paths in the SKILL.md or code.

ℹ Install Mechanism

There is no automatic install spec (instruction-only for system/package installs). The SKILL.md instructs the user to install system tesseract and pip packages manually. This is low-risk but means the user must install external binaries (and Tesseract language data for Chinese) themselves.
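Since the language data must be installed by hand, one way to confirm it is present is to read the output of `tesseract --list-langs`. The following is a sketch under that assumption; the helper names `parse_langs` and `installed_langs` are hypothetical. On Debian and Ubuntu the Simplified Chinese data is packaged as `tesseract-ocr-chi-sim`.

```python
import re
import shutil
import subprocess


def parse_langs(listing):
    """Parse the text printed by `tesseract --list-langs` into a set of codes."""
    codes = set()
    for line in listing.splitlines():
        line = line.strip()
        # Language codes look like "eng", "chi_sim", or "script/Latin"; the
        # header line contains spaces and is skipped by this pattern.
        if re.fullmatch(r"[\w/]+", line):
            codes.add(line)
    return codes


def installed_langs():
    """Return installed language codes, or an empty set if tesseract is absent."""
    if shutil.which("tesseract") is None:
        return set()
    result = subprocess.run(
        ["tesseract", "--list-langs"], capture_output=True, text=True, check=False
    )
    # Both streams are read because the listing's destination has varied by release.
    return parse_langs(result.stdout + "\n" + result.stderr)


if __name__ == "__main__":
    missing = {"chi_sim", "eng"} - installed_langs()
    print("missing language data:", sorted(missing) or "none")
```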

✓ Credentials

The skill requests no environment variables, credentials, or config paths. The needed components (tesseract binary, pytesseract, Pillow) are directly related to OCR functionality.

✓ Persistence & Privilege

The skill does not request persistent presence (always:false) and does not alter other skills or global agent configuration. It only writes to a user-specified output file if requested.
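Because the output file is written wherever the caller points it, a wrapper can confine writes to a chosen directory. This is a minimal sketch of that idea, not something the skill is reported to do; `resolve_output_path` and `base_dir` are hypothetical names, and `Path.is_relative_to` requires Python 3.9 or later.

```python
from pathlib import Path


def resolve_output_path(output, base_dir):
    """Return an absolute output path, refusing anything outside base_dir."""
    base = Path(base_dir).resolve()
    target = (base / output).resolve()
    # is_relative_to (Python 3.9+) rejects "../" escapes and absolute paths elsewhere.
    if not target.is_relative_to(base):
        raise ValueError(f"refusing to write outside {base}: {target}")
    return target
```

A caller would pass the validated path on as the `--output` value, so a stray `../` in user input cannot overwrite files elsewhere on disk.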

How to Use

Make sure OpenClaw is installed (local or Docker)
Run the install command in chat: /install image-ocr-reader
After installation, invoke the skill by name or use /image-ocr-reader
Provide required inputs per the skill's parameter spec and get structured output

Version History

v1.0.0

Initial release

Metadata

Slug image-ocr-reader

Version 1.0.0

License —

All-time Installs 9

Active Installs 9

Total Versions 1

Frequently Asked Questions

What is Image OCR Reader?

Extract text from images using OCR with support for Chinese and English in common formats like jpg, png, and jpeg. It is an AI Agent Skill for Claude Code / OpenClaw, with 714 downloads so far.

How do I install Image OCR Reader?

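Run "/install image-ocr-reader" in the OpenClaw or Claude Code chat. The skill has no automatic install spec, so the Tesseract binary, its language data, and the Python packages (pytesseract, Pillow) still have to be installed manually.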

Is Image OCR Reader free?

Yes. Image OCR Reader is free and open source (its README lists the MIT license), so you can download, install, and use it at no cost.

Which platforms does Image OCR Reader support?

Image OCR Reader is cross-platform and runs anywhere OpenClaw / Claude Code is available.

Who created Image OCR Reader?

It is built and maintained by igetmm (@igetmm); the current version is v1.0.0.
