Image OCR Reader

Author: igetmm · GitHub · v1.0.0
cross-platform ✓ Security check passed
Total downloads: 714
Favorites: 0
Current installs: 9
Versions: 1
Install in OpenClaw
/install image-ocr-reader
Description
Extract text from images using OCR with support for Chinese and English in common formats like jpg, png, and jpeg.
Usage instructions (SKILL.md)

Image OCR Reader

A skill that extracts text from images, with support for Chinese and English recognition.

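Features

  • Extracts text from image files (OCR)
  • Recognizes mixed Chinese and English text
  • Uses the Tesseract OCR engine
  • Works with common image formats such as jpg, png, and jpeg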

Dependencies

  • Python 3
  • tesseract-ocr
  • pytesseract
  • Pillow
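Before running the skill, it can help to confirm that all of the dependencies above are present. The following is a minimal sketch, not part of the skill itself; `check_dependencies` is a hypothetical helper name, and it assumes the Tesseract binary is installed under the name `tesseract` on `PATH`.

```python
import importlib.util
import shutil


def check_dependencies():
    """Return a mapping of dependency name -> whether it was found.

    Hypothetical helper for illustration; the skill does not ship it.
    """
    return {
        # Tesseract is a system binary, so look for it on PATH.
        "tesseract": shutil.which("tesseract") is not None,
        # find_spec locates a top-level module without importing it.
        "pytesseract": importlib.util.find_spec("pytesseract") is not None,
        # Pillow is imported under the package name "PIL".
        "Pillow": importlib.util.find_spec("PIL") is not None,
    }


for name, found in check_dependencies().items():
    print(f"{name}: {'ok' if found else 'MISSING'}")
```

Anything reported as MISSING can be installed with the commands in the Installation section.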

Installation

1. Install system dependencies

# Ubuntu/Debian
sudo apt-get install tesseract-ocr

# macOS
brew install tesseract

# CentOS/RHEL
sudo yum install tesseract

2. Install Python dependencies

pip install pytesseract Pillow
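Chinese recognition also needs the Tesseract language data for simplified Chinese (`chi_sim`), which the base packages above do not necessarily include (on Debian/Ubuntu it is typically packaged as `tesseract-ocr-chi-sim`). The sketch below checks which language packs are installed by calling `tesseract --list-langs`. The function names are hypothetical, and the parser assumes that every language name is a single token without spaces.

```python
import shutil
import subprocess


def parse_list_langs(output):
    """Extract language codes from `tesseract --list-langs` output."""
    langs = []
    for line in output.splitlines():
        line = line.strip()
        # Skip blank lines and the "List of available languages ..." header;
        # language codes such as "chi_sim" never contain spaces.
        if not line or " " in line:
            continue
        langs.append(line)
    return langs


def installed_languages():
    """Return installed Tesseract language codes, or [] if the binary is absent."""
    binary = shutil.which("tesseract")
    if binary is None:
        return []
    result = subprocess.run(
        [binary, "--list-langs"], capture_output=True, text=True, check=False
    )
    # Some Tesseract releases print the list to stderr rather than stdout.
    return parse_list_langs(result.stdout + "\n" + result.stderr)


def missing_languages(required=("chi_sim", "eng")):
    """Return the required language codes that are not installed."""
    available = set(installed_languages())
    return [lang for lang in required if lang not in available]
```

An empty list from `missing_languages()` suggests that both Chinese and English data are available.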

Usage

Command line

python3 image_ocr_reader.py --file /path/to/image.jpg
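The source of `image_ocr_reader.py` is not reproduced on this page, so the following is only a sketch of how such a command-line entry point could be wired with `argparse`. It assumes the `--file` flag shown above and the `--output` flag mentioned in the security review; `build_parser` and `main` are hypothetical names.

```python
import argparse


def build_parser():
    """Build a parser for the two flags this page mentions (assumed layout)."""
    parser = argparse.ArgumentParser(
        description="Extract text from an image with OCR."
    )
    parser.add_argument(
        "--file", required=True, help="Path to the input image (jpg, jpeg, png)."
    )
    parser.add_argument(
        "--output",
        default=None,
        help="Optional path to write the text to; prints to stdout if omitted.",
    )
    return parser


def main(argv=None):
    args = build_parser().parse_args(argv)
    # Imported here so the parser can be built without the OCR dependencies.
    from image_ocr_reader import extract_text

    text = extract_text(args.file)
    if args.output:
        # The text is written to exactly the path the user supplies.
        with open(args.output, "w", encoding="utf-8") as handle:
            handle.write(text)
    else:
        print(text)
```

Calling `main(["--file", "/path/to/image.jpg"])` would then mirror the command above.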

Python API

from image_ocr_reader import extract_text

text = extract_text("/path/to/image.jpg")
print(text)
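For readers curious about what happens behind `extract_text`, here is a rough sketch of an equivalent function built on `pytesseract.image_to_string`. It is an assumption about the approach, not the skill's actual code: the name `extract_text_sketch`, the default `lang="chi_sim+eng"`, and the suffix check are all illustrative.

```python
from pathlib import Path

# Formats named in the skill description (assumed to be checked by suffix).
SUPPORTED_SUFFIXES = {".jpg", ".jpeg", ".png"}


def extract_text_sketch(path, lang="chi_sim+eng"):
    """Run Tesseract on one image and return the recognized text."""
    image_path = Path(path)
    if not image_path.is_file():
        raise FileNotFoundError(f"No such image: {image_path}")
    if image_path.suffix.lower() not in SUPPORTED_SUFFIXES:
        raise ValueError(f"Unsupported image format: {image_path.suffix}")

    # Imported lazily so the path checks above work without the OCR stack.
    from PIL import Image
    import pytesseract

    with Image.open(image_path) as img:
        # "chi_sim+eng" asks Tesseract to use both language models together.
        return pytesseract.image_to_string(img, lang=lang).strip()
```

Joining language codes with `+` is how Tesseract is asked to recognize mixed Chinese and English text in a single pass.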

Example output

Given an image that contains text, the skill outputs the extracted text.


Credits

  • Version: 1.0.0
  • Author: OpenClaw
  • License: MIT
Security recommendations
This skill appears to do only local OCR with Tesseract and Pillow. Before installing or running it:
  1. Manually install the tesseract binary and the appropriate language data (chi_sim) as the SKILL.md directs; without the Chinese traineddata, Chinese OCR will fail.
  2. Install the Python packages (pytesseract, Pillow) in a controlled environment (virtualenv).
  3. Note that --output will write to whatever path you provide; avoid running with paths you don't trust.
  4. The code has no networking or secret access, so it won't exfiltrate data, but treat untrusted images cautiously (Pillow vulnerabilities are possible in edge cases).
  5. If you plan to use the detailed mode, test it first: the code returns confidence values as strings, which may cause minor formatting issues when printing. This is a quality issue, not a malicious one.
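The string-typed confidence values mentioned in the last recommendation can be normalized before printing. The sketch below assumes the detailed mode produces parallel lists of words and confidences, similar to the `text` and `conf` entries that `pytesseract.image_to_data` returns in dictionary form; the skill's real data layout is not shown on this page, and both function names are hypothetical.

```python
def normalize_confidence(value):
    """Coerce a confidence value to a float, or None if it is unusable."""
    try:
        conf = float(value)
    except (TypeError, ValueError):
        return None
    # Tesseract reports -1 for boxes that carry no recognized word.
    return conf if conf >= 0 else None


def format_words(words, confidences):
    """Pair each non-empty word with its confidence, formatted to one decimal."""
    lines = []
    for word, raw in zip(words, confidences):
        conf = normalize_confidence(raw)
        if not word.strip() or conf is None:
            continue
        lines.append(f"{word}\t{conf:.1f}")
    return lines


print(format_words(["Hello", "", "世界"], ["96", "-1", 88.5]))
```

Converting once at the boundary keeps later formatting and comparisons working the same way whether the values arrive as strings or numbers.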
Functional analysis
Type: OpenClaw Skill
Name: image-ocr-reader
Version: 1.0.0
The skill bundle is classified as benign. All files, including `SKILL.md` and `image_ocr_reader.py`, align with the stated purpose of an 'Image OCR Reader'. The `SKILL.md` provides standard installation instructions for system dependencies (tesseract-ocr) and Python libraries, without any prompt injection attempts. The Python code in `image_ocr_reader.py` uses standard libraries for image processing and OCR, reads user-specified image files, and optionally writes extracted text to a user-specified output file, all of which are legitimate operations for this type of tool. There is no evidence of data exfiltration, unauthorized network activity, persistence mechanisms, or other malicious intent.
Capability assessment
Purpose & Capability
Name/description, SKILL.md, _meta.json, and the included Python code all implement local OCR using Tesseract (pytesseract + Pillow). Required binaries (tesseract) and Python packages match the stated purpose.
Instruction Scope
Runtime instructions and code only read the user-provided image file and optionally write a user-specified output file. There are no references to external endpoints, environment secrets, or unrelated system paths in the SKILL.md or code.
Install Mechanism
There is no automatic install spec (instruction-only for system/package installs). The SKILL.md instructs the user to install system tesseract and pip packages manually. This is low-risk but means the user must install external binaries (and Tesseract language data for Chinese) themselves.
Credentials
The skill requests no environment variables, credentials, or config paths. The needed components (tesseract binary, pytesseract, Pillow) are directly related to OCR functionality.
Persistence & Privilege
The skill does not request persistent presence (always:false) and does not alter other skills or global agent configuration. It only writes to a user-specified output file if requested.
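How to use
  1. Make sure OpenClaw is installed (local or Docker deployment)
  2. Enter the install command in the chat box: /install image-ocr-reader
  3. After installation, invoke the skill by name or trigger it with /image-ocr-reader
  4. Provide the required inputs as described in the skill's parameter documentation to get structured output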
Version history
v1.0.0
Initial release
Metadata
Slug image-ocr-reader
Version 1.0.0
License
Total installs 9
Current installs 9
Historical versions 1
FAQ

What is Image OCR Reader?

Image OCR Reader extracts text from images using OCR, with support for Chinese and English in common formats like jpg, png, and jpeg. It is an AI Agent Skill plugin for Claude Code / OpenClaw and has been downloaded 714 times so far.

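How do I install Image OCR Reader?

Run the command "/install image-ocr-reader" in the OpenClaw or Claude Code chat box to install the skill in one step. The skill has no automatic install step for its dependencies, so the Tesseract binary, the Chinese language data, and the Python packages still have to be installed manually.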

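Is Image OCR Reader free?

Yes. Image OCR Reader is completely free and open source; it can be downloaded, installed, and used freely.

Which platforms does Image OCR Reader support?

Image OCR Reader is cross-platform and can be used in any environment where OpenClaw / Claude Code is deployed.

Who developed Image OCR Reader?

It is developed and maintained by igetmm (@igetmm). The current version is v1.0.0.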
