
Img Ocr

Name: Img Ocr
Author: ginntech

by Frank_Jin · GitHub ↗ · v1.0.1 · MIT-0

cross-platform ✓ Security Clean

Downloads: 110

Install in OpenClaw

/install img-ocr

Description

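Image text recognition tool based on Tesseract OCR. Use it when the user mentions image recognition, OCR, text extraction, image-to-text conversion, or recognizing the text in an image.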

README (SKILL.md)

img-ocr

Image text recognition tool based on Tesseract OCR.

Dependencies

Python3
pytesseract: pip3 install pytesseract
Pillow: pip3 install Pillow
Tesseract OCR: sudo apt install tesseract-ocr
Chinese lang pack: sudo apt install tesseract-ocr-chi-sim
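
Before running the script, it can help to confirm that the dependencies above are actually visible from your Python environment. The following is a minimal sketch, not part of the skill: the function name check_ocr_dependencies is hypothetical, and it assumes the tesseract binary is expected on PATH, which is where pytesseract looks by default.

```python
import importlib.util
import shutil


def check_ocr_dependencies():
    """Report which of the README's dependencies are visible from this interpreter."""
    return {
        # By default pytesseract invokes the `tesseract` binary found on PATH.
        "tesseract": shutil.which("tesseract") is not None,
        # find_spec returns None when a top-level module is not installed.
        "pytesseract": importlib.util.find_spec("pytesseract") is not None,
        # Pillow is imported under the module name "PIL".
        "Pillow": importlib.util.find_spec("PIL") is not None,
    }


if __name__ == "__main__":
    for name, found in check_ocr_dependencies().items():
        print(f"{name}: {'ok' if found else 'missing'}")
```

This sketch does not check the Chinese language pack; running tesseract --list-langs should show chi_sim once tesseract-ocr-chi-sim is installed.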

Quick Usage

python3 skills/img-ocr/scripts/ocr.py /path/to/image.jpg

Typical Scenarios

# Recognize Chinese + English
python3 skills/img-ocr/scripts/ocr.py screenshot.png

# Extract text from screenshot
python3 skills/img-ocr/scripts/ocr.py /path/to/screenshot.jpg
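
The listing does not reproduce ocr.py itself, so the following is only a sketch of how a script of this kind is commonly written with pytesseract and Pillow. The function names (build_lang, ocr_image, main) and the default language set chi_sim plus eng are assumptions based on the "Chinese + English" scenario above, not the author's actual code.

```python
import sys


def build_lang(codes):
    """Join Tesseract language codes with '+', the form Tesseract expects."""
    return "+".join(codes)


def ocr_image(path, codes=("chi_sim", "eng")):
    """Return the text recognized in the image at `path`."""
    # Imported lazily so the helpers stay importable without the OCR stack.
    import pytesseract
    from PIL import Image

    with Image.open(path) as img:
        return pytesseract.image_to_string(img, lang=build_lang(codes))


def main(argv):
    """Print recognized text for one image path; return a shell-style status."""
    if len(argv) != 2:
        print("usage: ocr.py IMAGE_PATH")
        return 1
    print(ocr_image(argv[1]))
    return 0


if __name__ == "__main__":
    main(sys.argv)
```

Passing several language codes lets Tesseract recognize mixed Chinese and English text in a single pass, at the cost of requiring every listed language pack to be installed.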

Usage Guidance

This skill appears to do exactly what it says: run a local Tesseract OCR pass on an image file and print the text. Before installing or running it:

1) Be prepared to install system packages (Tesseract and language packs) with sudo on Debian/Ubuntu.
2) Note the SKILL.md path mismatch: the docs reference skills/img-ocr/scripts/ocr.py, but the repo contains ocr.py at the root. Verify the correct path when invoking.
3) The script reads any filesystem path you pass it, so avoid giving it paths to sensitive files you don't want processed or exposed.
4) This skill does not transmit results over the network, but if you integrate it into automated workflows, confirm how the extracted text is handled downstream (where it might be sent or stored).
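
One way to deal with the path mismatch is to probe both locations before invoking the script. This is a minimal sketch under the assumption that the script lives at one of the two paths named in the guidance; find_ocr_script and CANDIDATES are hypothetical names, not part of the skill.

```python
from pathlib import Path

# Both candidates come from the listing: the README path and the repo root.
CANDIDATES = ("skills/img-ocr/scripts/ocr.py", "ocr.py")


def find_ocr_script(base_dir="."):
    """Return the first candidate that exists under base_dir, or None."""
    base = Path(base_dir)
    for relative in CANDIDATES:
        candidate = base / relative
        if candidate.is_file():
            return candidate
    return None
```

A caller can then fail early with a clear message when find_ocr_script returns None, instead of letting python3 report a missing file.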

Capability Analysis

Type: OpenClaw Skill
Name: img-ocr
Version: 1.0.1

The skill is a straightforward implementation of an OCR tool using the Tesseract engine. The Python script (ocr.py) uses standard libraries (pytesseract and Pillow) to process local image files and print the extracted text, with no evidence of network activity, data exfiltration, or malicious instructions in the documentation (SKILL.md).

Capability Assessment

✓ Purpose & Capability

Name/description (image OCR) align with the included code and instructions: the Python script uses pytesseract and Pillow to read an image path and print recognized text.

ℹ Instruction Scope

Instructions are narrowly scoped to installing Tesseract/pytesseract and running the OCR script, but SKILL.md references a different script path (skills/img-ocr/scripts/ocr.py) than the provided file (ocr.py). The installation steps use apt with sudo, which requires root privileges to install system packages; this is expected for Tesseract but should be noted.

✓ Install Mechanism

No install spec in the registry; dependency instructions are manual (pip and apt). No downloads from third-party URLs or archive extraction are present in the package.

✓ Credentials

No environment variables, credentials, or config paths are requested. The script only reads the image file path provided as an argument.
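
Because the script reads whatever path it is given, an automated workflow may want to filter paths before passing them on. The sketch below is one possible guard, not something the skill provides: is_allowed_image, the allowed_root parameter, and the suffix allow-list are all hypothetical and should be adapted to your setup.

```python
from pathlib import Path

# Hypothetical allow-list; extend it to the formats your Pillow/Tesseract build reads.
IMAGE_SUFFIXES = {".png", ".jpg", ".jpeg", ".tif", ".tiff", ".bmp"}


def is_allowed_image(path, allowed_root):
    """True if path resolves to a location inside allowed_root with an image suffix."""
    # resolve() collapses ".." segments and symlinks before the comparison.
    resolved = Path(path).resolve()
    root = Path(allowed_root).resolve()
    return root in resolved.parents and resolved.suffix.lower() in IMAGE_SUFFIXES
```

Resolving both paths first means a request such as uploads/../secrets/key.png is rejected even though it starts with the allowed directory name.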

✓ Persistence & Privilege

Skill is not always-enabled and does not attempt to modify agent/system configs. It does not request persistent privileges or autonomous elevated presence.

How to Use

Make sure OpenClaw is installed (local or Docker)
Run the install command in chat: /install img-ocr
After installation, invoke the skill by name or use /img-ocr
Provide the required input (the path to an image file) and the skill prints the recognized text

Version History

v1.0.1

Updated the English description

v1.0.0

Image text recognition tool based on Tesseract OCR, with support for Chinese and English

Metadata

Slug img-ocr

Version 1.0.1

License MIT-0

All-time Installs 0

Active Installs 0

Total Versions 2

Frequently Asked Questions

What is Img Ocr?

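Img Ocr is an image text recognition tool based on Tesseract OCR. It is meant to be used when the user mentions image recognition, OCR, text extraction, image-to-text conversion, or recognizing the text in an image. It is an AI Agent Skill for Claude Code / OpenClaw, with 110 downloads so far.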

How do I install Img Ocr?

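Run "/install img-ocr" in the OpenClaw or Claude Code chat to install the skill. The dependencies listed in the README (Tesseract OCR, the Chinese language pack, pytesseract, and Pillow) still have to be installed manually with apt and pip.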

Is Img Ocr free?

Yes, Img Ocr is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Img Ocr support?

Img Ocr is listed as cross-platform and runs anywhere OpenClaw / Claude Code is available, provided Tesseract OCR is installed. The README gives install commands for Debian/Ubuntu (apt) only.

Who created Img Ocr?

It is built and maintained by Frank_Jin (@ginntech); the current version is v1.0.1.
