When dealing with text within an image, the system automatically recognizes it as an OCR (Optical Character Recognition) task and applies the corresponding capabilities.

Name: When dealing with text within an image, the system automatically recognizes it as an OCR (Optical Character Recognition) task and applies the corresponding capabilities.
Author: liuzhengmcc-debug

by liuzhengmcc-debug · GitHub ↗ · v1.0.0 · MIT-0

cross-platform ✓ Security Clean

191

Downloads

Stars

Active Installs

Versions

Install in OpenClaw

/install ocr-tool

Description

OCR (Optical Character Recognition) tool using Tesseract for extracting text from images. Use when: (1) processing screenshots, charts, or documents in image...

Usage Guidance

This skill appears to do exactly what it claims: run Tesseract locally and parse the resulting text. Before installing, ensure you have (or want) a local Tesseract installation and any required language packs (chi_sim) for Chinese OCR. Review and, if desired, run the included test_ocr.py to verify behavior on sample images. Be mindful of privacy: OCR will read any image you feed it (including potentially sensitive content) and produce plain-text output; do not point it at images you don't want converted or stored. If you deploy this in an automated agent, note the agent can invoke local tesseract commands—there are no network exfiltration steps in the provided code, but still avoid giving it images that contain secrets you wouldn't want processed.

Capability Analysis

Type: OpenClaw Skill Name: ocr-tool Version: 1.0.0 The ocr-tool skill bundle is a legitimate utility designed to wrap the Tesseract OCR engine for text extraction from images, specifically optimized for financial announcements and charts. The bundle includes a Bash script (ocr.sh), a Python script (ocr_extract.py), and comprehensive documentation (SKILL.md) that align perfectly with its stated purpose. Analysis of the code and instructions found no evidence of malicious intent, data exfiltration, or unauthorized command execution; the use of subprocesses and shell commands is appropriate for interfacing with the Tesseract CLI tool.

Capability Assessment

✓ Purpose & Capability

Name/description, required binaries (tesseract), install entries (apt/brew/choco/winget), and included scripts all match an OCR tool focused on Chinese/English financial images.

✓ Instruction Scope

SKILL.md and example code only instruct running tesseract, local preprocessing, and local parsing of OCR text (grep/regex). The skill references a local media directory for integration (HOME/.openclaw/media/inbound/) which is consistent with OpenClaw integration and the stated use case.

✓ Install Mechanism

Install metadata points to standard package managers (brew/apt/choco/winget) for installing Tesseract; no arbitrary downloads or extracted archives are used.

✓ Credentials

The skill requires no environment variables, no credentials, and no config paths beyond expecting a local tesseract binary and optional tessdata language files—proportionate for OCR functionality.

✓ Persistence & Privilege

always is false and the skill doesn't request permanent or cross-skill configuration changes. It runs local commands and scripts without elevating privileges or modifying other skills.

How to Use

Make sure OpenClaw is installed (local or Docker)
Run the install command in chat: /install ocr-tool
After installation, invoke the skill by name or use /ocr-tool
Provide required inputs per the skill's parameter spec and get structured output

Version History

v1.0.0

- Initial release of the ocr-tool skill for extracting text from images using Tesseract OCR. - Supports English, Chinese (Simplified and Traditional), Japanese, and Korean text extraction. - Optimized for processing financial charts, announcements, reports, screenshots, and tabular data images. - Includes guidance for setup, advanced usage (batch processing, preprocessing), and integration examples (e.g., financial analysis scripts, Telegram workflows). - Troubleshooting tips and best practices for improving OCR accuracy provided.

Metadata

Slug ocr-tool

Version 1.0.0

License MIT-0

All-time Installs 0

Active Installs 0

Total Versions 1

Frequently Asked Questions

What is When dealing with text within an image, the system automatically recognizes it as an OCR (Optical Character Recognition) task and applies the corresponding capabilities.?

OCR (Optical Character Recognition) tool using Tesseract for extracting text from images. Use when: (1) processing screenshots, charts, or documents in image... It is an AI Agent Skill for Claude Code / OpenClaw, with 191 downloads so far.

How do I install When dealing with text within an image, the system automatically recognizes it as an OCR (Optical Character Recognition) task and applies the corresponding capabilities.?

Run "/install ocr-tool" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is When dealing with text within an image, the system automatically recognizes it as an OCR (Optical Character Recognition) task and applies the corresponding capabilities. free?

Yes, When dealing with text within an image, the system automatically recognizes it as an OCR (Optical Character Recognition) task and applies the corresponding capabilities. is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does When dealing with text within an image, the system automatically recognizes it as an OCR (Optical Character Recognition) task and applies the corresponding capabilities. support?

When dealing with text within an image, the system automatically recognizes it as an OCR (Optical Character Recognition) task and applies the corresponding capabilities. is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created When dealing with text within an image, the system automatically recognizes it as an OCR (Optical Character Recognition) task and applies the corresponding capabilities.?

It is built and maintained by liuzhengmcc-debug (@liuzhengmcc-debug); the current version is v1.0.0.

More Skills