← Back to Skills Marketplace
tom859174-sketch

Tesseract OCR文字识别

by flyingants · GitHub ↗ · v1.0.1 · MIT-0
cross-platform ⚠ suspicious
239
Downloads
0
Stars
0
Active Installs
2
Versions
Install in OpenClaw
/install tesseract-ocr-skill
Description
基于Tesseract引擎的OCR文字识别技能,支持中文、英文、中英混合三种模式,输出text/structured/question_answer三种格式。
Usage Guidance
This skill appears to be a local Tesseract OCR tool and does not contact external servers or ask for credentials, but there are important practical issues to consider before installing: - Install Tesseract on the host first (the SKILL.md mentions it, but the skill's metadata did not declare this dependency). Without the Tesseract binary the skill will fail. - The included Python has bugs: a duplicated function, a missing import (os) and a relative import that may fail when run as a script. Expect runtime errors unless the code is fixed. - Because the package metadata is inconsistent (version mismatch, differing descriptions), treat this as low-quality packaging rather than a polished official skill. - Run and review the code in an isolated/sandboxed environment before enabling autonomous invocation. Test the CLI on non-sensitive images to confirm behavior. If you need a stable OCR skill, prefer one that explicitly declares system dependencies (Tesseract binary) in metadata, has matching versions, and has been tested to run without code errors.
Capability Analysis
Type: OpenClaw Skill Name: tesseract-ocr-skill Version: 1.0.1 The skill provides legitimate OCR functionality using the Tesseract engine, specifically tailored for extracting structured text and Q&A pairs from images (e.g., TCM textbooks). The code in `scripts/tesseract_ocr.py` and `tesseract_ocr_skill.py` uses standard libraries like pytesseract and Pillow without any evidence of malicious intent, data exfiltration, or prompt injection. A minor bug exists in `scripts/tesseract_ocr.py` where a duplicate function definition uses the `os` module without importing it.
Capability Assessment
Purpose & Capability
The README and code clearly require the Tesseract OCR engine to be installed, but the skill metadata lists no required binaries/config paths. SKILL.json describes TCM (traditional Chinese medicine) material focus while SKILL.md is more generic; version in SKILL.json (1.0.0) differs from registry (1.0.1). These inconsistencies indicate sloppy packaging and incomplete metadata.
Instruction Scope
SKILL.md instructs running the included Python CLI on a local image path (no external endpoints). The runtime instructions stay within the stated purpose (OCR). However, the included code has issues: a duplicated extract_text_with_tesseract definition (the second uses os.path.exists but os is not imported) and potential import path problems (relative import in tesseract_ocr_skill.py). These will likely cause runtime errors and make actual behavior unpredictable.
Install Mechanism
There is no install spec (instruction-only + bundled scripts). This is low-risk from an installer perspective — nothing is downloaded at install time. The skill does expect external system dependency (Tesseract engine), but that is not declared in the install metadata.
Credentials
The skill requests no environment variables or credentials, and the code does not access environment secrets or external network endpoints. This is proportionate for a local OCR tool. Note: the missing declaration of the required Tesseract binary reduces clarity but is not a credential concern.
Persistence & Privilege
The skill does not request persistent/always-on presence; default invocation settings are used (agent may invoke autonomously). The skill does not attempt to modify other skill configs or system-wide settings.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install tesseract-ocr-skill
  3. After installation, invoke the skill by name or use /tesseract-ocr-skill
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.1
v1.0.1 - Bump version
v1.0.0
v1.0.0 - First release
Metadata
Slug tesseract-ocr-skill
Version 1.0.1
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 2
Frequently Asked Questions

What is Tesseract OCR文字识别?

基于Tesseract引擎的OCR文字识别技能,支持中文、英文、中英混合三种模式,输出text/structured/question_answer三种格式。 It is an AI Agent Skill for Claude Code / OpenClaw, with 239 downloads so far.

How do I install Tesseract OCR文字识别?

Run "/install tesseract-ocr-skill" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Tesseract OCR文字识别 free?

Yes, Tesseract OCR文字识别 is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Tesseract OCR文字识别 support?

Tesseract OCR文字识别 is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Tesseract OCR文字识别?

It is built and maintained by flyingants (@tom859174-sketch); the current version is v1.0.1.

💬 Comments