Tesseract OCR文字识别 (Tesseract OCR Text Recognition)

Name: Tesseract OCR文字识别
Author: tom859174-sketch

by flyingants · GitHub ↗ · v1.0.1 · MIT-0

cross-platform ⚠ suspicious

Downloads: 239

Install in OpenClaw

/install tesseract-ocr-skill

Description

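An OCR text-recognition skill based on the Tesseract engine. It supports three modes (Chinese, English, and mixed Chinese-English) and three output formats: text, structured, and question_answer.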
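As a rough illustration of the three modes and three output formats described above, the sketch below maps each mode to a Tesseract language string and validates the requested format. The mode names (`chinese`, `english`, `mixed`) and the helper `build_ocr_options` are hypothetical and are not taken from the skill's code; `chi_sim` and `eng` are standard Tesseract language data names, and `chi_sim+eng` is the usual way to combine them.

```python
# Hypothetical mode names; the skill's real option names are not shown in this listing.
MODE_TO_LANG = {
    "chinese": "chi_sim",      # Simplified Chinese language data
    "english": "eng",          # English language data
    "mixed": "chi_sim+eng",    # Tesseract joins multiple languages with "+"
}

# Output formats named in the skill description.
OUTPUT_FORMATS = ("text", "structured", "question_answer")


def build_ocr_options(mode, output_format):
    """Validate the inputs and return the options an OCR call would need."""
    if mode not in MODE_TO_LANG:
        raise ValueError(f"unknown mode: {mode!r}")
    if output_format not in OUTPUT_FORMATS:
        raise ValueError(f"unknown output format: {output_format!r}")
    return {"lang": MODE_TO_LANG[mode], "output_format": output_format}


if __name__ == "__main__":
    options = build_ocr_options("mixed", "structured")
    print(options)
```

With pytesseract installed, the resulting `lang` value could be passed as `pytesseract.image_to_string(image, lang=options["lang"])`; how the skill itself builds the structured and question_answer outputs is not documented here.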

Usage Guidance

This skill appears to be a local Tesseract OCR tool and does not contact external servers or ask for credentials, but there are important practical issues to consider before installing:

- Install Tesseract on the host first (the SKILL.md mentions it, but the skill's metadata did not declare this dependency). Without the Tesseract binary the skill will fail.
- The included Python has bugs: a duplicated function, a missing import (os) and a relative import that may fail when run as a script. Expect runtime errors unless the code is fixed.
- Because the package metadata is inconsistent (version mismatch, differing descriptions), treat this as low-quality packaging rather than a polished official skill.
- Run and review the code in an isolated/sandboxed environment before enabling autonomous invocation. Test the CLI on non-sensitive images to confirm behavior.

If you need a stable OCR skill, prefer one that explicitly declares system dependencies (Tesseract binary) in metadata, has matching versions, and has been tested to run without code errors.
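Since the skill fails without the Tesseract binary, a preflight check before the first run may save some debugging. The following is a minimal sketch using only the standard library; the helper names `find_tesseract` and `require_tesseract` are illustrative and are not part of the skill.

```python
import shutil


def find_tesseract(binary_name="tesseract"):
    """Return the full path of the binary if it is on PATH, otherwise None."""
    return shutil.which(binary_name)


def require_tesseract(binary_name="tesseract"):
    """Return the binary path, or raise a clear error if it is missing."""
    path = find_tesseract(binary_name)
    if path is None:
        raise RuntimeError(
            f"{binary_name!r} was not found on PATH; install the Tesseract "
            "engine before using the skill."
        )
    return path


if __name__ == "__main__":
    location = find_tesseract()
    if location is None:
        print("Tesseract is not installed or not on PATH.")
    else:
        print(f"Tesseract found at {location}")
```

This only confirms that the executable is reachable. It does not verify that the Chinese language data is installed, which the Chinese and mixed modes would presumably also need.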

Capability Analysis

Type: OpenClaw Skill
Name: tesseract-ocr-skill
Version: 1.0.1

The skill provides legitimate OCR functionality using the Tesseract engine, specifically tailored for extracting structured text and Q&A pairs from images (e.g., TCM textbooks). The code in `scripts/tesseract_ocr.py` and `tesseract_ocr_skill.py` uses standard libraries like pytesseract and Pillow without any evidence of malicious intent, data exfiltration, or prompt injection. A minor bug exists in `scripts/tesseract_ocr.py` where a duplicate function definition uses the `os` module without importing it.

Capability Assessment

⚠ Purpose & Capability

The README and code clearly require the Tesseract OCR engine to be installed, but the skill metadata lists no required binaries/config paths. SKILL.json describes TCM (traditional Chinese medicine) material focus while SKILL.md is more generic; version in SKILL.json (1.0.0) differs from registry (1.0.1). These inconsistencies indicate sloppy packaging and incomplete metadata.
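A version mismatch like the one described above is easy to catch mechanically. The sketch below compares the version declared in a SKILL.json document with the version the registry reports. It assumes the file stores the value under a top-level `version` key, which this listing does not confirm, and `version_mismatch` is a hypothetical helper.

```python
import json


def version_mismatch(skill_json_text, registry_version):
    """Return (declared, registry) if the versions differ, otherwise None.

    Assumes SKILL.json keeps its version under a top-level "version" key.
    """
    declared = json.loads(skill_json_text).get("version")
    if declared != registry_version:
        return (declared, registry_version)
    return None


if __name__ == "__main__":
    # Values mirror the mismatch reported in the assessment above.
    sample = '{"name": "tesseract-ocr-skill", "version": "1.0.0"}'
    print(version_mismatch(sample, "1.0.1"))
```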

ℹ Instruction Scope

SKILL.md instructs running the included Python CLI on a local image path (no external endpoints). The runtime instructions stay within the stated purpose (OCR). However, the included code has issues: a duplicated extract_text_with_tesseract definition (the second uses os.path.exists but os is not imported) and potential import path problems (relative import in tesseract_ocr_skill.py). These will likely cause runtime errors and make actual behavior unpredictable.
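The duplicated definition matters because Python silently keeps only the last `def` bound to a name, so the second `extract_text_with_tesseract` (the one that uses `os` without importing it) is the one that would run. A small static check can flag this kind of problem before execution. The sketch below uses the standard `ast` module; the `SAMPLE` source is an illustrative stand-in, not the skill's actual code.

```python
import ast
from collections import Counter


def duplicate_functions(source):
    """Return sorted names of top-level functions defined more than once."""
    tree = ast.parse(source)
    names = [
        node.name
        for node in tree.body
        if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef))
    ]
    counts = Counter(names)
    return sorted(name for name, count in counts.items() if count > 1)


# Illustrative stand-in for the reported bug: two definitions share a name,
# and the second refers to os although the module never imports it.
SAMPLE = '''
def extract_text_with_tesseract(path):
    return "first definition"

def extract_text_with_tesseract(path):
    return os.path.exists(path)
'''

if __name__ == "__main__":
    print(duplicate_functions(SAMPLE))
```

The check parses the source without running it, so it is safe to apply to untrusted scripts. It does not detect the missing import or the relative-import problem; a linter such as pyflakes would typically report an undefined name like `os`.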

✓ Install Mechanism

There is no install spec (instruction-only + bundled scripts). This is low-risk from an installer perspective: nothing is downloaded at install time. The skill does expect an external system dependency (the Tesseract engine), but that is not declared in the install metadata.

✓ Credentials

The skill requests no environment variables or credentials, and the code does not access environment secrets or external network endpoints. This is proportionate for a local OCR tool. Note: the missing declaration of the required Tesseract binary reduces clarity but is not a credential concern.

✓ Persistence & Privilege

The skill does not request persistent/always-on presence; default invocation settings are used (agent may invoke autonomously). The skill does not attempt to modify other skill configs or system-wide settings.

How to Use

1. Make sure OpenClaw is installed (local or Docker).
2. Run the install command in chat: /install tesseract-ocr-skill
3. After installation, invoke the skill by name or use /tesseract-ocr-skill
4. Provide the required inputs per the skill's parameter spec and get structured output.

Version History

v1.0.1 - Bump version

v1.0.0 - First release

Metadata

Slug tesseract-ocr-skill

Version 1.0.1

License MIT-0

All-time Installs 0

Active Installs 0

Total Versions 2

Frequently Asked Questions

What is Tesseract OCR文字识别?

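It is an OCR text-recognition skill based on the Tesseract engine. It supports three modes (Chinese, English, and mixed Chinese-English) and three output formats: text, structured, and question_answer. It is an AI Agent Skill for Claude Code / OpenClaw, with 239 downloads so far.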

How do I install Tesseract OCR文字识别?

Run "/install tesseract-ocr-skill" in the OpenClaw or Claude Code chat to install the skill in one step. The Tesseract engine itself must already be installed on the host, because the skill does not install or declare it.

Is Tesseract OCR文字识别 free?

Yes, Tesseract OCR文字识别 is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Tesseract OCR文字识别 support?

Tesseract OCR文字识别 is cross-platform and runs anywhere OpenClaw / Claude Code is available.

Who created Tesseract OCR文字识别?

It is built and maintained by flyingants (@tom859174-sketch); the current version is v1.0.1.
