← 返回 Skills 市场
Tesseract OCR文字识别
作者
flyingants
· GitHub ↗
· v1.0.1
· MIT-0
239
总下载
0
收藏
0
当前安装
2
版本数
在 OpenClaw 中安装
/install tesseract-ocr-skill
功能描述
基于Tesseract引擎的OCR文字识别技能,支持中文、英文、中英混合三种模式,输出text/structured/question_answer三种格式。
安全使用建议
This skill appears to be a local Tesseract OCR tool and does not contact external servers or ask for credentials, but there are important practical issues to consider before installing:
- Install Tesseract on the host first (the SKILL.md mentions it, but the skill's metadata did not declare this dependency). Without the Tesseract binary the skill will fail.
- The included Python has bugs: a duplicated function, a missing import (os) and a relative import that may fail when run as a script. Expect runtime errors unless the code is fixed.
- Because the package metadata is inconsistent (version mismatch, differing descriptions), treat this as low-quality packaging rather than a polished official skill.
- Run and review the code in an isolated/sandboxed environment before enabling autonomous invocation. Test the CLI on non-sensitive images to confirm behavior.
If you need a stable OCR skill, prefer one that explicitly declares system dependencies (Tesseract binary) in metadata, has matching versions, and has been tested to run without code errors.
功能分析
Type: OpenClaw Skill
Name: tesseract-ocr-skill
Version: 1.0.1
The skill provides legitimate OCR functionality using the Tesseract engine, specifically tailored for extracting structured text and Q&A pairs from images (e.g., TCM textbooks). The code in `scripts/tesseract_ocr.py` and `tesseract_ocr_skill.py` uses standard libraries like pytesseract and Pillow without any evidence of malicious intent, data exfiltration, or prompt injection. A minor bug exists in `scripts/tesseract_ocr.py` where a duplicate function definition uses the `os` module without importing it.
能力评估
Purpose & Capability
The README and code clearly require the Tesseract OCR engine to be installed, but the skill metadata lists no required binaries/config paths. SKILL.json describes TCM (traditional Chinese medicine) material focus while SKILL.md is more generic; version in SKILL.json (1.0.0) differs from registry (1.0.1). These inconsistencies indicate sloppy packaging and incomplete metadata.
Instruction Scope
SKILL.md instructs running the included Python CLI on a local image path (no external endpoints). The runtime instructions stay within the stated purpose (OCR). However, the included code has issues: a duplicated extract_text_with_tesseract definition (the second uses os.path.exists but os is not imported) and potential import path problems (relative import in tesseract_ocr_skill.py). These will likely cause runtime errors and make actual behavior unpredictable.
Install Mechanism
There is no install spec (instruction-only + bundled scripts). This is low-risk from an installer perspective — nothing is downloaded at install time. The skill does expect external system dependency (Tesseract engine), but that is not declared in the install metadata.
Credentials
The skill requests no environment variables or credentials, and the code does not access environment secrets or external network endpoints. This is proportionate for a local OCR tool. Note: the missing declaration of the required Tesseract binary reduces clarity but is not a credential concern.
Persistence & Privilege
The skill does not request persistent/always-on presence; default invocation settings are used (agent may invoke autonomously). The skill does not attempt to modify other skill configs or system-wide settings.
如何使用
- 确保已安装 OpenClaw(本地或 Docker 部署)
- 在对话框中输入安装命令:
/install tesseract-ocr-skill - 安装完成后,直接呼叫该 Skill 的名称或使用
/tesseract-ocr-skill触发 - 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.1
v1.0.1 - Bump version
v1.0.0
v1.0.0 - First release
元数据
常见问题
Tesseract OCR文字识别 是什么?
基于Tesseract引擎的OCR文字识别技能,支持中文、英文、中英混合三种模式,输出text/structured/question_answer三种格式。 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 239 次。
如何安装 Tesseract OCR文字识别?
在 OpenClaw 或 Claude Code 对话框中运行命令「/install tesseract-ocr-skill」即可一键安装,无需额外配置。
Tesseract OCR文字识别 是免费的吗?
是的,Tesseract OCR文字识别 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。
Tesseract OCR文字识别 支持哪些平台?
Tesseract OCR文字识别 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。
谁开发了 Tesseract OCR文字识别?
由 flyingants(@tom859174-sketch)开发并维护,当前版本 v1.0.1。
推荐 Skills