← 返回 Skills 市场

Ocr Scanner Image

Name: Ocr Scanner Image
Author: kaarl92

作者 kaarl92 · GitHub ↗ · v1.0.1 · MIT-0

cross-platform ⚠ suspicious

238

总下载

当前安装

版本数

在 OpenClaw 中安装

/install ocr-scanner-image

功能描述

Perform OCR on image files (jpg, png, bmp, gif, tiff) using the system's `tesseract` binary and return extracted plain text.

安全使用建议

This skill's primary wrapper (scripts/ocr.sh) performs local OCR using tesseract and is consistent with the description—use this if you want offline processing. Before installing or running: (1) Ensure you have tesseract and pdftoppm (or equivalent) installed—the SKILL metadata does not declare these but the scripts depend on them. (2) Inspect and avoid running scripts/example.py on sensitive images: it uploads files to the external ocr.space API using a public demo key, which will transmit your image contents off-host. (3) If you only want local OCR, delete or ignore example.py and run ocr.sh directly. (4) Be cautious about adding the optional alias to your shell config—it's safe but modifies your shell environment. If you want more assurance, ask the skill author to (a) declare required binaries in metadata, (b) remove or clearly document the network-upload example, or (c) provide a pure-local example only.

功能分析

Type: OpenClaw Skill Name: ocr-scanner-image Version: 1.0.1 The skill provides OCR functionality using either a local Tesseract binary (scripts/ocr.sh) or the external OCR.space API (scripts/example.py). While the Python script sends file data to an external endpoint (api.ocr.space), this behavior is clearly documented in references/api_reference.md and is consistent with the tool's stated purpose. The bash script handles local processing safely with proper quoting, and no evidence of malicious intent, hidden exfiltration, or prompt injection was found.

能力评估

ℹ Purpose & Capability

The stated purpose is local OCR via the system tesseract binary and the provided ocr.sh wrapper implements that (and also PDF→PNG conversion via pdftoppm). However, the package also contains scripts/example.py which uses the external ocr.space API (network call) and a demo API key; that behavior is not described in SKILL.md and is not necessary for the stated local-tesseract purpose.

⚠ Instruction Scope

SKILL.md instructs the agent/user to run the included bash wrapper (ocr.sh) which operates locally and prints output to stdout. It does not mention uploading files to external services. The presence of example.py that will POST local files to a remote OCR API means there is code in the skill that would transmit image contents off-host—this is out-of-band relative to the SKILL.md guidance and is a potential privacy/exfiltration risk if run without understanding.

✓ Install Mechanism

There is no install spec (instruction-only), which minimizes installation risk. The skill ships scripts that will run from disk, but nothing is downloaded or installed automatically.

ℹ Credentials

The skill requests no environment variables or credentials. The example Python script embeds a public demo API key ('helloworld') which is not secret but does cause local files to be uploaded to a third-party service if used. Also, SKILL.md and scripts implicitly require system binaries (tesseract, pdftoppm, possibly pdftoppm/png conversion tools) even though the registry metadata lists none—this omission is a proportionality/information gap to be aware of.

✓ Persistence & Privilege

The skill is not always-enabled, does not request elevated or persistent agent privileges, and only suggests an optional shell alias (editing ~/.bashrc) if the user chooses to do so.

如何使用

确保已安装 OpenClaw（本地或 Docker 部署）
在对话框中输入安装命令：/install ocr-scanner-image
安装完成后，直接呼叫该 Skill 的名称或使用 /ocr-scanner-image 触发
根据 Skill 的参数说明提供必要输入，即可获得结构化输出

版本历史

v1.0.1

- Implemented a working OCR scanner skill using a local Bash script (`ocr.sh`) that utilizes `tesseract` for text extraction from image files. - Updated documentation in SKILL.md to reflect actual usage, options, and integration steps with system aliases. - Removed placeholder and structuring guidance from documentation, providing concrete, ready-to-use instructions. - The skill now provides immediate OCR capability for JPG, PNG, BMP, GIF, and TIFF files using a local Tesseract installation.

v1.0.0

Initial release of ocr-scanner-image. - Perform OCR on image files (jpg, png, bmp, gif, tiff) and return extracted text. - Supports images such as screenshots, documents, receipts, and handwritten notes. - Accepts image uploads or URLs for processing. - Offers optional language selection for OCR.

元数据

Slug ocr-scanner-image

版本 1.0.1

许可证 MIT-0

累计安装 1

当前安装数 0

历史版本数 2

常见问题

Ocr Scanner Image 是什么？

Perform OCR on image files (jpg, png, bmp, gif, tiff) using the system's `tesseract` binary and return extracted plain text. 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件，目前累计下载 238 次。

如何安装 Ocr Scanner Image？

在 OpenClaw 或 Claude Code 对话框中运行命令「/install ocr-scanner-image」即可一键安装，无需额外配置。

Ocr Scanner Image 是免费的吗？

是的，Ocr Scanner Image 完全免费，采用 MIT-0 许可证，可自由下载、安装和使用。

Ocr Scanner Image 支持哪些平台？

Ocr Scanner Image 跨平台运行，可在任意部署了 OpenClaw / Claude Code 的环境中使用（cross-platform）。

谁开发了 Ocr Scanner Image？

由 kaarl92（@kaarl92）开发并维护，当前版本 v1.0.1。