← Back to Skills Marketplace

Image OCR

Name: Image OCR
Author: mzlzyca

by mzlzyCA · GitHub ↗ · v0.4.0 · MIT-0

cross-platform ✓ Security Clean

168

Downloads

Stars

Active Installs

Versions

Install in OpenClaw

/install photo-ocr

Description

OCR for photos and images using MinerU. Extract text from photographs, screenshots, camera captures, and image files with high accuracy. Features: image OCR...

Usage Guidance

This skill appears coherent, but follow standard precautions before installing: verify the npm package and the GitHub repo (publisher identity, recent commits, stars/issues) to reduce risk of typosquatting or malicious packages; only provide MINERU_TOKEN if you trust the service and give the token least privilege; if you prefer not to supply credentials, use 'flash-extract' (no token) for small quick OCR; review what the installed mineru-open-api binary does (source code) before running on sensitive images, and revoke the token if you observe unexpected behavior.

Capability Analysis

Type: OpenClaw Skill Name: photo-ocr Version: 0.4.0 The photo-ocr skill is a legitimate wrapper for the MinerU OCR tool (OpenDataLab). It facilitates text extraction from images using the 'mineru-open-api' CLI via npm or Go. The skill's metadata and instructions in SKILL.md are consistent with its stated purpose, requiring standard permissions for file access and network requests to the MinerU API (mineru.net). No evidence of malicious intent, data exfiltration, or prompt injection was found.

Capability Assessment

✓ Purpose & Capability

Name/description (image OCR via MinerU) match the declared binary (mineru-open-api) and the commands documented in SKILL.md. Required binary and primary credential are appropriate for an OCR CLI wrapper.

✓ Instruction Scope

SKILL.md only instructs the agent to run mineru-open-api commands, set or use MINERU_TOKEN for authenticated calls, and points to mineru.net/GitHub for tokens and source. It does not direct the agent to read unrelated files, exfiltrate data to unexpected endpoints, or access other environment variables.

ℹ Install Mechanism

Install options are standard: an npm package (mineru-open-api) and a Go 'go install' from an OpenDataLab GitHub repo. These are expected for a CLI. As with any third‑party package, installing from npm or go pulls code onto the host and should be verified (package page, repository, checksums/tags).

✓ Credentials

Only MINERU_TOKEN is required and is the primary credential; SKILL.md documents that some commands (flash-extract) work without a token while extract requires it. Requesting a single service token is proportional to the skill's features.

✓ Persistence & Privilege

always:false and normal autonomous invocation behavior. The skill does not request persistent system-wide privileges or modify other skills' configs in the instructions.

How to Use

Make sure OpenClaw is installed (local or Docker)
Run the install command in chat: /install photo-ocr
After installation, invoke the skill by name or use /photo-ocr
Provide required inputs per the skill's parameter spec and get structured output

Version History

v0.4.0

SEO: expand description for better ClawHub vector search discovery

v0.3.0

Rollback to original version

v0.2.1

SEO optimization v0.2.1

v0.2.0

SEO optimization v0.2.0

v1.0.1

Minor update

v1.0.0

Initial release

Metadata

Slug photo-ocr

Version 0.4.0

License MIT-0

All-time Installs 0

Active Installs 0

Total Versions 6

Frequently Asked Questions

What is Image OCR?

OCR for photos and images using MinerU. Extract text from photographs, screenshots, camera captures, and image files with high accuracy. Features: image OCR... It is an AI Agent Skill for Claude Code / OpenClaw, with 168 downloads so far.

How do I install Image OCR?

Run "/install photo-ocr" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Image OCR free?

Yes, Image OCR is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Image OCR support?

Image OCR is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Image OCR?

It is built and maintained by mzlzyCA (@mzlzyca); the current version is v0.4.0.

More Skills