Downloads: 168
Stars: 0
Active Installs: 0
Versions: 6
Install in OpenClaw
/install photo-ocr
Description
OCR for photos and images using MinerU. Extract text from photographs, screenshots, camera captures, and image files with high accuracy. Features: image OCR...
Usage Guidance
This skill appears coherent, but follow standard precautions before installing. Verify the npm package and the GitHub repo (publisher identity, recent commits, stars/issues) to reduce the risk of typosquatting or malicious packages. Only provide MINERU_TOKEN if you trust the service, and grant the token the least privilege it needs. If you prefer not to supply credentials, use 'flash-extract' (no token) for small, quick OCR jobs. Review the source code of the installed mineru-open-api binary before running it on sensitive images, and revoke the token if you observe unexpected behavior.
Capability Analysis
Type: OpenClaw Skill
Name: photo-ocr
Version: 0.4.0
The photo-ocr skill is a legitimate wrapper for the MinerU OCR tool (OpenDataLab). It facilitates text extraction from images using the 'mineru-open-api' CLI via npm or Go. The skill's metadata and instructions in SKILL.md are consistent with its stated purpose, requiring standard permissions for file access and network requests to the MinerU API (mineru.net). No evidence of malicious intent, data exfiltration, or prompt injection was found.
Capability Assessment
Purpose & Capability
Name/description (image OCR via MinerU) match the declared binary (mineru-open-api) and the commands documented in SKILL.md. Required binary and primary credential are appropriate for an OCR CLI wrapper.
Instruction Scope
SKILL.md only instructs the agent to run mineru-open-api commands, set or use MINERU_TOKEN for authenticated calls, and points to mineru.net/GitHub for tokens and source. It does not direct the agent to read unrelated files, exfiltrate data to unexpected endpoints, or access other environment variables.
Install Mechanism
Install options are standard: an npm package (mineru-open-api) or a 'go install' from an OpenDataLab GitHub repo. Both are expected for a CLI. As with any third‑party package, installing via npm or Go pulls code onto the host, so verify the source first (package page, repository, checksums/tags).
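The checksum verification mentioned above can be sketched in Python. This is a minimal illustration, not part of the skill: the helper names `sha256_of_file` and `matches_checksum` are hypothetical, and it assumes the publisher provides a SHA-256 hex digest for the downloaded artifact, which the listing does not confirm.

```python
import hashlib
import hmac


def sha256_of_file(path, chunk_size=65536):
    """Return the SHA-256 hex digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # Read fixed-size chunks so large artifacts do not load into memory at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_checksum(path, expected_hex):
    """Compare a file's digest with a published one (hypothetical helper)."""
    actual = sha256_of_file(path)
    # Normalize whitespace and case before a constant-time comparison.
    return hmac.compare_digest(actual, expected_hex.strip().lower())
```

If `matches_checksum` returns False, the safest reading is that the artifact is not the one the publisher released, and it should not be installed.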
Credentials
Only MINERU_TOKEN is required and is the primary credential; SKILL.md documents that some commands (flash-extract) work without a token while extract requires it. Requesting a single service token is proportional to the skill's features.
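The token rule above can be sketched as a small Python helper that picks a subcommand depending on whether MINERU_TOKEN is set. The subcommand names `extract` and `flash-extract` come from this listing. The function name `build_ocr_command` is hypothetical, and passing the image path as a single positional argument is an assumption; check SKILL.md or the CLI's own help for the real argument shape.

```python
import os


def build_ocr_command(image_path, env=None):
    """Build an argument list for mineru-open-api (hypothetical helper).

    Uses 'extract' when MINERU_TOKEN is set and non-empty, otherwise
    falls back to the token-free 'flash-extract'.
    """
    env = os.environ if env is None else env
    subcommand = "extract" if env.get("MINERU_TOKEN") else "flash-extract"
    # ASSUMPTION: the image path is passed as one positional argument.
    return ["mineru-open-api", subcommand, image_path]
```

The helper only builds the argument list and never runs the binary, so the choice can be inspected before anything touches the network.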
Persistence & Privilege
The skill declares always:false and uses normal autonomous invocation behavior. Its instructions do not request persistent system-wide privileges or modify other skills' configs.
How to Use
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat: /install photo-ocr
- After installation, invoke the skill by name or use /photo-ocr
- Provide required inputs per the skill's parameter spec and get structured output
Version History
v0.4.0
SEO: expand description for better ClawHub vector search discovery
v0.3.0
Rollback to original version
v0.2.1
SEO optimization v0.2.1
v0.2.0
SEO optimization v0.2.0
v1.0.1
Minor update
v1.0.0
Initial release
Frequently Asked Questions
What is Image OCR?
Image OCR is an AI Agent Skill for Claude Code / OpenClaw that performs OCR on photos and images using MinerU. It extracts text from photographs, screenshots, camera captures, and image files with high accuracy. It has 168 downloads so far.
How do I install Image OCR?
Run "/install photo-ocr" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.
Is Image OCR free?
Yes, Image OCR is completely free, licensed under MIT-0. You can download, install and use it at no cost.
Which platforms does Image OCR support?
Image OCR is cross-platform and runs anywhere OpenClaw / Claude Code is available.
Who created Image OCR?
It is built and maintained by mzlzyCA (@mzlzyca); the current version is v0.4.0.