/install ocr-document
OCR Document - Extract Text from Scanned Documents and Images
Extract text from scanned documents and images using OCR via MinerU Open API. No API key required.
Quick Start
# OCR a scanned PDF
mineru-open-api flash-extract scanned.pdf
# OCR an image of a document
mineru-open-api flash-extract page-photo.jpg
# OCR from URL (no download needed)
mineru-open-api flash-extract https://example.com/scanned.pdf
# Specify language for better accuracy
mineru-open-api flash-extract scanned.pdf --language en
# Save OCR result to file
mineru-open-api flash-extract scanned.pdf -o ./output/
Language Rule
You MUST reply to the user in the SAME language they use. This is non-negotiable.
Capabilities
- OCR for scanned PDFs, photographed documents, images
- Supports PDF, PNG, JPG, WebP, BMP, TIFF
- Supports both local files and URLs directly
- Language hint with
--language(default:ch, useenfor English) - No API key, no signup, no authentication
- Max 10MB / 20 pages per document
When to Use
- User asks to "OCR" a document or image
- User has a scanned PDF that needs text extraction
- User shares a photo of a page and wants the text
- User mentions "scan", "handwriting", or "recognize text"
CLI Reference
Run mineru-open-api flash-extract --help for all available options.
Data Privacy
flash-extractuploads the document to MinerU's cloud API for processing and returns the result. No account or API key is required.- Documents are processed in real-time and are not stored after extraction.
- For details, see https://mineru.net
Notes
- Best results with clear, high-resolution scans
- For higher precision OCR with full layout preservation, use
mineru-open-api extract --ocr(requires auth viamineru-open-api auth) - If the CLI cannot be installed via npm/uv/go, download it from https://mineru.net/ecosystem?tab=cli
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat:
/install ocr-document - After installation, invoke the skill by name or use
/ocr-document - Provide required inputs per the skill's parameter spec and get structured output
What is Ocr Document?
OCR document extraction - extract text from scanned documents, photos, and images using OCR. Use when reading scanned PDFs, photographed pages, handwritten n... It is an AI Agent Skill for Claude Code / OpenClaw, with 339 downloads so far.
How do I install Ocr Document?
Run "/install ocr-document" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.
Is Ocr Document free?
Yes, Ocr Document is completely free, licensed under MIT-0. You can download, install and use it at no cost.
Which platforms does Ocr Document support?
Ocr Document is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).
Who created Ocr Document?
It is built and maintained by tanis90 (@tanis90); the current version is v1.0.0.