/install image-to-markdown
Image to Markdown - OCR Extract Text from Images
Extract text from images to Markdown using MinerU Open API. No API key required.
Quick Start
# Extract text from a local image
mineru-open-api flash-extract screenshot.png
# Extract text from an image URL (no download needed)
mineru-open-api flash-extract https://example.com/image.png
# Save to file
mineru-open-api flash-extract photo.jpg -o ./output/
# Specify language for better accuracy
mineru-open-api flash-extract scan.jpg --language en
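For a small folder of screenshots, the "Save to file" command above can be wrapped in a loop. This is only a sketch: `extract_dir` is a hypothetical helper name, not part of the CLI, and it uses nothing beyond the `flash-extract` subcommand and the `-o` option shown above. It assumes `-o` accepts a directory that already exists, so it creates the directory first. For heavier batch work, the Notes section points to `mineru-open-api extract` instead.

```shell
#!/usr/bin/env bash
# Hypothetical helper; extract_dir is not part of mineru-open-api.
# Usage: extract_dir <source-dir> <output-dir>
extract_dir() {
  local src="$1" out="$2" f
  # Assumption: the output directory should exist before the CLI writes to it
  mkdir -p "$out"
  for f in "$src"/*.png "$src"/*.jpg "$src"/*.jpeg; do
    # An unmatched glob stays literal, so skip anything that is not a file
    [ -f "$f" ] || continue
    # Same form as the Quick Start "Save to file" command
    mineru-open-api flash-extract "$f" -o "$out/" \
      || echo "extraction failed: $f" >&2
  done
}
```

Calling `extract_dir ./screenshots ./output` would then run one `flash-extract` call per matching image and report any failures on stderr without stopping the loop.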
Language Rule
You MUST reply to the user in the SAME language they use. This is non-negotiable.
Capabilities
- OCR text extraction from PNG, JPG, JPEG, WebP, BMP, TIFF
- Supports both local files and URLs directly
- Language hint with --language (default: ch, use en for English)
- No API key, no signup, no authentication
- Max 10MB per image
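The format and size limits listed above can be checked locally before an image is uploaded. The following is a minimal sketch, not something the CLI provides: `is_supported_image` and `MAX_BYTES` are hypothetical names, the extension list simply mirrors the formats named above, and "10MB" is read conservatively as 10,000,000 bytes because the exact definition is not stated. It applies to local files only, not URLs.

```shell
#!/usr/bin/env bash
# Hypothetical pre-flight check; not part of mineru-open-api.
# Assumption: "10MB" is read conservatively as 10,000,000 bytes.
MAX_BYTES=10000000

# Returns 0 if the file exists, has a listed extension, and fits the size limit.
is_supported_image() {
  local file="$1" ext size
  # Must be an existing regular file
  [ -f "$file" ] || return 1
  # Lowercase the extension so photo.JPG matches jpg
  ext=$(printf '%s' "${file##*.}" | tr '[:upper:]' '[:lower:]')
  case "$ext" in
    png|jpg|jpeg|webp|bmp|tiff) ;;
    *) return 1 ;;
  esac
  # wc -c prints the byte count; strip the padding some platforms add
  size=$(wc -c < "$file" | tr -d ' ')
  [ "$size" -le "$MAX_BYTES" ]
}
```

A typical use would be `is_supported_image photo.jpg && mineru-open-api flash-extract photo.jpg`, so that an oversized or unsupported file is rejected before any upload is attempted.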
When to Use
- User asks to "read", "extract", or "OCR" an image
- User shares a screenshot and asks what it says
- User wants text from a photo of a document or whiteboard
- User needs image content converted to Markdown
CLI Reference
Run mineru-open-api flash-extract --help for all available options.
Data Privacy
- flash-extract uploads the image to MinerU's cloud API for processing and returns the result. No account or API key is required.
- Images are processed in real-time and are not stored after extraction.
- For details, see https://mineru.net
Notes
- Output is Markdown text extracted via OCR
- For higher precision or batch processing, use mineru-open-api extract (requires auth via mineru-open-api auth)
- If the CLI cannot be installed via npm/uv/go, download it from https://mineru.net/ecosystem?tab=cli
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat: /install image-to-markdown
- After installation, invoke the skill by name or use /image-to-markdown
- Provide required inputs per the skill's parameter spec and get structured output
What is Image To Markdown?
Image to Markdown - extract text from images (PNG, JPG, WebP) to Markdown with OCR. Use when reading text from screenshots, photos, scanned pages, or any ima... It is an AI Agent Skill for Claude Code / OpenClaw, with 136 downloads so far.
How do I install Image To Markdown?
Run "/install image-to-markdown" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.
Is Image To Markdown free?
Yes, Image To Markdown is completely free, licensed under MIT-0. You can download, install and use it at no cost.
Which platforms does Image To Markdown support?
Image To Markdown is cross-platform and runs anywhere OpenClaw / Claude Code is available.
Who created Image To Markdown?
It is built and maintained by tanis90 (@tanis90); the current version is v1.0.0.