mineru-extract

MinerU Doc Parser

by MinerU-Extract · GitHub ↗ · v0.2.1 · MIT-0
cross-platform ⚠ suspicious
1010 Downloads · 6 Stars · 2 Active Installs · 9 Versions
Install in OpenClaw
/install mineru-ai
Description
MinerU AI document parser — intelligent document extraction powered by AI. Parse PDFs, scanned documents, images, Word files, PowerPoint slides, and web page...
Usage Guidance
This skill appears to be a normal CLI wrapper for MinerU's document parsing service, but there are a few things to check before installing or using it:
- Expect that documents you parse are likely uploaded to MinerU's servers (token management and the site URL are referenced). Do not use it with sensitive or confidential documents unless you have verified MinerU's privacy/security policies.
- The SKILL.md mentions an environment variable (MINERU_TOKEN) and a config file (~/.mineru/config.yaml) but the registry metadata did not declare these. Verify how tokens are stored, file permissions, and whether tokens are encrypted or stored in plaintext.
- Confirm the npm package and GitHub repo: inspect the mineru-open-api package contents and the GitHub source (github.com/MinerU-Extract/mineru-ai) before installing globally. Prefer verifying package integrity (checksums, signed releases) and review the code that performs network uploads.
- If you need offline/local-only parsing, confirm the tool supports local-only mode; otherwise avoid using it for private documents.

If you want, I can: (1) fetch the mineru-open-api npm package metadata and popular-vote/comment history, (2) summarize the GitHub repo's README and code paths related to network I/O, or (3) draft safer usage instructions (e.g., run in an isolated VM, inspect config file contents).
Capability Analysis
Type: OpenClaw Skill
Name: mineru-ai
Version: 0.2.1

The mineru-ai skill is a legitimate wrapper for the MinerU AI document parsing service, providing tools for converting PDFs and images into structured formats like Markdown. The SKILL.md file contains well-defined instructions for the agent to handle authentication, choose between extraction modes, and manage output directories safely. No indicators of data exfiltration, malicious execution, or harmful prompt injection were found; the tool's behavior is consistent with its stated purpose of document processing via the mineru-open-api CLI.
Capability Assessment
Purpose & Capability
Name, description, and required binary (mineru-open-api) match a CLI-based document parser. Supported formats, extraction modes, and npm/go install options are coherent with the stated purpose.
Instruction Scope
SKILL.md instructs the agent to run mineru-open-api commands including flash-extract, extract, crawl, and auth. It also documents token resolution from an on-disk config file (~/.mineru/config.yaml) and an environment variable (MINERU_TOKEN). The registry metadata lists no required config paths or env vars, so the instructions read a file and environment state that were never declared. The instructions also imply that documents and URLs are sent to mineru.net endpoints (tokens are created at mineru.net), but they do not explicitly state where processing occurs or what privacy terms apply.
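The version history (v1.0.6) describes a workflow that tries flash-extract first and switches to extract with a token when needed. The Python sketch below illustrates that control flow only. The subcommand names and the MINERU_TOKEN variable come from this listing; the argument layout (input passed as a single positional argument) and the rule "any nonzero exit status means fall back" are assumptions, so check the CLI's own documentation for the real exit codes before relying on it.

```python
import os
import subprocess

CLI = "mineru-open-api"  # binary name taken from the listing


def extract_with_fallback(source, env=None, run=subprocess.run):
    """Try flash-extract first; fall back to extract when a token is set.

    Assumptions (not confirmed by the listing): each subcommand accepts the
    input as its only positional argument, and any nonzero exit status from
    flash-extract means "retry with extract". This sketch only looks at the
    MINERU_TOKEN environment variable and does not consult the config file.
    """
    env = dict(os.environ) if env is None else dict(env)

    quick = run([CLI, "flash-extract", source], env=env)
    if quick.returncode == 0:
        return "flash-extract"

    if not env.get("MINERU_TOKEN"):
        raise RuntimeError("flash-extract failed and MINERU_TOKEN is not set")

    full = run([CLI, "extract", source], env=env)
    if full.returncode != 0:
        raise RuntimeError(f"extract failed with exit status {full.returncode}")
    return "extract"
```

The `run` parameter is injectable so the flow can be exercised with a stub instead of the real binary, which is useful for checking the logic without uploading any document.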
Install Mechanism
Install is via npm package or go install from a GitHub repo — standard mechanisms for a CLI. Both create the expected binary name mineru-open-api. No high-risk download URLs or extract-from-unknown-host actions are present.
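The Usage Guidance recommends verifying package integrity before installing. As a minimal sketch of one way to do that, the Python function below compares downloaded tarball bytes against an SRI-style integrity string of the form `sha512-<base64 digest>`, which is the format the npm registry publishes for each package version. It assumes a single-hash string with no options; `matches_integrity` is a hypothetical helper name, not part of any MinerU tooling.

```python
import base64
import hashlib


def matches_integrity(tarball_bytes: bytes, integrity: str) -> bool:
    """Check bytes against an SRI string such as 'sha512-<base64 digest>'.

    Assumes the string holds exactly one hash and no trailing options.
    """
    algorithm, _, expected_b64 = integrity.partition("-")
    if algorithm not in ("sha256", "sha384", "sha512"):
        raise ValueError(f"unsupported algorithm: {algorithm!r}")
    digest = hashlib.new(algorithm, tarball_bytes).digest()
    return base64.b64encode(digest).decode("ascii") == expected_b64
```

In practice the expected string could be read with `npm view mineru-open-api dist.integrity` and the tarball fetched with `npm pack mineru-open-api`, which downloads the package without installing it.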
Credentials
Registry metadata lists no required env vars, yet SKILL.md documents MINERU_TOKEN and interactive auth. The skill will accept an API token and can read ~/.mineru/config.yaml; requesting/using tokens is reasonable for an API service, but the lack of declaration is an inconsistency. Also, processing likely involves uploading documents to MinerU servers — requiring a token is proportionate for production use but increases sensitivity of data sent.
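Because the token sources are undeclared, it can help to check what is actually present on a machine before running the skill. The sketch below reports whether MINERU_TOKEN is set and whether ~/.mineru/config.yaml exists and is readable by other local users, without ever reading the token value. The function name and report keys are hypothetical, the listing does not say which source takes precedence, and the permission check is meaningful only on POSIX systems.

```python
import os
import stat
from pathlib import Path


def audit_mineru_credentials(env=None, home=None):
    """Report where a MinerU token might live, without reading its value."""
    env = os.environ if env is None else env
    home = Path.home() if home is None else Path(home)
    config = home / ".mineru" / "config.yaml"  # path named in the listing

    report = {
        "env_token_set": bool(env.get("MINERU_TOKEN")),
        "config_exists": config.is_file(),
        "config_mode": None,
        "config_readable_by_others": None,
    }
    if report["config_exists"]:
        mode = stat.S_IMODE(config.stat().st_mode)
        report["config_mode"] = oct(mode)
        # Group or world read bits mean other local users can read the file.
        report["config_readable_by_others"] = bool(
            mode & (stat.S_IRGRP | stat.S_IROTH)
        )
    return report
```

A result with `config_readable_by_others` set to `True` suggests tightening the file to owner-only access (for example mode 600) if the file turns out to hold a plaintext token.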
Persistence & Privilege
Skill is user-invocable and not always-enabled. It does not request permanent platform privileges or claim to modify other skills. Normal autonomous invocation is allowed by default.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install mineru-ai
  3. After installation, invoke the skill by name or use /mineru-ai
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v0.2.1
Fix: republish with complete SKILL.md content (previous publish had truncated CLI documentation)
v0.2.0
SEO optimization: new AI-focused description with 200+ words, bilingual keywords, trigger phrases for AI document parsing search queries
v1.0.10
- No file changes detected; maintenance version release.
- Documentation, description, features, and usage unchanged from previous release.
- No new features, bug fixes, or command updates included in this version.
v1.0.9
- Added table and formula recognition support to token-free flash extraction mode.
- Updated documentation to reflect that flash-extract now recognizes tables and formulas in quick extraction.
- Adjusted comparison tables to show feature parity between flash-extract and extract for table/formula recognition.
- Clarified best use cases for flash-extract and extract modes.
v1.0.8
Version 1.0.8 (no file changes detected):
- No detectable changes to files or documentation in this release.
- All features and documentation remain the same as in the previous version.
v1.0.7
mineru-ai v1.0.7
- Documentation streamlined for conciseness and ease of use, especially in the Core workflow section.
- No changes to code or binaries; documentation only.
- Command usage instructions remain unchanged.
- All features and limits are consistent with previous versions.
v1.0.6
**Improved extraction workflow and error handling for MinerU CLI.**
- Updated core workflow to always try `flash-extract` first for any input (local file or URL), for faster and simpler usage.
- Documented how to interpret `flash-extract` exit codes and next actions (e.g., when to switch to `extract` with a token).
- Clarified difference between document URL handling and web page extraction (`flash-extract` vs `crawl`).
- Enhanced troubleshooting guidance for error cases in the extraction flow.
- Improved workflow steps and user instructions for better clarity and usability.
v1.0.5
- Added npm and Go installation instructions for mineru-open-api; now supports install via npm or go install.
- Updated metadata to reflect new install methods, replacing previous direct download scripts.
- Removed curl/PowerShell script installation instructions.
- No CLI feature changes.
v1.0.3
mineru 1.0.3
- Added CONTRIBUTING.md to guide community contributions.
- Added _meta.json for enhanced metadata management.
- Overhauled SKILL.md to provide detailed CLI-based instructions, including installation, usage, and feature comparison for flash-extract and extract modes.
- Expanded documentation on supported file types, output formats, command-line flags, web crawling, and batch processing.
- Clarified extraction limits, setup steps, and model selection guidance for improved user onboarding.
Metadata
Slug mineru-ai
Version 0.2.1
License MIT-0
All-time Installs 2
Active Installs 2
Total Versions 9
Frequently Asked Questions

What is MinerU Doc Parser?

MinerU AI document parser — intelligent document extraction powered by AI. Parse PDFs, scanned documents, images, Word files, PowerPoint slides, and web page... It is an AI Agent Skill for Claude Code / OpenClaw, with 1010 downloads so far.

How do I install MinerU Doc Parser?

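Run "/install mineru-ai" in the OpenClaw or Claude Code chat to install the skill in one step. The skill itself needs no further setup, but the Capability Assessment notes that it relies on the mineru-open-api binary, which is installed via npm or go install.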

Is MinerU Doc Parser free?

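Yes. The skill is licensed under MIT-0, so you can download, install, and use it at no cost. The listing does not say whether the MinerU service behind it is free; the version history describes a token-free flash-extract mode and an extract mode that uses a token.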

Which platforms does MinerU Doc Parser support?

MinerU Doc Parser is cross-platform and runs anywhere OpenClaw or Claude Code is available.

Who created MinerU Doc Parser?

It is built and maintained by MinerU-Extract (@mineru-extract); the current version is v0.2.1.
