← Back to Skills Marketplace
tsekaluk

MinerU PDF Parser

by TsekaLuk · GitHub ↗ · v2.1.0
cross-platform ✓ Security Clean
421
Downloads
0
Stars
0
Active Installs
1
Versions
Install in OpenClaw
/install mineru-skill
Description
Parse PDFs, Word docs, PPTs, and images into clean Markdown using MinerU's VLM engine. Use when: (1) Converting PDF/Word/PPT/image to Markdown, (2) Extractin...
Usage Guidance
This skill uploads your documents to mineru.net for parsing — do not use it with confidential or regulated documents unless you trust the service and its privacy policy. Provide only the MINERU_TOKEN (no other credentials required). Note the minor metadata bug in the registry (shows [object Object]) and that the async script currently enumerates only PDFs; if you need DOCX/PPTX images in async mode use the other scripts or confirm behavior. Installing the skill will require installing standard Python packages (requests, aiohttp). Verify output paths (especially cloud/Obsidian paths) before running to avoid accidental writes or syncs.
Capability Analysis
Type: OpenClaw Skill Name: mineru-skill Version: 2.1.0 The OpenClaw AgentSkills bundle for MinerU is benign. All Python scripts (`mineru_async.py`, `mineru_stable.py`, `mineru_v2.py`) consistently implement the stated purpose: uploading user-specified documents to the `https://mineru.net/api/v4` endpoint for parsing and then downloading the processed results. The `SKILL.md` instructions are clear, functional, and do not contain any prompt injection attempts or directives for unauthorized actions. The skill transparently requires an API token and sends user data to a third-party service, which aligns with its described functionality and does not indicate malicious intent from the skill bundle itself.
Capability Assessment
Purpose & Capability
Name/description (convert PDFs/Word/PPT/images to Markdown) aligns with the provided scripts and API usage. The scripts call mineru.net endpoints, accept files, and produce Markdown — functionality matches the stated purpose.
Instruction Scope
SKILL.md instructs the agent to set MINERU_TOKEN and run provided Python scripts, upload files, and save extracted output locally or to an Obsidian path. All referenced operations (reading input files, uploading to mineru.net, downloading/ extracting zip results, writing output) are within the expected scope. Minor inconsistency: scripts/mineru_async.py only enumerates PDF files (glob *.pdf) while SKILL.md claims async version supports all listed formats; this is likely an implementation detail but worth noting.
Install Mechanism
Dependencies are standard PyPI packages (requests, aiohttp) and there is a requirements.txt. No suspicious remote downloads or archive extraction from untrusted URLs are used. This is a typical pip-based installation footprint.
Credentials
The only required credential is MINERU_TOKEN (Bearer API token), which is proportionate for a cloud parsing service. One metadata glitch in the registry shows 'Required env vars: [object Object]' — a UI/metadata bug but not a functional request for additional unrelated secrets.
Persistence & Privilege
Skill is not always-enabled and does not request persistent elevated privileges or modify other skills. It runs only when invoked and only uses the provided token and local filesystem paths for input/output.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install mineru-skill
  3. After installation, invoke the skill by name or use /mineru-skill
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v2.1.0
Initial release — convert PDF, Word, PPT, and images to clean Markdown with LaTeX formula preservation, table extraction, multilingual OCR, and batch parallel processing (up to 15 workers) with resume support.
Metadata
Slug mineru-skill
Version 2.1.0
License
All-time Installs 1
Active Installs 0
Total Versions 1
Frequently Asked Questions

What is MinerU PDF Parser?

Parse PDFs, Word docs, PPTs, and images into clean Markdown using MinerU's VLM engine. Use when: (1) Converting PDF/Word/PPT/image to Markdown, (2) Extractin... It is an AI Agent Skill for Claude Code / OpenClaw, with 421 downloads so far.

How do I install MinerU PDF Parser?

Run "/install mineru-skill" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is MinerU PDF Parser free?

Yes, MinerU PDF Parser is completely free (open-source). You can download, install and use it at no cost.

Which platforms does MinerU PDF Parser support?

MinerU PDF Parser is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created MinerU PDF Parser?

It is built and maintained by TsekaLuk (@tsekaluk); the current version is v2.1.0.

💬 Comments