← Back to Skills Marketplace

MinerU PDF Parser

Name: MinerU PDF Parser
Author: tsekaluk

by TsekaLuk · GitHub ↗ · v2.1.0

cross-platform ✓ Security Clean

421

Downloads

Stars

Active Installs

Versions

Install in OpenClaw

/install mineru-skill

Description

Parse PDFs, Word docs, PPTs, and images into clean Markdown using MinerU's VLM engine. Use when: (1) Converting PDF/Word/PPT/image to Markdown, (2) Extractin...

Usage Guidance

This skill uploads your documents to mineru.net for parsing — do not use it with confidential or regulated documents unless you trust the service and its privacy policy. Provide only the MINERU_TOKEN (no other credentials required). Note the minor metadata bug in the registry (shows [object Object]) and that the async script currently enumerates only PDFs; if you need DOCX/PPTX images in async mode use the other scripts or confirm behavior. Installing the skill will require installing standard Python packages (requests, aiohttp). Verify output paths (especially cloud/Obsidian paths) before running to avoid accidental writes or syncs.

Capability Analysis

Type: OpenClaw Skill Name: mineru-skill Version: 2.1.0 The OpenClaw AgentSkills bundle for MinerU is benign. All Python scripts (`mineru_async.py`, `mineru_stable.py`, `mineru_v2.py`) consistently implement the stated purpose: uploading user-specified documents to the `https://mineru.net/api/v4` endpoint for parsing and then downloading the processed results. The `SKILL.md` instructions are clear, functional, and do not contain any prompt injection attempts or directives for unauthorized actions. The skill transparently requires an API token and sends user data to a third-party service, which aligns with its described functionality and does not indicate malicious intent from the skill bundle itself.

Capability Assessment

✓ Purpose & Capability

Name/description (convert PDFs/Word/PPT/images to Markdown) aligns with the provided scripts and API usage. The scripts call mineru.net endpoints, accept files, and produce Markdown — functionality matches the stated purpose.

ℹ Instruction Scope

SKILL.md instructs the agent to set MINERU_TOKEN and run provided Python scripts, upload files, and save extracted output locally or to an Obsidian path. All referenced operations (reading input files, uploading to mineru.net, downloading/ extracting zip results, writing output) are within the expected scope. Minor inconsistency: scripts/mineru_async.py only enumerates PDF files (glob *.pdf) while SKILL.md claims async version supports all listed formats; this is likely an implementation detail but worth noting.

✓ Install Mechanism

Dependencies are standard PyPI packages (requests, aiohttp) and there is a requirements.txt. No suspicious remote downloads or archive extraction from untrusted URLs are used. This is a typical pip-based installation footprint.

ℹ Credentials

The only required credential is MINERU_TOKEN (Bearer API token), which is proportionate for a cloud parsing service. One metadata glitch in the registry shows 'Required env vars: [object Object]' — a UI/metadata bug but not a functional request for additional unrelated secrets.

✓ Persistence & Privilege

Skill is not always-enabled and does not request persistent elevated privileges or modify other skills. It runs only when invoked and only uses the provided token and local filesystem paths for input/output.

How to Use

Make sure OpenClaw is installed (local or Docker)
Run the install command in chat: /install mineru-skill
After installation, invoke the skill by name or use /mineru-skill
Provide required inputs per the skill's parameter spec and get structured output

Version History

v2.1.0

Initial release — convert PDF, Word, PPT, and images to clean Markdown with LaTeX formula preservation, table extraction, multilingual OCR, and batch parallel processing (up to 15 workers) with resume support.

Metadata

Slug mineru-skill

Version 2.1.0

License —

All-time Installs 1

Active Installs 0

Total Versions 1

Frequently Asked Questions

What is MinerU PDF Parser?

Parse PDFs, Word docs, PPTs, and images into clean Markdown using MinerU's VLM engine. Use when: (1) Converting PDF/Word/PPT/image to Markdown, (2) Extractin... It is an AI Agent Skill for Claude Code / OpenClaw, with 421 downloads so far.

How do I install MinerU PDF Parser?

Run "/install mineru-skill" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is MinerU PDF Parser free?

Yes, MinerU PDF Parser is completely free (open-source). You can download, install and use it at no cost.

Which platforms does MinerU PDF Parser support?

MinerU PDF Parser is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created MinerU PDF Parser?

It is built and maintained by TsekaLuk (@tsekaluk); the current version is v2.1.0.

More Skills