← Back to Skills Marketplace
kesslerio

MinerU PDF Parser Clawdbot Skill

by kesslerio · GitHub ↗ · v1.0.0
cross-platform ⚠ suspicious
2288
Downloads
2
Stars
7
Active Installs
1
Versions
Install in OpenClaw
/install mineru-pdf-parser-clawdbot-skill
Description
Parse PDFs locally (CPU) into Markdown/JSON using MinerU. Assumes MinerU creates per‑doc output folders; supports table/image extraction.
Usage Guidance
This skill appears to do exactly what it claims: call a local MinerU CLI to parse PDFs into Markdown/JSON and write results to a local output folder. Before installing or running: (1) ensure you have a trustworthy MinerU CLI binary installed (the skill will run whatever MINERU_CMD points to), (2) be cautious if you set MINERU_EXTRA_ARGS or override MINERU_CMD — they can change the executed command or inject unexpected arguments, (3) run the skill on non-sensitive PDFs first to confirm output and behavior, and (4) be aware MinerU may use significant CPU and disk (models can be large). No credentials or network endpoints are required by the skill itself.
Capability Analysis
Type: OpenClaw Skill Name: Developer: Version: Description: OpenClaw Agent Skill The `scripts/mineru_parse.sh` script contains a shell injection vulnerability. The `MINERU_EXTRA_ARGS` environment variable is unquoted and word-split (`extra=( ${MINERU_EXTRA_ARGS} )`), and these split arguments are then passed to the `mineru` command. This allows an attacker to inject arbitrary shell commands (e.g., `MINERU_EXTRA_ARGS="; evil_command"`) which would be executed by the shell after the `mineru` command, enabling arbitrary command execution. While the skill itself does not explicitly instruct the agent to perform malicious actions, this risky capability presents a significant security vulnerability.
Capability Assessment
Purpose & Capability
The skill's name/description match its contents: the README, SKILL.md, and the shell script are all focused on invoking a local MinerU CLI to parse PDFs. No unrelated binaries, services, or credentials are requested.
Instruction Scope
SKILL.md instructs the agent to run scripts/mineru_parse.sh on a local PDF. The script validates the input file and checks for the MinerU CLI. It does not read or transmit other files or credentials. Note: the script accepts MINERU_EXTRA_ARGS and expands them into the command line (word splitting via shell expansion), so untrusted or user-supplied extra-arg strings could change command behavior — this is a normal tradeoff for CLI wrappers but worth awareness.
Install Mechanism
There is no install spec (instruction-only plus a helper script). Nothing is downloaded or written by the skill itself during install; the skill relies on an externally installed MinerU CLI as documented.
Credentials
The skill declares no required environment variables or credentials. It supports optional env overrides (MINERU_CMD, flag-name overrides, MINERU_EXTRA_ARGS, etc.) which are appropriate for wrapping different MinerU installations. No secret-like variables are requested.
Persistence & Privilege
The skill is user-invocable, not always-enabled, and does not modify other skills or system-wide configuration. It does not request elevated or persistent privileges.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install mineru-pdf-parser-clawdbot-skill
  3. After installation, invoke the skill by name or use /mineru-pdf-parser-clawdbot-skill
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
Initial release of mineru-pdf skill. - Parse PDFs locally using MinerU (CPU) with support for Markdown and JSON output. - Optional extraction of tables and images when specified. - Assumes per-document output folders created by MinerU. - Provides simple command-line usage with customizable output format and options. - Batch processing is not enabled by default; only implement upon request.
Metadata
Slug mineru-pdf-parser-clawdbot-skill
Version 1.0.0
License
All-time Installs 7
Active Installs 7
Total Versions 1
Frequently Asked Questions

What is MinerU PDF Parser Clawdbot Skill?

Parse PDFs locally (CPU) into Markdown/JSON using MinerU. Assumes MinerU creates per‑doc output folders; supports table/image extraction. It is an AI Agent Skill for Claude Code / OpenClaw, with 2288 downloads so far.

How do I install MinerU PDF Parser Clawdbot Skill?

Run "/install mineru-pdf-parser-clawdbot-skill" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is MinerU PDF Parser Clawdbot Skill free?

Yes, MinerU PDF Parser Clawdbot Skill is completely free (open-source). You can download, install and use it at no cost.

Which platforms does MinerU PDF Parser Clawdbot Skill support?

MinerU PDF Parser Clawdbot Skill is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created MinerU PDF Parser Clawdbot Skill?

It is built and maintained by kesslerio (@kesslerio); the current version is v1.0.0.

💬 Comments