← 返回 Skills 市场
baokui

pdf-parser-mineru

作者 baokui · GitHub ↗ · v1.0.2
cross-platform ✓ 安全检测通过
1551
总下载
0
收藏
1
当前安装
3
版本数
在 OpenClaw 中安装
/install pdf-parser-mineru
功能描述
PDF document parsing tool based on local MinerU, supports converting PDF to Markdown, JSON, and other machine-readable formats.
安全使用建议
This skill is internally coherent: it installs MinerU and runs the mineru CLI to convert PDFs to Markdown/JSON. Before installing, consider the following: (1) mineru is a third-party PyPI package — review its project page and dependencies and prefer installation into an isolated virtual environment or container; (2) MinerU may download models or contact network endpoints at install/time or runtime — if you need offline/sandboxed processing, verify or block network access; (3) the skill requires absolute file paths and can read any PDF you point it at — avoid supplying sensitive documents to untrusted third-party binaries; (4) the included install.sh is safe-looking but will run pip installs and assumes Python 3.10–3.13; run it manually rather than automatically if you want to inspect it first. If you want stronger assurance, review the mineru package source and any model download behavior before use.
功能分析
Type: OpenClaw Skill Name: pdf-parser-mineru Version: 1.0.2 The OpenClaw AgentSkills skill bundle for PDF parsing using MinerU is classified as benign. All files, including `SKILL.md`, `install.sh`, and `script/pdf_parser.py`, align with the stated purpose of converting PDFs to Markdown or JSON. The `install.sh` script uses standard Python package management tools (`pip`, `uv`) to install the `mineru` dependency without any suspicious remote execution or persistence mechanisms. Crucially, the `script/pdf_parser.py` uses `subprocess.run` with a list of arguments, which safely prevents shell injection vulnerabilities from user-controlled parameters like `file_path` and `output_dir`. There are no indications of prompt injection attempts, data exfiltration, unauthorized network activity, or other malicious behaviors.
能力评估
Purpose & Capability
Name/description match the included files: SKILL.md documents running MinerU and the repository provides an install script and a Python wrapper that invokes the mineru CLI. Required capabilities (MinerU installation, Python) are proportional to the stated parsing functionality.
Instruction Scope
Runtime instructions and the Python script stay within the skill's scope: they accept an absolute file path and output directory, run a local mineru CLI process, and read/return generated files. The script sets a couple of local env vars to control device selection for the subprocess but does not read or transmit unrelated system secrets or contact hidden endpoints itself. Note: mineru (the third-party tool) may perform network activity or model downloads — that behavior is external to the skill and should be reviewed if you need offline guarantees.
Install Mechanism
There is no platform install spec in registry metadata, but an included install.sh performs pip and 'uv pip install -U "mineru[all]"'. Installing MinerU via PyPI is expected here; it's a moderate-risk operation (pulling packages from PyPI and possibly downloading models/data at runtime). No obscure URLs, shorteners, or direct archive downloads are used in the provided scripts.
Credentials
The skill requests no environment variables or credentials. The code sets PYTORCH_ENABLE_MPS_FALLBACK and MPS_DEVICE locally for the mineru subprocess (device control only). There are no requests for unrelated secrets or config paths.
Persistence & Privilege
Skill flags are standard (always: false, agent invocation allowed). The package does not request permanent system changes or modify other skills' configs. install.sh and the Python script only install MinerU and run it; they do not attempt to persist credentials or enable automatic always-on behavior.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install pdf-parser-mineru
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /pdf-parser-mineru 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.2
- Added a Chinese-language documentation file: SKILL_zh.md - No changes to code or functionality; documentation is now available in both English and Chinese.
v1.0.1
- 修正 SKILL.md 文件冒头字段,将原有的 YAML 字段 name 和描述语句合并为 name/description 形式。 - 其余内容未发生实质性变更。
v1.0.0
Initial release with MinerU-based PDF parsing and conversion tools. - Added pdf_to_markdown: Convert PDFs to Markdown with structure, formulas, tables, and image extraction using MinerU. - Added pdf_to_json: Convert PDFs to structured JSON with detailed layout, blocks, images, tables, and formulas. - Both tools support OCR, formula/table extraction toggles, multi-language, page range selection, and multiple parsing backends. - Included setup and system requirements, backend selection tips, troubleshooting, and usage scenarios.
元数据
Slug pdf-parser-mineru
版本 1.0.2
许可证
累计安装 1
当前安装数 1
历史版本数 3
常见问题

pdf-parser-mineru 是什么?

PDF document parsing tool based on local MinerU, supports converting PDF to Markdown, JSON, and other machine-readable formats. 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 1551 次。

如何安装 pdf-parser-mineru?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install pdf-parser-mineru」即可一键安装,无需额外配置。

pdf-parser-mineru 是免费的吗?

是的,pdf-parser-mineru 完全免费(开源免费),可自由下载、安装和使用。

pdf-parser-mineru 支持哪些平台?

pdf-parser-mineru 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 pdf-parser-mineru?

由 baokui(@baokui)开发并维护,当前版本 v1.0.2。

💬 留言讨论