/install pymupdf-pdf-parser-clawdbot-skill
PyMuPDF PDF
Overview
Parse PDFs locally using PyMuPDF for fast, lightweight extraction into Markdown by default, with optional JSON and image/table outputs in a per-document directory.
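As a rough illustration of the kind of extraction described above, the following is a minimal sketch and not the skill's actual implementation. The function names `pages_to_markdown` and `pdf_to_markdown`, and the "## Page N" heading layout, are assumptions made for this example. It relies only on `fitz.open` and `Page.get_text("text")` from PyMuPDF.

```python
def pages_to_markdown(page_texts):
    """Join per-page plain text into one Markdown string.

    The "## Page N" heading per page is an assumed layout, chosen for
    illustration; the skill's real Markdown format may differ.
    """
    parts = []
    for number, text in enumerate(page_texts, start=1):
        parts.append(f"## Page {number}\n\n{text.strip()}\n")
    return "\n".join(parts)


def pdf_to_markdown(pdf_path):
    """Extract plain text from every page of a PDF and return Markdown."""
    # Imported lazily so pages_to_markdown() stays usable without PyMuPDF.
    import fitz  # PyMuPDF

    with fitz.open(pdf_path) as doc:
        # get_text("text") returns the page's plain text in reading order
        # as PyMuPDF sees it; complex layouts may come out scrambled.
        texts = [page.get_text("text") for page in doc]
    return pages_to_markdown(texts)
```

Calling `pdf_to_markdown("/path/to/file.pdf")` would return a single Markdown string; writing it to disk is left to the caller.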
Prereqs / when to read references
If you hit import errors (PyMuPDF not installed) or Nix libstdc++ issues, read:
references/pymupdf-notes.md
Quick start (single PDF)
# Run from the skill directory
./scripts/pymupdf_parse.py /path/to/file.pdf \
--format md \
--outroot ./pymupdf-output
Options
- --format md|json|both (default: md)
- --images to extract images
- --tables to extract a simple line-based table JSON (quick/rough)
- --outroot DIR to change output root
- --lang adds a language hint into JSON output metadata
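To make the option semantics concrete, here is a hedged sketch of how these flags could be declared with `argparse`. It mirrors only the flags listed above. The parser in the real `pymupdf_parse.py` is not shown in this document, so `build_parser`, the help strings, and the `--outroot` default (taken from the default output directory) are assumptions.

```python
import argparse


def build_parser():
    """Hypothetical parser mirroring the documented flags."""
    parser = argparse.ArgumentParser(prog="pymupdf_parse.py")
    parser.add_argument("pdf", help="path to the input PDF")
    parser.add_argument("--format", choices=["md", "json", "both"], default="md",
                        help="output format (default: md)")
    parser.add_argument("--images", action="store_true",
                        help="extract images")
    parser.add_argument("--tables", action="store_true",
                        help="extract rough line-based tables as JSON")
    parser.add_argument("--outroot", default="./pymupdf-output", metavar="DIR",
                        help="output root directory")
    parser.add_argument("--lang", default=None,
                        help="language hint stored in JSON output metadata")
    return parser


# Example: request both formats plus images for one PDF.
args = build_parser().parse_args(["/path/to/file.pdf", "--format", "both", "--images"])
```

With this sketch, `args.format` is `"both"`, `args.images` is `True`, and the unspecified flags keep their defaults.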
Output conventions
- Create ./pymupdf-output/<pdf-basename>/ by default.
- Markdown output: output.md
- JSON output: output.json (includes lang)
- Images: images/ subdir
- Tables: tables.json (rough line-based)
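The layout above can be sketched as a small path helper. This is an illustration only: `output_paths` is a hypothetical name, and it assumes that `<pdf-basename>` means the PDF file name without its extension.

```python
from pathlib import Path


def output_paths(pdf_path, outroot="./pymupdf-output"):
    """Return the per-document output locations for one PDF.

    Assumes <pdf-basename> is the file name without the .pdf suffix.
    """
    doc_dir = Path(outroot) / Path(pdf_path).stem
    return {
        "dir": doc_dir,
        "markdown": doc_dir / "output.md",
        "json": doc_dir / "output.json",
        "images": doc_dir / "images",
        "tables": doc_dir / "tables.json",
    }


paths = output_paths("/path/to/file.pdf")
```

Here `paths["markdown"]` points at `pymupdf-output/file/output.md`. The helper only computes paths and does not create any directories.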
Notes
- PyMuPDF is fast but less robust on complex PDFs.
- For more robust parsing, use a heavy-duty OCR parser (e.g., MinerU) if installed.
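- Make sure OpenClaw is installed (locally or via Docker).
- Enter the install command in the chat box: /install pymupdf-pdf-parser-clawdbot-skill
- Once installation finishes, invoke the Skill by name or trigger it with /pymupdf-pdf-parser-clawdbot-skill
- Provide the inputs described in the Skill's parameter documentation to get structured output.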
What is PyMuPDF PDF Parser Clawdbot Skill?
Fast local PDF parsing with PyMuPDF (fitz) for Markdown/JSON outputs and optional images/tables. Use when speed matters more than robustness, or as a fallback while heavier parsers are unavailable. Default to single-PDF parsing with per-document output folders. It is an AI Agent Skill plugin for Claude Code / OpenClaw, with 5382 cumulative downloads to date.
How do I install PyMuPDF PDF Parser Clawdbot Skill?
Run the command "/install pymupdf-pdf-parser-clawdbot-skill" in the OpenClaw or Claude Code chat box to install it in one step; no additional configuration is required.
Is PyMuPDF PDF Parser Clawdbot Skill free?
Yes. PyMuPDF PDF Parser Clawdbot Skill is completely free (free and open source) and can be freely downloaded, installed, and used.
Which platforms does PyMuPDF PDF Parser Clawdbot Skill support?
PyMuPDF PDF Parser Clawdbot Skill is cross-platform and can be used in any environment where OpenClaw / Claude Code is deployed.
Who developed PyMuPDF PDF Parser Clawdbot Skill?
It is developed and maintained by kesslerio (@kesslerio); the current version is v1.0.0.