Paper Parse
by Chen-Li-17 · v1.0.0 · MIT-0
Downloads: 274
Stars: 0
Active Installs: 0
Versions: 1
Install in OpenClaw
/install paper-parse-figures
Description
Parse academic PDF papers into markdown with figure extraction.
Usage Guidance
This skill appears coherent and only does local PDF parsing and image output. Before installing: (1) review the included script (you already have it) to confirm behavior; (2) be aware 'uv' will install two Python packages (PyMuPDF and pymupdf4llm) — review those package sources or run the skill in an isolated/ephemeral environment; (3) test with non-sensitive PDFs first and monitor network activity if you're concerned about dependencies contacting external endpoints.
Capability Analysis
Type: OpenClaw Skill
Name: paper-parse-figures
Version: 1.0.0
The skill bundle provides a legitimate tool for parsing academic PDFs into markdown and extracting figures. The primary script, `scripts/parse_paper.py`, uses PyMuPDF and pymupdf4llm to perform complex layout analysis, caption matching, and image cropping. The code is well-structured, lacks any signs of data exfiltration, malicious execution, or obfuscation, and its behavior aligns perfectly with the documentation in `SKILL.md`.
Capability Assessment
Purpose & Capability
Name/description ask for PDF parsing and figure extraction; SKILL.md and the included script both implement that. The required binary 'uv' is used to run the inline script and is appropriate for the declared usage.
Instruction Scope
Runtime instructions only call the included script with a --pdf and optional --output-dir. The script reads the supplied PDF, writes markdown/JSON/images to the output dir, and does not reference other files, system configuration, or environment variables.
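The command-line surface described above can be sketched roughly as follows. This is a minimal illustration, not the code in `scripts/parse_paper.py`: only the `--pdf` and `--output-dir` flag names come from the listing, while `build_parser`, `resolve_output_dir`, and the default output location are hypothetical.

```python
import argparse
from pathlib import Path
from typing import Optional


def build_parser() -> argparse.ArgumentParser:
    """Build a parser with the two flags named in the listing."""
    parser = argparse.ArgumentParser(
        description="Parse an academic PDF into markdown and extract figures."
    )
    # --pdf is the required input path.
    parser.add_argument("--pdf", required=True, type=Path, help="Input PDF file.")
    # --output-dir is optional; the fallback below is an assumption.
    parser.add_argument("--output-dir", type=Path, default=None,
                        help="Directory for markdown, JSON, and images.")
    return parser


def resolve_output_dir(pdf: Path, output_dir: Optional[Path]) -> Path:
    """Pick the output folder (hypothetical default: the PDF path minus its suffix)."""
    return output_dir if output_dir is not None else pdf.with_suffix("")


if __name__ == "__main__":
    args = build_parser().parse_args()
    print(resolve_output_dir(args.pdf, args.output_dir))
```

Keeping the script's inputs to two explicit path arguments is what makes the scope easy to audit: everything it reads or writes is named on the command line.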
Install Mechanism
There is no install spec (instruction-only) and the script declares PyMuPDF and pymupdf4llm as Python dependencies via inline metadata that 'uv' will manage. This is expected, but installing third‑party PyPI packages carries the usual supply-chain risk — review those packages or install in an isolated environment.
Credentials
The skill requires no environment variables, no credentials, and no config paths. The requested permissions are proportional to the task (only file read/write for the provided PDF and output folder).
Persistence & Privilege
The skill sets always:false and makes no modifications to other skills or global agent settings. It does not request persistent or system-level privileges.
How to Use
- Make sure OpenClaw is installed (local or Docker).
- Run the install command in chat: /install paper-parse-figures
- After installation, invoke the skill by name or use /paper-parse-figures
- Provide the required inputs per the skill's parameter spec to get structured output.
Version History
v1.0.0
Initial release of paper-parse-figures:
- Parse academic PDF papers into markdown, extracting figures and metadata.
- Outputs structured markdown, JSON metadata, high-resolution figure images, and a cover title/authors snapshot.
- Command-line tool with configurable PDF and output directory options.
- Uses PyMuPDF for PDF processing; dependencies managed automatically.
Frequently Asked Questions
What is Paper Parse?
Paper Parse is an AI Agent Skill for Claude Code / OpenClaw that parses academic PDF papers into markdown and extracts their figures. It has 274 downloads so far.
How do I install Paper Parse?
Run "/install paper-parse-figures" in the OpenClaw or Claude Code chat to install it in one step. The skill relies on the 'uv' binary to run its script and manage its Python dependencies.
Is Paper Parse free?
Yes, Paper Parse is completely free, licensed under MIT-0. You can download, install and use it at no cost.
Which platforms does Paper Parse support?
Paper Parse is cross-platform and runs anywhere OpenClaw / Claude Code is available.
Who created Paper Parse?
It is built and maintained by Chen-Li-17 (@chen-li-17); the current version is v1.0.0.