/install pdf-to-docx
PDF to DOCX
Convert PDF files to editable Word (.docx) format using MinerU.
⚠️ Token required. flash-extract does not support DOCX output. You must configure a token via mineru-open-api auth before using this skill.
⚠️ Output to file required. DOCX is a binary format and cannot be streamed to stdout, so you must always specify -o <directory>.
Install
npm install -g mineru-open-api
# or via Go (macOS/Linux):
go install github.com/opendatalab/MinerU-Ecosystem/cli/mineru-open-api@latest
Authentication
Token required — create one at https://mineru.net/apiManage/token:
mineru-open-api auth # Interactive token setup
export MINERU_TOKEN="your-token" # Or via environment variable
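When the token is supplied through the environment (for example in a CI job), a small guard can catch a missing variable before any conversion starts. The following is a minimal sketch; require_mineru_token is a hypothetical helper name, not part of mineru-open-api, and it only inspects MINERU_TOKEN. It cannot detect a token stored by the interactive mineru-open-api auth setup.

```shell
#!/bin/sh
# Hypothetical helper (not shipped with mineru-open-api):
# succeeds only when MINERU_TOKEN is set to a non-empty value.
require_mineru_token() {
    if [ -z "${MINERU_TOKEN:-}" ]; then
        echo "MINERU_TOKEN is not set; export it or run 'mineru-open-api auth'" >&2
        return 1
    fi
    return 0
}

# Example use (commented out so the sketch has no side effects):
# require_mineru_token && mineru-open-api extract report.pdf -f docx -o ./out/
```

The guard returns a status instead of exiting, so it can be chained with && in scripts or interactive shells.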
Quick Start
# Convert PDF to DOCX (token required, -o is mandatory)
mineru-open-api extract report.pdf -f docx -o ./out/
# From URL
mineru-open-api extract https://example.com/report.pdf -f docx -o ./out/
# With language hint
mineru-open-api extract report.pdf -f docx --language en -o ./out/
# With VLM model for better layout accuracy (complex PDFs)
mineru-open-api extract report.pdf -f docx --model vlm -o ./out/
# Batch convert multiple PDFs
mineru-open-api extract *.pdf -f docx -o ./out/
Capabilities
- Supported input: .pdf (local file or URL)
- Output format: Word (.docx) via -f docx
- Token required (mineru-open-api auth or MINERU_TOKEN env)
- -o <dir> is mandatory: DOCX cannot stream to stdout
- Language hint with --language (default: ch, use en for English)
- Page range with --pages (e.g. 1-10)
- Batch mode supported: extract *.pdf -f docx -o ./out/
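Batch mode passes every PDF to a single invocation. If you would rather convert files one at a time, so that one failing PDF does not hide the results of the others, a loop is one option. This is a sketch under assumptions: convert_all and MINERU_CMD are hypothetical names introduced here, and the loop assumes the CLI returns a non-zero exit status when a conversion fails, which the text above does not state.

```shell
#!/bin/sh
# MINERU_CMD is a hypothetical override; set MINERU_CMD=echo for a dry run
# that prints the arguments instead of calling the real CLI.
MINERU_CMD="${MINERU_CMD:-mineru-open-api}"

# convert_all SRC_DIR OUT_DIR
# Runs "extract <pdf> -f docx -o OUT_DIR" once per PDF in SRC_DIR and
# returns non-zero if any invocation failed.
convert_all() {
    src_dir=$1
    out_dir=$2
    failed=0
    for pdf in "$src_dir"/*.pdf; do
        # Skip the unexpanded pattern when the directory has no PDFs.
        [ -e "$pdf" ] || continue
        "$MINERU_CMD" extract "$pdf" -f docx -o "$out_dir" || failed=$((failed + 1))
    done
    echo "conversions failed: $failed" >&2
    [ "$failed" -eq 0 ]
}

# Example use (commented out so the sketch has no side effects):
# convert_all ./pdfs ./out/
```

The failure count goes to stderr, matching the CLI's own convention of keeping status messages off stdout.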
Notes
- flash-extract does NOT support DOCX output; always use extract with a token
- DOCX output cannot be streamed to stdout; the -o flag is required
- Use --model vlm for PDFs with complex layouts, tables, or mixed content
- Use --model pipeline if you need guaranteed fidelity with no hallucination risk
- Output directory will be created if it does not exist
- All progress/status messages go to stderr
- MinerU is open-source by OpenDataLab (Shanghai AI Lab): https://github.com/opendatalab/MinerU
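Because progress and status messages go to stderr, ordinary shell redirection is enough to keep a log of a conversion run. A minimal sketch follows; run_with_log is a hypothetical helper name, and convert.log is an arbitrary file name chosen for illustration.

```shell
#!/bin/sh
# run_with_log LOGFILE COMMAND [ARGS...]
# Runs COMMAND, appending whatever it writes to stderr to LOGFILE.
# stdout is left untouched and the command's exit status is returned.
run_with_log() {
    log=$1
    shift
    "$@" 2>>"$log"
}

# Example use (commented out so the sketch has no side effects):
# run_with_log convert.log mineru-open-api extract report.pdf -f docx -o ./out/
```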
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat: /install pdf-to-docx
- After installation, invoke the skill by name or use /pdf-to-docx
- Provide required inputs per the skill's parameter spec and get structured output
What is PDF to DOCX?
PDF to DOCX converts PDF documents to editable Word (.docx) files using MinerU, preserving layout, text, tables, and formatting. It is an AI Agent Skill for Claude Code / OpenClaw, with 201 downloads so far.
How do I install PDF to DOCX?
Run "/install pdf-to-docx" in the OpenClaw or Claude Code chat to install the skill in one step. Converting files still requires the mineru-open-api CLI and a MinerU token, as described above.
Is PDF to DOCX free?
Yes, PDF to DOCX is completely free, licensed under MIT-0. You can download, install and use it at no cost.
Which platforms does PDF to DOCX support?
PDF to DOCX is cross-platform and runs anywhere OpenClaw / Claude Code is available.
Who created PDF to DOCX?
It is built and maintained by mzlzyCA (@mzlzyca); the current version is v0.4.0.