/install pdf-to-docx
PDF to DOCX
Convert PDF files to editable Word (.docx) format using MinerU.
⚠️ Token required. flash-extract does not support DOCX output. You must configure a token via mineru-open-api auth before using this skill.
⚠️ Output to file required. DOCX is a binary format and cannot be streamed to stdout, so you must always specify -o <directory>.
Install
npm install -g mineru-open-api
# or via Go (macOS/Linux):
go install github.com/opendatalab/MinerU-Ecosystem/cli/mineru-open-api@latest
Authentication
Token required — create one at https://mineru.net/apiManage/token:
mineru-open-api auth # Interactive token setup
export MINERU_TOKEN="your-token" # Or via environment variable
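A script that calls the CLI non-interactively may want to fail early when no token has been exported. The sketch below checks only the MINERU_TOKEN variable; the function name require_mineru_token is hypothetical and not part of mineru-open-api. It cannot detect a token stored by the interactive mineru-open-api auth flow, so treat it as a convenience for the environment-variable path only.

```shell
# Hypothetical helper (not part of mineru-open-api): succeed only when
# MINERU_TOKEN is set to a non-empty value in the current shell.
require_mineru_token() {
    if [ -z "${MINERU_TOKEN:-}" ]; then
        echo "MINERU_TOKEN is not set; run 'mineru-open-api auth' or export it first." >&2
        return 1
    fi
}

# Example use in a script:
#   require_mineru_token && mineru-open-api extract report.pdf -f docx -o ./out/
```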
Quick Start
# Convert PDF to DOCX (token required, -o is mandatory)
mineru-open-api extract report.pdf -f docx -o ./out/
# From URL
mineru-open-api extract https://example.com/report.pdf -f docx -o ./out/
# With language hint
mineru-open-api extract report.pdf -f docx --language en -o ./out/
# With VLM model for better layout accuracy (complex PDFs)
mineru-open-api extract report.pdf -f docx --model vlm -o ./out/
# Batch convert multiple PDFs
mineru-open-api extract *.pdf -f docx -o ./out/
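When converting many files, a per-file loop can make it easier to see which PDF failed than a single glob call. The following is a minimal sketch, not a built-in feature: convert_each_pdf and the MINERU_CLI override are hypothetical names introduced here for illustration, and only the extract, -f docx, and -o options shown above are used. It assumes the CLI exits with a non-zero status on failure.

```shell
# Hypothetical wrapper: convert each PDF in a directory one at a time so that
# one failing file does not hide the others. MINERU_CLI is an illustrative
# override (useful for dry runs); it defaults to the real CLI name.
# The body runs in a subshell, so its variables do not leak to the caller.
convert_each_pdf() (
    src_dir=$1
    out_dir=$2
    failed=0
    for pdf in "$src_dir"/*.pdf; do
        # If the glob matched nothing, the pattern is left unexpanded; skip it.
        [ -e "$pdf" ] || continue
        if ! "${MINERU_CLI:-mineru-open-api}" extract "$pdf" -f docx -o "$out_dir"; then
            echo "failed: $pdf" >&2
            failed=$((failed + 1))
        fi
    done
    # Exit status is non-zero when at least one file failed.
    [ "$failed" -eq 0 ]
)

# Example use:
#   convert_each_pdf ./pdfs ./out/
```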
Capabilities
- Supported input: .pdf (local file or URL)
- Output format: Word (.docx) via -f docx
- Token required (mineru-open-api auth or MINERU_TOKEN env)
- The -o <dir> flag is mandatory; DOCX cannot stream to stdout
- Language hint with --language (default: ch, use en for English)
- Page range with --pages (e.g. 1-10)
- Batch mode supported: extract *.pdf -f docx -o ./out/
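The options in this list appear individually in the examples, and --pages has no example of its own. The sketch below assumes the flags can be combined in a single call, which the source does not state explicitly. build_extract_cmd is a hypothetical helper that only prints the command line, so it can be inspected before anything is run.

```shell
# Hypothetical helper: print an extract command for one PDF.
# Usage: build_extract_cmd <input> <out_dir> [pages] [language]
# The body runs in a subshell, so its variables do not leak to the caller.
# Note: the printed line is for inspection; paths containing spaces would
# need quoting before being pasted into a shell.
build_extract_cmd() (
    input=$1
    out_dir=$2
    pages=${3:-}
    language=${4:-}
    set -- mineru-open-api extract "$input" -f docx
    if [ -n "$pages" ]; then
        set -- "$@" --pages "$pages"
    fi
    if [ -n "$language" ]; then
        set -- "$@" --language "$language"
    fi
    set -- "$@" -o "$out_dir"
    printf '%s\n' "$*"
)

# Example:
#   build_extract_cmd report.pdf ./out/ 1-10 en
#   prints: mineru-open-api extract report.pdf -f docx --pages 1-10 --language en -o ./out/
```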
Notes
- flash-extract does NOT support DOCX output; always use extract with a token
- DOCX output cannot be streamed to stdout; the -o flag is required
- Use --model vlm for PDFs with complex layouts, tables, or mixed content
- Use --model pipeline if you need guaranteed fidelity with no hallucination risk
- Output directory will be created if it does not exist
- All progress/status messages go to stderr
- MinerU is open-source by OpenDataLab (Shanghai AI Lab): https://github.com/opendatalab/MinerU
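Because progress and status messages go to stderr, they can be captured in a log file while the terminal shows only a one-line result. The following is a rough sketch under that assumption: convert_with_log and the MINERU_CLI override are hypothetical names used for illustration, the model is passed explicitly because the CLI's default is not stated here, and the CLI is assumed to exit non-zero on failure.

```shell
# Hypothetical wrapper: run one conversion with an explicit model and append
# the CLI's stderr (progress/status messages) to a log file.
# Usage: convert_with_log <pdf> <out_dir> <model> <log_file>
# MINERU_CLI is an illustrative override for dry runs; it defaults to the
# real CLI name. The body runs in a subshell, so variables stay local.
convert_with_log() (
    pdf=$1
    out_dir=$2
    model=$3      # "vlm" or "pipeline", as described in the notes above
    log_file=$4
    if "${MINERU_CLI:-mineru-open-api}" extract "$pdf" -f docx --model "$model" -o "$out_dir" 2>>"$log_file"; then
        echo "ok: $pdf"
    else
        status=$?
        echo "failed: $pdf (see $log_file)"
        exit "$status"
    fi
)

# Example use:
#   convert_with_log report.pdf ./out/ vlm ./convert.log
```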
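- Make sure OpenClaw is installed (local or Docker deployment)
- Enter the install command in the chat box: /install pdf-to-docx
- Once installation finishes, invoke the Skill by name or trigger it with /pdf-to-docx
- Provide the required inputs according to the Skill's parameter description to get structured output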
What is PDF to DOCX?
Convert PDF documents to Word (.docx) format using MinerU. Transforms PDF files into editable Word documents preserving layout, text, tables, and formatting.... It is an AI Agent Skill plugin for Claude Code / OpenClaw, with 201 cumulative downloads to date.
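How do I install PDF to DOCX?
Run the command /install pdf-to-docx in the OpenClaw or Claude Code chat box to install it in one step. The install itself needs no extra configuration, but a MinerU token must still be configured before converting (see Authentication).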
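Is PDF to DOCX free?
Yes. PDF to DOCX is completely free, released under the MIT-0 license, and can be freely downloaded, installed, and used.
Which platforms does PDF to DOCX support?
PDF to DOCX is cross-platform and runs in any environment where OpenClaw / Claude Code is deployed.
Who develops PDF to DOCX?
It is developed and maintained by mzlzyCA (@mzlzyca). The current version is v0.4.0.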