/install doc-to-text
Doc To Text
Extract plain readable text from Word (.doc/.docx) documents using MinerU. MinerU outputs Markdown, which is the closest format to plain text it supports.
Install
npm install -g mineru-open-api
# or via Go (macOS/Linux):
go install github.com/opendatalab/MinerU-Ecosystem/cli/mineru-open-api@latest
Quick Start
# Extract text from .docx to stdout (no token required)
mineru-open-api flash-extract report.docx
# Save to file
mineru-open-api flash-extract report.docx -o ./out/
# Extract .doc (requires token)
mineru-open-api extract report.doc -o ./out/
# JSON output contains plain text fields (requires token)
mineru-open-api extract report.docx -f json -o ./out/
Authentication
No token needed for flash-extract on .docx. Token required for .doc and extract:
mineru-open-api auth # Interactive token setup
export MINERU_TOKEN="your-token" # Or via environment variable
Create token at: https://mineru.net/apiManage/token
Capabilities
- Supported input: .doc, .docx (local file or URL)
.docx: supportsflash-extract(no token, Markdown output to stdout).doc: requiresextractwith token- For truly plain text: use
extract -f jsonand read the text fields from the JSON output - Language hint with
--language(default:ch, useenfor English)
Notes
- MinerU does not have a
-f textoption; Markdown is the closest to plain text .docrequiresextractwith token;.docxworks withflash-extract- Output goes to stdout by default; use
-o \x3Cdir>to save to a file or directory - All progress/status messages go to stderr; document content goes to stdout
- MinerU is open-source by OpenDataLab (Shanghai AI Lab): https://github.com/opendatalab/MinerU
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat:
/install doc-to-text - After installation, invoke the skill by name or use
/doc-to-text - Provide required inputs per the skill's parameter spec and get structured output
What is Doc To Text?
Extract plain readable text from Word documents (.doc, .docx) using MinerU. Outputs Markdown (the closest plain-text format supported) for easy reading and p... It is an AI Agent Skill for Claude Code / OpenClaw, with 186 downloads so far.
How do I install Doc To Text?
Run "/install doc-to-text" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.
Is Doc To Text free?
Yes, Doc To Text is completely free, licensed under MIT-0. You can download, install and use it at no cost.
Which platforms does Doc To Text support?
Doc To Text is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).
Who created Doc To Text?
It is built and maintained by mzlzyCA (@mzlzyca); the current version is v0.4.0.