/install extract-tables-from-pdf
Extract Tables From Pdf
Convert and extract content from .pdf using MinerU (mineru-open-api).
Install
npm install -g mineru-open-api
# or via Go (macOS/Linux):
go install github.com/opendatalab/MinerU-Ecosystem/cli/mineru-open-api@latest
Quick Start
# Extract tables from PDF (requires token)
mineru-open-api extract report.pdf -o ./out/
# With explicit table flag and OCR for scanned docs
mineru-open-api extract scanned.pdf --ocr --table -o ./out/
Authentication
Token required for extract and crawl:
mineru-open-api auth # Interactive token setup
export MINERU_TOKEN="your-token" # Or via environment variable
Create token at: https://mineru.net/apiManage/token
Capabilities
- Supports local files and URLs
- Requires token (
mineru-open-api authorMINERU_TOKENenv) - Supported input: .pdf
- Language hint with
--language(default:ch, useenfor English) - Page range with
--pages(where applicable)
Notes
- Table recognition requires
extractwith token.flash-extractdoes NOT support tables. Use--tableflag (enabled by default). - Output goes to stdout by default; use
-o \x3Cdir>to save to file - Binary formats (docx) require
-oflag (cannot stream to stdout) - All progress/status messages go to stderr
- MinerU is an open-source project by OpenDataLab (Shanghai AI Lab): https://github.com/opendatalab/MinerU
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat:
/install extract-tables-from-pdf - After installation, invoke the skill by name or use
/extract-tables-from-pdf - Provide required inputs per the skill's parameter spec and get structured output
What is Extract Tables From Pdf?
Extract tables from PDF documents using MinerU's table detection engine. Identifies and extracts structured table data from both native and scanned PDFs. Fea... It is an AI Agent Skill for Claude Code / OpenClaw, with 253 downloads so far.
How do I install Extract Tables From Pdf?
Run "/install extract-tables-from-pdf" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.
Is Extract Tables From Pdf free?
Yes, Extract Tables From Pdf is completely free, licensed under MIT-0. You can download, install and use it at no cost.
Which platforms does Extract Tables From Pdf support?
Extract Tables From Pdf is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).
Who created Extract Tables From Pdf?
It is built and maintained by mzlzyCA (@mzlzyca); the current version is v0.4.0.