/install pdf-reader
PDF Reader
Extract text from any PDF — text-layer or scanned image.
How It Works
PDF received
├─ Has text layer? ──→ pdftotext (fast, high quality)
│ └─ Text too sparse? ──→ Fall back to OCR
└─ Detected as scan? ──→ Skip text, go straight to OCR
pdftoppm → tesseract
Quick Start
Run the bundled script via exec:
bash \x3Cskill-dir>/scripts/pdf-extract.sh /path/to/file.pdf
Save to file:
bash \x3Cskill-dir>/scripts/pdf-extract.sh /path/to/file.pdf --output /tmp/result.txt
Then read /tmp/result.txt with the read tool.
When This Skill Triggers
- User sends a PDF in chat but no
\x3Cfile>text content was injected (only file path visible) - Injected content is empty, garbled, or truncated
- User explicitly asks to read/extract/OCR a PDF file
- A PDF on disk needs text extraction for downstream processing
Typical Workflow
- Identify the PDF file path (usually
/root/.openclaw/media/inbound/...) - Run the extraction script
- Read the output and respond to the user
Example:
# Extract and save
bash \x3Cskill-dir>/scripts/pdf-extract.sh "/root/.openclaw/media/inbound/document.pdf" -o /tmp/pdf-text.txt
# Then use read tool on /tmp/pdf-text.txt
Script Options
| Flag | Description | Default |
|---|---|---|
--lang |
Tesseract languages (validated against allowlist) | chi_sim+eng |
--dpi |
Image resolution for OCR | 300 |
--output / -o |
Save to file instead of stdout | stdout |
--ocr-only |
Force OCR, skip text extraction | off |
--text-only |
Text extraction only, no OCR fallback | off |
--auto-install |
Auto-install missing tools (poppler, tesseract) | off |
Dependencies
By default, the script does not install packages automatically. If tools are missing, it prints install instructions and exits.
To enable auto-install, pass --auto-install:
bash \x3Cskill-dir>/scripts/pdf-extract.sh file.pdf --auto-install
This installs poppler-utils and tesseract-ocr via apt-get, yum, or brew as needed.
Pre-install recommended (run once on the server):
apt-get install -y poppler-utils tesseract-ocr tesseract-ocr-chi-sim
Language Support
Default: Chinese Simplified + English (chi_sim+eng).
The --lang parameter is validated against a strict allowlist of official tesseract language codes. Invalid or malformed values are rejected.
Other languages:
# Japanese + English
bash \x3Cskill-dir>/scripts/pdf-extract.sh file.pdf --lang jpn+eng
# Korean
bash \x3Cskill-dir>/scripts/pdf-extract.sh file.pdf --lang kor
Tesseract language packs are auto-installed based on --lang.
Limitations
- OCR quality depends on scan quality; low-resolution or handwritten PDFs may produce errors
- Encrypted/password-protected PDFs are not supported
- Large PDFs (50+ pages) may take 1-2 minutes for OCR
- Pure-image pages (photos, diagrams without text) produce noise — this is expected
- 确保已安装 OpenClaw(本地或 Docker 部署)
- 在对话框中输入安装命令:
/install pdf-reader - 安装完成后,直接呼叫该 Skill 的名称或使用
/pdf-reader触发 - 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
PDF Reader 是什么?
Extract text from PDF files with automatic OCR fallback for scanned/image-based PDFs. Use when: (1) a user sends a PDF file and the framework did not auto-in... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 126 次。
如何安装 PDF Reader?
在 OpenClaw 或 Claude Code 对话框中运行命令「/install pdf-reader」即可一键安装,无需额外配置。
PDF Reader 是免费的吗?
是的,PDF Reader 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。
PDF Reader 支持哪些平台?
PDF Reader 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。
谁开发了 PDF Reader?
由 Peter Pan(@panpeter2024)开发并维护,当前版本 v1.1.0。