Description

Use this skill whenever the user wants to do anything with PDF files. Triggers include: reading or extracting text/tables from PDFs, combining or merging mul...

README (SKILL.md)

PDF Skill

Name: docs-pdf
Author: echodjx

Complete guide for PDF operations using Python libraries and CLI tools.

⚡ Feature Cheat Sheet

One-line lookup for every supported operation — find the right tool instantly.

What you want to do	Command / Script	One-liner example
📖 Extract text	`scripts/extract_text.py`	`python scripts/extract_text.py doc.pdf`
📊 Extract tables → Excel	`scripts/extract_tables.py`	`python scripts/extract_tables.py report.pdf -o tables.xlsx`
🔗 Merge PDFs	`scripts/merge_pdfs.py`	`python scripts/merge_pdfs.py "*.pdf" -o merged.pdf`
✂️ Split PDF	`scripts/split_pdf.py`	`python scripts/split_pdf.py big.pdf --each`
🔄 Rotate pages	`scripts/batch_convert.py rotate`	`python scripts/batch_convert.py rotate input.pdf -d 90`
🔀 Reorder pages	`scripts/reorder_pdf.py`	`python scripts/reorder_pdf.py input.pdf --order "3,1,2,4-" -o reordered.pdf`
💧 Add text watermark	`scripts/watermark.py`	`python scripts/watermark.py doc.pdf -t "CONFIDENTIAL"`
🖼️ Add image watermark	`scripts/watermark.py`	`python scripts/watermark.py doc.pdf --image logo.png --alpha 0.3`
🔒 Encrypt PDF	pypdf (inline)	see Password Protect below
📝 Fill PDF form	`scripts/fill_pdf_form.py`	`python scripts/fill_pdf_form.py form.pdf -o filled.pdf --set name="Alice"`
🔍 Check form fields	`scripts/check_fillable_fields.py`	`python scripts/check_fillable_fields.py form.pdf`
🖼️ OCR scanned PDF	`scripts/ocr_pdf.py`	`python scripts/ocr_pdf.py scan.pdf --lang eng`
📄 Create PDF from scratch	reportlab (inline)	see references/create.md
📦 Batch operations	`scripts/batch_convert.py`	`python scripts/batch_convert.py merge --help`
📏 Compress / optimize	`scripts/compress_pdf.py`	`python scripts/compress_pdf.py input.pdf -o output.pdf --quality medium`
ℹ️ View PDF info	`scripts/pdf_info.py`	`python scripts/pdf_info.py input.pdf`
🖼️→📄 Images to PDF	`scripts/images_to_pdf.py`	`python scripts/images_to_pdf.py "photos/*.jpg" -o album.pdf --page-size A4`
📄→🖼️ PDF to images	`scripts/pdf_to_images.py`	`python scripts/pdf_to_images.py input.pdf -o pages/ --format png --dpi 200`
🔎 Compare two PDFs	`scripts/compare_pdf.py`	`python scripts/compare_pdf.py old.pdf new.pdf -o diff_report.html`
🔧 Repair corrupted PDF	`scripts/repair_pdf.py`	`python scripts/repair_pdf.py broken.pdf -o fixed.pdf`
🔤 List fonts	`scripts/list_fonts.py`	`python scripts/list_fonts.py input.pdf`

💡 Run any script with --help to see all available options.

Quick Decision Guide

What do you need?
├── Create a new PDF from scratch     → reportlab  (see references/create.md)
├── Extract text / tables             → pdfplumber  (see references/extract.md)
├── Merge / split / rotate pages      → pypdf or qpdf CLI
├── Reorder pages                     → scripts/reorder_pdf.py
├── Add watermark / encrypt / protect → pypdf
├── Fill out a PDF form               → pdf-lib (JS) or pypdf  (see FORMS.md)
├── Extract images from PDF           → pdfimages CLI or pypdf
├── OCR a scanned PDF                 → pdf2image + pytesseract
├── Compress / reduce file size       → scripts/compress_pdf.py (qpdf + pypdf)
├── View PDF info / metadata          → scripts/pdf_info.py
├── Convert images → PDF              → scripts/images_to_pdf.py (reportlab)
├── Convert PDF → images              → scripts/pdf_to_images.py (pdf2image)
├── Compare / diff two PDFs           → scripts/compare_pdf.py
├── Repair a corrupted PDF            → scripts/repair_pdf.py (qpdf + pypdf)
└── List fonts in a PDF               → scripts/list_fonts.py

Installation

Linux (Ubuntu/Debian)

# Python libraries
pip install pypdf pdfplumber reportlab pdf2image pytesseract Pillow --break-system-packages

# System tools
sudo apt-get install -y poppler-utils tesseract-ocr qpdf

# For Chinese OCR
sudo apt-get install -y tesseract-ocr-chi-sim tesseract-ocr-chi-tra

# Node.js (form filling)
npm install pdf-lib

macOS (Homebrew)

# System tools (required for OCR and CLI operations)
brew install qpdf poppler tesseract

# IMPORTANT: Language packs must be installed separately for non-English OCR
brew install tesseract-lang

# Python libraries
pip install pypdf pdfplumber reportlab pdf2image pytesseract Pillow --break-system-packages

# Node.js (form filling)
npm install pdf-lib

⚠️ macOS 注意: tesseract-lang 必须单独安装，否则中文/日文等非英文 OCR 会失败。安装后运行 tesseract --list-langs 确认可用语言。

Verify Installation

# Check Python libraries
python3 -c "import pypdf, pdfplumber, reportlab, PIL; print('✓ Python libs OK')"

# Check system tools
which qpdf       && echo "✓ qpdf OK"       || echo "✗ qpdf not installed"
which tesseract   && echo "✓ tesseract OK"  || echo "✗ tesseract not installed"
which pdftotext   && echo "✓ poppler OK"    || echo "✗ poppler not installed"

# Check OCR languages
tesseract --list-langs 2>/dev/null | head -5

Core Operations

Read & Extract Text

import pdfplumber

with pdfplumber.open("document.pdf") as pdf:
    for page in pdf.pages:
        print(page.extract_text())

→ For advanced extraction options, see references/extract.md

Extract Tables → DataFrame

import pdfplumber, pandas as pd

with pdfplumber.open("report.pdf") as pdf:
    for page in pdf.pages:
        for table in page.extract_tables():
            df = pd.DataFrame(table[1:], columns=table[0])
            print(df)

Merge PDFs

from pypdf import PdfWriter, PdfReader

writer = PdfWriter()
for path in ["a.pdf", "b.pdf", "c.pdf"]:
    writer.append(PdfReader(path))
with open("merged.pdf", "wb") as f:
    writer.write(f)

Split PDF

from pypdf import PdfReader, PdfWriter

reader = PdfReader("input.pdf")
for i, page in enumerate(reader.pages):
    w = PdfWriter()
    w.add_page(page)
    with open(f"page_{i+1}.pdf", "wb") as f:
        w.write(f)

Rotate Pages

reader = PdfReader("scan.pdf")
writer = PdfWriter()
for page in reader.pages:
    page.rotate(90)   # 90 / 180 / 270
    writer.add_page(page)
with open("rotated.pdf", "wb") as f:
    writer.write(f)

Password Protect

from pypdf import PdfReader, PdfWriter

reader = PdfReader("doc.pdf")
writer = PdfWriter()
for page in reader.pages:
    writer.add_page(page)
writer.encrypt("user_pass", "owner_pass", use_128bit=False)  # AES-256
with open("encrypted.pdf", "wb") as f:
    writer.write(f)

CLI Quick Reference (qpdf)

# Merge
qpdf --empty --pages a.pdf b.pdf -- merged.pdf

# Extract pages 1-5
qpdf input.pdf --pages . 1-5 -- out.pdf

# Rotate all pages 90°
qpdf input.pdf output.pdf --rotate=+90

# Remove password
qpdf --password=secret --decrypt locked.pdf unlocked.pdf

# Linearize (web-optimized)
qpdf --linearize input.pdf output.pdf

Available Scripts

Use these scripts directly — no need to rewrite from scratch:

Script	Purpose
`scripts/extract_text.py`	Extract all text, page by page, to .txt
`scripts/extract_tables.py`	Extract all tables to .xlsx
`scripts/merge_pdfs.py`	Merge multiple PDFs from a glob pattern
`scripts/split_pdf.py`	Split by page ranges
`scripts/reorder_pdf.py`	Reorder pages (flexible syntax: "3,1,2,4-")
`scripts/watermark.py`	Add text or image watermark
`scripts/ocr_pdf.py`	Full OCR pipeline for scanned PDFs
`scripts/batch_convert.py`	Batch operations (merge/split/rotate) CLI
`scripts/check_fillable_fields.py`	List all form fields in a PDF
`scripts/fill_pdf_form.py`	Fill AcroForm fields programmatically
`scripts/create_test_form.py`	Generate a sample fillable PDF form for testing
`scripts/compress_pdf.py`	Compress / optimize PDF to reduce file size
`scripts/pdf_info.py`	View PDF metadata, page count, encryption, fonts
`scripts/images_to_pdf.py`	Convert images (JPG/PNG/etc.) to PDF
`scripts/pdf_to_images.py`	Convert PDF pages to PNG/JPEG images
`scripts/compare_pdf.py`	Compare two PDFs and generate diff report
`scripts/repair_pdf.py`	Attempt to repair corrupted PDF files
`scripts/list_fonts.py`	List all fonts used in a PDF

Run any script with --help to see its options.

Reference Files

Load these when you need deeper guidance:

references/create.md — Building PDFs from scratch with reportlab (Platypus, Canvas, styles, tables, headers/footers)
references/extract.md — Advanced text/table/image extraction, coordinate-based cropping, word-level data
references/security.md — Watermarks, encryption, permissions, digital signatures
references/ocr.md — OCR pipeline, language packs, image preprocessing, quality tuning
FORMS.md — Complete guide to PDF form filling (AcroForm + XFA, pdf-lib JS)

Quick Reference Table

Task	Best Tool	Key Method
Extract text	pdfplumber	`page.extract_text()`
Extract tables	pdfplumber	`page.extract_tables()`
Merge PDFs	pypdf	`writer.append()`
Split PDFs	pypdf	one page per writer
Rotate pages	pypdf	`page.rotate(90)`
Reorder pages	pypdf	`writer.add_page(reader.pages[i])`
Create PDF	reportlab	Platypus or Canvas
Watermark	pypdf + reportlab	`page.merge_page()`
Encrypt	pypdf	`writer.encrypt()`
Fill form	pypdf / pdf-lib	see FORMS.md
OCR scanned	pytesseract	see references/ocr.md
Compress PDF	qpdf + pypdf	`compress_identical_objects()`
View PDF info	pypdf	`PdfReader` metadata + fields
Images → PDF	reportlab	`canvas.drawImage()`
PDF → images	pdf2image	`convert_from_path()`
Compare PDFs	pdfplumber + difflib	text diff per page
Repair PDF	qpdf / pypdf	`qpdf --linearize` or re-write
List fonts	pypdf	page `/Resources` → `/Font`
CLI merge	qpdf	`--empty --pages`
Extract images	pypdf / pdfimages	`page.images`

Common Pitfalls

Never use Unicode subscripts/superscripts (₂, ⁰) in reportlab — use \x3Csub> / \x3Csuper> XML tags instead, or they render as black boxes
pdfplumber, not pypdf, for text extraction — pypdf's extract_text() loses layout; pdfplumber is layout-aware
Encrypted PDFs: pass password= to PdfReader() and pdfplumber.open()
pip in sandbox: always add --break-system-packages flag
qpdf for speed: for large batch jobs, prefer qpdf CLI over Python loops
macOS OCR 语言包: brew install tesseract 仅含英文；非英文 OCR 需额外执行 brew install tesseract-lang
macOS 系统依赖: OCR 和 CLI 操作需先安装 brew install qpdf poppler tesseract
测试表单填充: 没有可填写 PDF 时，先运行 python scripts/create_test_form.py 生成测试表单
OCR vs pdfplumber: OCR 只适用于扫描件（图片型 PDF）。对原生文本 PDF 提取内容，应使用 pdfplumber（更快更准）
中文表单填充: pypdf 内置字体不支持 CJK 字符，中文值可能显示为方块。需要中文表单填充时，使用 pdf-lib (JS) 方案（见 FORMS.md）
旋转页面: 没有独立 rotate 脚本，使用 python scripts/batch_convert.py rotate input.pdf -d 90

⛔ Limitations (Not Suitable For)

场景	原因	替代方案
复杂排版 PDF（杂志、海报）	提取会丢失格式布局	使用专业排版工具
扫描件中的表格提取	OCR 表格精度有限	使用专业表格识别工具如 Camelot
CJK 字符的表单填充	pypdf 内置字体不含 CJK	使用 pdf-lib (JS)，见 FORMS.md
超大 PDF (>500MB)	内存可能不足	用 qpdf CLI 或分批处理

Usage Guidance

This skill appears to be a genuine PDF toolkit. Before installing or running it: (1) review the included scripts (they run locally and will read/write files) and only run them on files you trust; (2) install the recommended system packages from official package managers (apt, brew) and Python/Node packages from recognized registries; (3) be aware the skill's trigger rules are broad — the agent may suggest using it whenever 'PDF' is mentioned; and (4) because the skill runs code from an unknown source (no homepage provided), consider running it in a restricted environment (container or VM) if you need stronger isolation or if PDFs contain sensitive data.

Capability Analysis

Type: OpenClaw Skill Name: docs-pdf Version: 1.0.3 The docs-pdf skill bundle is a comprehensive and well-documented set of utilities for PDF manipulation, including text/table extraction, merging, splitting, OCR, and form filling. It utilizes standard, reputable libraries such as pypdf, pdfplumber, reportlab, and pytesseract, along with system tools like qpdf. The scripts follow secure coding practices, such as using argument lists in subprocess calls to prevent shell injection, and the instructions in SKILL.md are strictly focused on PDF processing without any signs of prompt injection or malicious intent.

Capability Assessment

✓ Purpose & Capability

The name/description (PDF operations) matches the included scripts and SKILL.md: all scripts perform PDF tasks (extracting text/tables, OCR, merge/split, watermarking, etc.). The skill does not request unrelated credentials or config paths. The only mild mismatch is that the registry metadata lists no required binaries while the README/installation instructions document several external system tools (qpdf, poppler, tesseract) and Python/Node packages — but those are legitimate dependencies for the stated PDF operations.

ℹ Instruction Scope

The SKILL.md and scripts are focused on local file operations and standard CLI/Python libraries. They instruct installing system packages and running included scripts against user-provided PDFs. The trigger guidance is broad ('mention "PDF" in any context'), which may cause the agent to invoke the skill frequently; otherwise the runtime instructions do not read unrelated system files, environment variables, or contact external endpoints.

✓ Install Mechanism

There is no automated install spec in the registry (no download/extract), and the SKILL.md suggests installing common packages via apt/homebrew/pip/npm. All recommended installs come from standard package managers (apt, brew, pip, npm) — no obscure or remote download URLs are used. The skill includes multiple Python scripts that will run locally when invoked.

✓ Credentials

The skill declares no required environment variables or primary credential, and the scripts do not attempt to read secrets or other environment variables. The only environment impact is the need to install/availability of system tools (qpdf, poppler, tesseract) and Python/Node packages, which are proportional to the functionality.

✓ Persistence & Privilege

The skill is not force-included (always:false) and does not request persistent system-wide privileges. It does contain executable scripts that, when run by the agent, will operate on local files — which is expected for a file-manipulation skill. There is no indication the skill modifies other skills or global agent settings.

Version History

v1.0.3

**Expanded PDF utilities and tools.** - Added scripts for compressing, inspecting, repairing, and reordering PDFs, as well as font listing and PDF comparison. - New conversion tools: images-to-PDF and PDF-to-images scripts included. - README and skill trigger description updated to reflect new operations: compress/optimize, info/metadata, compare, repair, convert, reorder, and font listing. - Installation instructions updated (now requires Pillow for image conversions). - Quick decision guide and feature table extended to cover all new tools.

v1.0.2

docs-pdf v1.0.2 - Added support for image watermarks with transparency (`--image`, `--alpha`) in `scripts/watermark.py`. - Updated documentation to include image watermarking and clarify usage. - Added installation verification commands to SKILL.md for easier troubleshooting. - Minor usability improvements and clarifications in scripts and documentation.

v1.0.1

docs-pdf 1.0.1 - Added a sample fillable PDF form script: scripts/create_test_form.py, making it easier to generate test PDF forms. - New .codebuddy/memory/2026-03-12.md file added. - SKILL.md updated with a "Feature Cheat Sheet" table and clearer platform-specific (Linux/macOS) install instructions. - The "Available Scripts" section now lists the new test form generator. - Various documentation improvements and formatting tweaks for quick reference.

v1.0.0

Initial release of the docs-pdf skill: a complete reference toolkit for handling PDF files via Python and CLI. - Supports text/table extraction, merging, splitting, rotation, watermarking, encryption, form filling, image extraction, and OCR. - Quick decision guide and reference tables to choose the right library or CLI tool for any PDF task. - Step-by-step Python code snippets and CLI commands for common PDF operations. - Ready-to-use scripts for batch processing, extraction, splitting, watermarking, OCR, and form handling. - Installation instructions for all necessary Python packages and system dependencies. - Troubleshooting advice for common pitfalls in PDF manipulation.

Metadata

Slug docs-pdf

Version 1.0.3

License MIT-0

All-time Installs 2

Active Installs 2

Total Versions 4

Frequently Asked Questions

What is docs-pdf?

Use this skill whenever the user wants to do anything with PDF files. Triggers include: reading or extracting text/tables from PDFs, combining or merging mul... It is an AI Agent Skill for Claude Code / OpenClaw, with 531 downloads so far.

How do I install docs-pdf?

Run "/install docs-pdf" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is docs-pdf free?

Yes, docs-pdf is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does docs-pdf support?

docs-pdf is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created docs-pdf?

It is built and maintained by echodjx (@echodjx); the current version is v1.0.3.

More Skills

docs-pdf