/install pdf-filler
pdf-filler
Operate on PDF AcroForms: list every field with its type and current value, then fill the PDF with values supplied as JSON. The skill calls a small Python package (oc-pdf-filler) that wraps a fallback chain of PDF libraries, so a single recalcitrant PDF doesn't block the workflow.
Workspace rules (read this first)
Many agent hosts run inside a sandbox that only allows reads/writes inside a specific workspace folder. Files written outside that folder show up as "Unavailable / Outside allowed folders" and the user can't download them.
The CLI enforces this for you:
- The "workspace" is resolved from the first set environment variable in this list,
falling back to the current working directory:
OC_PDF_FILLER_WORKSPACE,OPENCLAW_WORKSPACE,CLAWHUB_WORKSPACE,AGENT_WORKSPACE,SKILL_WORKSPACE,WORKSPACE. - All output paths (schema JSON and filled PDF) are resolved relative to the workspace.
- If you pass an absolute
--outputthat points outside the workspace, the CLI rewrites it to the same basename inside the workspace (and prints a warning to stderr). - You can override the workspace explicitly with
--workspace DIR.
In practice: pass relative paths (e.g. -o form_done.pdf), or omit --output
entirely. The default is \x3Cinput-stem>_done.pdf inside the workspace.
When to use
Trigger this skill when the user:
- Asks to inspect, list, or extract the form fields of a PDF
- Wants to fill out / populate / complete a PDF form programmatically
- Mentions AcroForm, checkbox, radio button, or dropdown handling in a PDF
- Has a batch of PDFs to fill from structured data (JSON)
Setup (once per workspace)
The skill scripts call the oc-pdf-filler Python package. Install it first:
pip install "oc-pdf-filler[all]"
# or, if working from the source repo: pip install -e ".[all]"
The [all] extra pulls in pdfrw and PyMuPDF for the full fallback chain. Install pdftk from your package manager for the last-resort backend (optional, but useful for stubborn PDFs).
Verify which backends are active:
python scripts/list_backends.py
Step 1: Extract the field schema
Always extract first so you know the exact field names and types before constructing the JSON values file. Use a workspace-relative path for the output (the CLI confines it to the workspace automatically).
python scripts/extract.py /path/to/form.pdf --output schema.json --include-values
Each entry in the resulting JSON has:
name: the AcroForm field name (use this verbatim as the key when filling)type: one oftext,checkbox,radio,choice,signature,pushbutton,unknownoptions: for radios and checkboxes, the accepted export values; for choices, the dropdown optionsvalue: current value if the form is partially filled (only when--include-valuesis set)max_length,multiline,required,read_only: hints for validation
See references/FIELD_TYPES.md for the value contract per field type.
Step 2: Build a values JSON file
The fill input is a flat JSON object { "FieldName": value }. Example:
{
"Name Verantwortlicher": "ACME GmbH",
"Postleitzahl Verantwortlicher": "10115",
"Beschäftigte": true,
"Verarbeitungstyp": "Automatisiert"
}
A starter template is included at assets/values.example.json.
Critical: include every checkbox and radio explicitly
LLMs tend to omit fields they're unsure about, which silently leaves checkboxes unchecked in the output PDF. Don't do that. For every field in the schema:
- Checkbox (
type: checkbox): settrueorfalse. If the user didn't mention it, default tofalserather than omitting the key. - Radio (
type: radio): set the exact export string fromoptions. If the user didn't pick one, leave it out only if it's truly optional; otherwise ask the user or pick the most plausible value. - Text / choice / signature: omit only if the field is genuinely blank.
If you are unsure for a checkbox, choose false, not omission. The CLI's unset_checkboxes and unset_radios summary fields tell you which fields were left out so you can self-correct on the next pass. As safety nets you can pass:
--default-unset-checkboxes offto force every untouched checkbox to false in one go.--default-unset-radios firstto pick the first available option for every untouched radio group.
Step 3: Fill the PDF
Omit --output to get the recommended default \x3Cinput-stem>_done.pdf inside the workspace, or pass a workspace-relative filename. Absolute paths outside the workspace are automatically rewritten into it (the host's sandbox would reject them otherwise).
# preferred: keep the original name with _done suffix, in the workspace
python scripts/fill.py /path/to/form.pdf values.json
# or pass an explicit relative path
python scripts/fill.py /path/to/form.pdf values.json --output form_done.pdf
By default the orchestrator uses --backend auto, walking the chain pypdf -> pdfrw -> PyMuPDF -> pdftk and stopping at the first backend that fills every field.
Useful flags:
--backend pymupdf-- force a specific backend (e.g. when the auto winner produces a PDF that doesn't render correctly in your viewer)--best-effort-- chain backends so partial fills accumulate (use when no single backend handles every field)--flatten-- bake values into the PDF so they can't be edited (best support: PyMuPDF, pdftk)--strict-- exit non-zero if any requested field is missing or unfillable
The script prints a JSON summary including winning_backend, workspace, output_path (absolute path of the resulting PDF, always inside the workspace), filled, missing, failed, unset_checkboxes, unset_radios, and per-attempt details. If filling fails, see references/BACKENDS.md for backend-specific troubleshooting tips.
Delivering the result to the user
Send the file back automatically as soon as the fill succeeds. Do not wait for the user to ask. Users on chat platforms (Telegram, Slack, Discord, Teams, the ClawHub web client, etc.) expect the finished PDF to arrive in the conversation as an attachment immediately after you announce success.
Procedure:
- Read
output_pathfrom the fill summary. It is guaranteed to be inside the workspace, so the host can attach it. - Use whatever attachment / file-return mechanism the host provides — Telegram bot
sendDocument, Slackfiles.upload, the agent runtime's ownattach_file/send_filetool, or simply emit it as a workspace artifact. Do not ask the user "do you want me to send it?"; just send it. - In the same turn, briefly confirm what was sent (filename + filled-field count). The user shouldn't have to ask twice.
- If you genuinely cannot find an attachment channel, surface the workspace-relative path so the user can fetch it manually — but treat that as a last resort.
Never write the PDF outside the workspace (e.g. /tmp, /var, your home directory). Sandboxed hosts will mark it as "Unavailable / Outside allowed folders".
End-to-end example
python scripts/extract.py form.pdf -o schema.json
# ... agent inspects schema.json, builds values.json based on user input ...
python scripts/fill.py form.pdf values.json
# writes \x3Cworkspace>/form_done.pdf
After filling, re-run extract.py --include-values form_done.pdf and confirm the values stuck before delivering the PDF to the user.
Notes and edge cases
- Field names may contain spaces, German umlauts, or punctuation. Always copy them verbatim from
extract.pyoutput. - For radio groups, set the value to the export name of the chosen option (one of the strings in
options), not a boolean. - Signature fields (
type: signature) are reported but not auto-filled. - Encrypted PDFs are out of scope; the tool will surface the underlying library error.
- Some PDF viewers cache appearance streams; if a viewer shows blank fields after filling, try opening with a different viewer or use
--flatten.
- 确保已安装 OpenClaw(本地或 Docker 部署)
- 在对话框中输入安装命令:
/install pdf-filler - 安装完成后,直接呼叫该 Skill 的名称或使用
/pdf-filler触发 - 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
Pdf Filler 是什么?
Extract and fill PDF AcroForm fields with a multi-backend fallback chain. Reads field schemas (text inputs, checkboxes, radio buttons, dropdowns, multi-line... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 55 次。
如何安装 Pdf Filler?
在 OpenClaw 或 Claude Code 对话框中运行命令「/install pdf-filler」即可一键安装,无需额外配置。
Pdf Filler 是免费的吗?
是的,Pdf Filler 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。
Pdf Filler 支持哪些平台?
Pdf Filler 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。
谁开发了 Pdf Filler?
由 qubit999(@qubit999)开发并维护,当前版本 v0.1.5。