← Back to Skills Marketplace
718
Downloads
0
Stars
12
Active Installs
3
Versions
Install in OpenClaw
/install pdf-toolkit
Description
Run a local script to work with PDF files, DOCX documents, OCR, and text-to-speech. Use the read tool to load this SKILL.md, then exec the uv run command ins...
Usage Guidance
This skill appears to do what it says: local PDF/DOCX processing, OCR, and text→speech. Before installing or running it: 1) Be aware it operates on any filesystem path you provide — do not pass paths to secrets or system files you don't want read or written. 2) TTS uses edge-tts (network): avoid sending sensitive text to TTS because it will leave the host. 3) The tool uses 'uv run' to fetch Python packages at runtime (packages are listed but not version-pinned); if you require stricter supply-chain controls, run it in an isolated environment or pin/verify dependency versions. 4) Optional binaries (ffmpeg, tesseract, pdfimages, pandoc, libreoffice) are only needed for their respective features — install them only if you use those features. 5) If you want maximum safety, review the included source files locally and run the skill in a sandboxed container before granting it access to important host paths.
Capability Analysis
Type: OpenClaw Skill
Name: pdf-toolkit
Version: 0.0.3
The pdf-toolkit bundle is a comprehensive utility for PDF and document manipulation, including OCR and text-to-speech capabilities. It uses standard Python libraries (pypdf, reportlab) and system binaries (tesseract, ffmpeg, pandoc) for its stated purposes. While it performs network requests for the TTS feature (via edge-tts) and has broad file system access, these behaviors are explicitly disclosed and warned about in SKILL.md. The code follows security best practices by using list-based arguments in subprocess calls to prevent shell injection.
Capability Assessment
Purpose & Capability
The name/description match the included code: the scripts provide PDF metadata, extraction, merging, splitting, OCR, DOCX read/write, conversion, and TTS. The only required binary declared in the registry is 'uv', which matches the SKILL.md and the included wrapper (pdf-toolkit.sh) that executes 'uv run'. Optional system binaries (ffmpeg, tesseract, pdfimages, pandoc, libreoffice) are documented in SKILL.md and used only when those features are invoked.
Instruction Scope
SKILL.md instructs the agent to run 'uv run' on the included main.py and documents which system tools are optional. The skill explicitly states it will read/write any host path the caller supplies; the code resolves and opens absolute paths, so if a user provides an arbitrary system path the skill will read/write it. The only external network behavior described is edge-tts for TTS. This broad filesystem access is coherent for a file-manipulation tool but is a sensitive capability the user should be aware of.
Install Mechanism
There is no formal install spec; this is run via 'uv run', and main.py includes a header listing Python dependencies. That means uv will fetch and install (or use) packages from the Python ecosystem at runtime. Dependencies are not pinned to specific versions in the header. Runtime installation from PyPI is expected for this skill's Python libraries, but it increases the attack surface compared to a purely local, pre-vetted bundle.
Credentials
The skill requests no environment variables or credentials (primaryEnv none). That's proportionate. However, it will send TTS text to an external service (edge-tts) if TTS is used — SKILL.md warns about this. The code will access any filesystem path supplied by the caller (including ~ and absolute paths), which is necessary for the tool but can expose sensitive files if the user supplies them.
Persistence & Privilege
always:false and user-invocable:true; the skill does not request elevated/system-wide persistence and does not modify other skills or agent-wide configuration. It runs on demand and does not require automatic always-on privileges.
How to Use
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat:
/install pdf-toolkit - After installation, invoke the skill by name or use
/pdf-toolkit - Provide required inputs per the skill's parameter spec and get structured output
Version History
v0.0.3
### v0.0.3 Changelog
- Added "Chat Delivery" section to SKILL.md with guidelines for sending generated files as chat attachments and cleaning up temporary files.
- Clarified file delivery and deletion behavior for output files in chat interfaces.
- No functional code or command changes; documentation update only.
v0.0.2
- Added a new "File Access And Network Behavior" section to document file handling and network usage.
- Clarified that the skill can read from and write to any file path provided by the caller, not just workspace paths.
- Documented that the `tts` command uses `edge-tts` and sends text over the network; warned users not to send sensitive/private text.
- Explained that all other commands run locally, except for required optional binaries.
- No changes to supported commands or core functionality.
v0.0.1
Initial release of pdf-toolkit.
- Provides commands to extract text, tables, and images from PDFs.
- Supports merging, splitting, rotating, and creating PDFs.
- Includes DOCX reading and writing capabilities.
- Adds OCR support for scanned PDFs (requires tesseract).
- Offers text-to-speech output to MP3 (requires ffmpeg).
- Enables document format conversion (requires pandoc or libreoffice).
- `doctor` command checks availability of optional system tools.
Metadata
Frequently Asked Questions
What is Pdf Toolkit?
Run a local script to work with PDF files, DOCX documents, OCR, and text-to-speech. Use the read tool to load this SKILL.md, then exec the uv run command ins... It is an AI Agent Skill for Claude Code / OpenClaw, with 718 downloads so far.
How do I install Pdf Toolkit?
Run "/install pdf-toolkit" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.
Is Pdf Toolkit free?
Yes, Pdf Toolkit is completely free (open-source). You can download, install and use it at no cost.
Which platforms does Pdf Toolkit support?
Pdf Toolkit is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).
Who created Pdf Toolkit?
It is built and maintained by youpele52 (@youpele52); the current version is v0.0.3.
More Skills