Pdf Parser Agent
by Ezequiel Techera · GitHub · v1.0.1 · MIT-0
158 Downloads · 0 Stars · 1 Active Installs · 2 Versions
Install in OpenClaw
/install pdf-parser-agent
Description
Parses local PDF files into structured Markdown and JSON using opendataloader-pdf for deterministic, local document content extraction.
Usage Guidance
This skill appears to do what it says: convert local PDFs using the opendataloader-pdf package. Before installing or running it:
- Inspect and vet the external dependency (opendataloader-pdf) you will pip-install; that package will execute on your machine and is the primary runtime risk. Prefer to install it in a fresh virtualenv rather than system-wide.
- Note the script adds your user-site-packages to sys.path, so whatever is installed there will be imported. If you share an environment, ensure no untrusted packages are present in user-site.
- The script validates that input files are inside the current working directory; still run it in a controlled workspace to avoid accidental processing of sensitive files.
- The included package.json is unexpected for a Python-only skill but appears harmless; it may be leftover metadata.
- If you need stronger isolation, run this tool in a container or VM and audit opendataloader-pdf's behavior (it may spawn Java or other subprocesses according to the tests/notes).
Capability Analysis
Type: OpenClaw Skill
Name: pdf-parser-agent
Version: 1.0.1
The skill is a straightforward utility for parsing local PDF files into Markdown or JSON using the 'opendataloader-pdf' library. It includes proactive security measures such as path traversal validation in 'scripts/pdf_extractor.py' to ensure file access remains within the workspace, and the code logic is well-documented and aligned with its stated purpose.
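The path traversal validation mentioned above can be sketched as follows. This is a minimal illustration of the general technique, not the code in 'scripts/pdf_extractor.py'; the function name `is_inside_workspace` and its parameters are hypothetical.

```python
from pathlib import Path
from typing import Optional


def is_inside_workspace(candidate: str, workspace: Optional[str] = None) -> bool:
    """Return True if `candidate` resolves to a path inside `workspace`.

    Hypothetical helper: the real script's check may differ in detail.
    """
    # Default to the current working directory, as the listing describes.
    root = Path(workspace).resolve() if workspace is not None else Path.cwd().resolve()
    # Joining with an absolute candidate discards `root`, so absolute paths
    # outside the workspace are rejected by the containment test below.
    target = (root / candidate).resolve()
    # resolve() collapses ".." segments and follows symlinks before comparing.
    return target == root or root in target.parents
```

Resolving both paths before comparing them is what defeats inputs such as `../secret.pdf`; a plain string-prefix comparison would not be sufficient.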
Capability Assessment
Purpose & Capability
Name and description match the included script and SKILL.md: the skill runs a local Python-based converter (opendataloader-pdf) on local PDF files. The only minor oddity is an included package.json (Node metadata) despite this being a Python script; this appears cosmetic and does not contradict the stated purpose.
Instruction Scope
SKILL.md instructs the agent to run a local script against local PDFs and references a single dependency (opendataloader-pdf). The runtime instructions do not request unrelated files, environment variables, or external endpoints. The script does append the user's site-packages path to sys.path to locate a --user pip installation, which is reasonable for a dependency lookup but means it will import whatever opendataloader-pdf is installed in the user's site.
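The sys.path behavior described above might look roughly like the sketch below. It assumes the script uses the standard library's `site.getusersitepackages()`; the function name `add_user_site_to_path` is hypothetical and the actual script may do this differently.

```python
import site
import sys


def add_user_site_to_path() -> str:
    """Append the per-user site-packages directory to sys.path if absent.

    Hypothetical helper illustrating the lookup described in the listing.
    """
    user_site = site.getusersitepackages()
    # Appending (rather than inserting at index 0) keeps already-configured
    # locations ahead of the user site during import resolution.
    if user_site not in sys.path:
        sys.path.append(user_site)
    return user_site
```

Once this runs, any package present in that directory becomes importable, which is why a shared or untrusted user site is worth checking first.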
Install Mechanism
No install spec is provided by the skill (instruction-only), so nothing is downloaded or written by the skill itself. Dependency installation is left to the user (pip install --user opendataloader-pdf). This is low-risk for the skill bundle, though the external Python package remains a separate trust decision.
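Because installation is left to the user, it can help to confirm the dependency is importable before invoking the skill. The sketch below is one way to do that with the standard library; the helper name `is_importable` is hypothetical, and the import name `opendataloader_pdf` in the comment is an assumption about how the opendataloader-pdf distribution exposes its module.

```python
import importlib.util


def is_importable(module_name: str) -> bool:
    """Return True if a top-level module can be located without importing it."""
    # find_spec only locates a top-level module; it does not execute its code.
    return importlib.util.find_spec(module_name) is not None


# Assumed import name for the opendataloader-pdf package:
# is_importable("opendataloader_pdf")
```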
Credentials
The skill declares no environment variables, credentials, or config paths and its code does not read secrets. It only reads a user-supplied local file path (validated to be inside the current workspace) and writes output to a specified directory — which is proportionate to the stated function.
Persistence & Privilege
The skill does not request persistent or elevated presence (always:false). It does not modify other skills or system-wide agent settings. Autonomous invocation is allowed by default but is not combined with any broad credential access or unusual privileges.
How to Use
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat: /install pdf-parser-agent
- After installation, invoke the skill by name or use /pdf-parser-agent
- Provide the required inputs per the skill's parameter spec and get structured output
Version History
v1.0.1
- Added a "Core Technology Attribution" section, crediting **bundolee** and **claude** as original authors of `opendataloader-pdf`.
- No changes to functionality or usage; documentation update only.
v1.0.0
Initial release of pdf-parser-agent skill:
- Parses local PDF files into structured Markdown and JSON formats using opendataloader-pdf.
- Operates locally for deterministic extraction, avoiding LLM context limitations.
- Dynamically locates user-installed dependencies for improved portability.
- Includes example usage and implementation notes in documentation.
Frequently Asked Questions
What is Pdf Parser Agent?
Parses local PDF files into structured Markdown and JSON using opendataloader-pdf for deterministic, local document content extraction. It is an AI Agent Skill for Claude Code / OpenClaw, with 158 downloads so far.
How do I install Pdf Parser Agent?
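Run "/install pdf-parser-agent" in the OpenClaw or Claude Code chat to install the skill in one step. The skill itself does not install its Python dependency, so install opendataloader-pdf separately (for example, pip install --user opendataloader-pdf) before running it.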
Is Pdf Parser Agent free?
Yes, Pdf Parser Agent is completely free, licensed under MIT-0. You can download, install and use it at no cost.
Which platforms does Pdf Parser Agent support?
Pdf Parser Agent is cross-platform and runs anywhere OpenClaw / Claude Code is available.
Who created Pdf Parser Agent?
It is built and maintained by Ezequiel Techera (@trshdesigns); the current version is v1.0.1.