trshdesigns

Pdf Parser Agent

by Ezequiel Techera · GitHub ↗ · v1.0.1 · MIT-0
cross-platform ✓ Security Clean
Downloads 158
Stars 0
Active Installs 1
Versions 2
Install in OpenClaw
/install pdf-parser-agent
Description
Parses local PDF files into structured Markdown and JSON using opendataloader-pdf for deterministic, local document content extraction.
Usage Guidance
This skill appears to do what it says: convert local PDFs using the opendataloader-pdf package. Before installing or running it:
- Inspect and vet the external dependency (opendataloader-pdf) you will pip-install; that package will execute on your machine and is the primary runtime risk. Prefer to install it in a fresh virtualenv rather than system-wide.
- Note the script adds your user-site-packages to sys.path, so whatever is installed there will be imported. If you share an environment, ensure no untrusted packages are present in user-site.
- The script validates that input files are inside the current working directory; still run it in a controlled workspace to avoid accidental processing of sensitive files.
- The included package.json is unexpected for a Python-only skill but appears harmless; it may be leftover metadata.
- If you need stronger isolation, run this tool in a container or VM and audit opendataloader-pdf's behavior (it may spawn Java or other subprocesses according to the tests/notes).
Capability Analysis
Type: OpenClaw Skill
Name: pdf-parser-agent
Version: 1.0.1
The skill is a straightforward utility for parsing local PDF files into Markdown or JSON using the 'opendataloader-pdf' library. It includes proactive security measures such as path traversal validation in 'scripts/pdf_extractor.py' to ensure file access remains within the workspace, and the code logic is well-documented and aligned with its stated purpose.
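The path traversal validation mentioned above is not reproduced on this page. As a rough illustration only, a check of that kind could look like the sketch below. The function name `resolve_inside_workspace` and its parameters are hypothetical and are not taken from 'scripts/pdf_extractor.py'.

```python
from pathlib import Path
from typing import Optional


def resolve_inside_workspace(user_path: str, workspace: Optional[Path] = None) -> Path:
    """Resolve user_path and reject it if it lands outside the workspace.

    Hypothetical helper for illustration; the skill's real check may differ.
    """
    # Default to the current working directory, as the listing describes.
    root = (workspace or Path.cwd()).resolve()
    # Joining with an absolute user_path yields that absolute path,
    # so absolute inputs are checked the same way as relative ones.
    candidate = (root / user_path).resolve()
    try:
        # relative_to raises ValueError when candidate is not under root.
        candidate.relative_to(root)
    except ValueError:
        raise ValueError(f"{user_path!r} resolves outside the workspace {root}")
    return candidate
```

Resolving both paths before comparing them means that `..` segments and symlinks are collapsed first, so an input such as `../secret.pdf` is rejected rather than silently followed.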
Capability Assessment
Purpose & Capability
Name and description match the included script and SKILL.md: the skill runs a local Python-based converter (opendataloader-pdf) on local PDF files. The only minor oddity is an included package.json (Node metadata) despite this being a Python script; this appears cosmetic and does not contradict the stated purpose.
Instruction Scope
SKILL.md instructs the agent to run a local script against local PDFs and references a single dependency (opendataloader-pdf). The runtime instructions do not request unrelated files, environment variables, or external endpoints. The script does append the user's site-packages path to sys.path to locate a --user pip installation, which is reasonable for a dependency lookup but means it will import whatever opendataloader-pdf is installed in the user's site.
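To make the sys.path behavior concrete, here is a minimal sketch of how a script might add the user site-packages directory to the import path. It uses the standard library's `site.getusersitepackages()`; the function name `ensure_user_site_on_path` is hypothetical, and the skill's actual code may do this differently.

```python
import site
import sys


def ensure_user_site_on_path() -> str:
    """Append the per-user site-packages directory to sys.path if missing.

    Hypothetical helper for illustration; not copied from the skill.
    """
    user_site = site.getusersitepackages()
    if user_site not in sys.path:
        # Appending (rather than inserting at index 0) keeps packages from
        # the active environment ahead of anything in the user site.
        sys.path.append(user_site)
    return user_site
```

Anything importable from that directory becomes visible to the script, which is why the guidance on this page recommends keeping untrusted packages out of the user site.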
Install Mechanism
No install spec is provided by the skill (instruction-only), so nothing is downloaded or written by the skill itself. Dependency installation is left to the user (pip install --user opendataloader-pdf). This is low-risk for the skill bundle, though the external Python package remains a separate trust decision.
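Because the dependency is installed separately, it can help to confirm it is present before invoking the skill. The sketch below is one possible preflight check. It assumes the package is importable as `opendataloader_pdf` (an assumption, not confirmed by this page) and also reports whether a `java` executable is on PATH, since the guidance here notes that the package may spawn Java.

```python
import importlib.util
import shutil


def preflight(module_name: str = "opendataloader_pdf") -> dict:
    """Report whether a module is importable and whether java is on PATH.

    The default module name is an assumption about how the
    opendataloader-pdf distribution is imported.
    """
    return {
        # find_spec returns None for a missing top-level module
        # without actually importing (and executing) it.
        "module_found": importlib.util.find_spec(module_name) is not None,
        # shutil.which returns None when the executable is not found.
        "java_on_path": shutil.which("java") is not None,
    }


if __name__ == "__main__":
    print(preflight())
```

Using `find_spec` rather than a plain `import` avoids running the package's code just to check that it exists.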
Credentials
The skill declares no environment variables, credentials, or config paths and its code does not read secrets. It only reads a user-supplied local file path (validated to be inside the current workspace) and writes output to a specified directory — which is proportionate to the stated function.
Persistence & Privilege
The skill does not request persistent or elevated presence (always:false). It does not modify other skills or system-wide agent settings. Autonomous invocation is allowed by default but is not combined with any broad credential access or unusual privileges.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install pdf-parser-agent
  3. After installation, invoke the skill by name or use /pdf-parser-agent
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.1
- Added a "Core Technology Attribution" section, crediting **bundolee** and **claude** as original authors of `opendataloader-pdf`.
- No changes to functionality or usage; documentation update only.
v1.0.0
Initial release of pdf-parser-agent skill:
- Parses local PDF files into structured Markdown and JSON formats using opendataloader-pdf.
- Operates locally for deterministic extraction, avoiding LLM context limitations.
- Dynamically locates user-installed dependencies for improved portability.
- Includes example usage and implementation notes in documentation.
Metadata
Slug pdf-parser-agent
Version 1.0.1
License MIT-0
All-time Installs 1
Active Installs 1
Total Versions 2
Frequently Asked Questions

What is Pdf Parser Agent?

Parses local PDF files into structured Markdown and JSON using opendataloader-pdf for deterministic, local document content extraction. It is an AI Agent Skill for Claude Code / OpenClaw, with 158 downloads so far.

How do I install Pdf Parser Agent?

Run "/install pdf-parser-agent" in the OpenClaw or Claude Code chat to install the skill. The opendataloader-pdf dependency is not bundled, so install it separately (for example, pip install --user opendataloader-pdf).

Is Pdf Parser Agent free?

Yes, Pdf Parser Agent is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Pdf Parser Agent support?

Pdf Parser Agent is cross-platform and runs anywhere OpenClaw or Claude Code is available.

Who created Pdf Parser Agent?

It is built and maintained by Ezequiel Techera (@trshdesigns); the current version is v1.0.1.
