← 返回 Skills 市场
trshdesigns

Pdf Parser Agent

作者 Ezequiel Techera · GitHub ↗ · v1.0.1 · MIT-0
cross-platform ✓ 安全检测通过
158
总下载
0
收藏
1
当前安装
2
版本数
在 OpenClaw 中安装
/install pdf-parser-agent
功能描述
Parses local PDF files into structured Markdown and JSON using opendataloader-pdf for deterministic, local document content extraction.
安全使用建议
This skill appears to do what it says: convert local PDFs using the opendataloader-pdf package. Before installing or running it: - Inspect and vet the external dependency (opendataloader-pdf) you will pip-install; that package will execute on your machine and is the primary runtime risk. Prefer to install it in a fresh virtualenv rather than system-wide. - Note the script adds your user-site-packages to sys.path, so whatever is installed there will be imported. If you share an environment, ensure no untrusted packages are present in user-site. - The script validates that input files are inside the current working directory; still run it in a controlled workspace to avoid accidental processing of sensitive files. - The included package.json is unexpected for a Python-only skill but appears harmless; it may be leftover metadata. - If you need stronger isolation, run this tool in a container or VM and audit opendataloader-pdf's behavior (it may spawn Java or other subprocesses according to the tests/notes).
功能分析
Type: OpenClaw Skill Name: pdf-parser-agent Version: 1.0.1 The skill is a straightforward utility for parsing local PDF files into Markdown or JSON using the 'opendataloader-pdf' library. It includes proactive security measures such as path traversal validation in 'scripts/pdf_extractor.py' to ensure file access remains within the workspace, and the code logic is well-documented and aligned with its stated purpose.
能力评估
Purpose & Capability
Name and description match the included script and SKILL.md: the skill runs a local Python-based converter (opendataloader-pdf) on local PDF files. The only minor oddity is an included package.json (Node metadata) despite this being a Python script; this appears cosmetic and does not contradict the stated purpose.
Instruction Scope
SKILL.md instructs the agent to run a local script against local PDFs and references a single dependency (opendataloader-pdf). The runtime instructions do not request unrelated files, environment variables, or external endpoints. The script does append the user's site-packages path to sys.path to locate a --user pip installation, which is reasonable for a dependency lookup but means it will import whatever opendataloader-pdf is installed in the user's site.
Install Mechanism
No install spec is provided by the skill (instruction-only), so nothing is downloaded or written by the skill itself. Dependency installation is left to the user (pip install --user opendataloader-pdf). This is low-risk for the skill bundle, though the external Python package remains a separate trust decision.
Credentials
The skill declares no environment variables, credentials, or config paths and its code does not read secrets. It only reads a user-supplied local file path (validated to be inside the current workspace) and writes output to a specified directory — which is proportionate to the stated function.
Persistence & Privilege
The skill does not request persistent or elevated presence (always:false). It does not modify other skills or system-wide agent settings. Autonomous invocation is allowed by default but is not combined with any broad credential access or unusual privileges.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install pdf-parser-agent
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /pdf-parser-agent 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.1
- Added a "Core Technology Attribution" section, crediting **bundolee** and **claude** as original authors of `opendataloader-pdf`. - No changes to functionality or usage; documentation update only.
v1.0.0
Initial release of pdf-parser-agent skill: - Parses local PDF files into structured Markdown and JSON formats using opendataloader-pdf. - Operates locally for deterministic extraction, avoiding LLM context limitations. - Dynamically locates user-installed dependencies for improved portability. - Includes example usage and implementation notes in documentation.
元数据
Slug pdf-parser-agent
版本 1.0.1
许可证 MIT-0
累计安装 1
当前安装数 1
历史版本数 2
常见问题

Pdf Parser Agent 是什么?

Parses local PDF files into structured Markdown and JSON using opendataloader-pdf for deterministic, local document content extraction. 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 158 次。

如何安装 Pdf Parser Agent?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install pdf-parser-agent」即可一键安装,无需额外配置。

Pdf Parser Agent 是免费的吗?

是的,Pdf Parser Agent 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。

Pdf Parser Agent 支持哪些平台?

Pdf Parser Agent 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 Pdf Parser Agent?

由 Ezequiel Techera(@trshdesigns)开发并维护,当前版本 v1.0.1。

💬 留言讨论