nantes

Pdfreader

Author: Ivan Cetta · GitHub · v1.0.3
cross-platform ✓ Security check passed
Total downloads: 643
Favorites: 2
Current installs: 4
Versions: 4
Install in OpenClaw
/install pdfreader
Description
Extract text and metadata from PDF files using PyMuPDF, supporting large files and outputting results in JSON format.
Security recommendations
This skill appears to do what it claims: extract text and metadata from PDFs using PyMuPDF. Before installing or running it, consider the following:
  1. Run pip install pymupdf in an isolated environment (virtualenv or container), because PyMuPDF ships compiled code from PyPI.
  2. The script enforces 'within current working directory' but allows subdirectories and does not resolve symlinks. Avoid placing untrusted symlinks inside the working directory, since they could escape the restriction.
  3. Because the source/homepage is unknown, prefer running the script in a sandbox and review the code yourself (or run it on non-sensitive PDFs) before giving it access to important files.
If you need stricter confinement (disallowing subdirectories, or adding symlink protection), request a code change that uses os.path.realpath checks and a configurable safe directory.
Functional analysis
Type: OpenClaw Skill
Name: pdfreader
Version: 1.0.3
The OpenClaw skill bundle is designed to extract text from PDF files using PyMuPDF. The `SKILL.md` documentation provides clear, non-malicious instructions and explicitly states security restrictions. The `pdf_reader.py` script implements path validation (`is_safe_input_path`, `is_safe_output_path`) to prevent path traversal and to restrict file operations to the current working directory and to specific file types (.pdf for input, .json for output). There are no signs of data exfiltration, malicious execution, persistence, or prompt injection attempts against the agent. The code is well-contained and aligns with its stated purpose and security measures.
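The separate input and output checks described above can be sketched roughly as follows. This is an illustration only, not the code in `pdf_reader.py`: the names `check_input_path`, `check_output_path`, and `_inside`, and the optional `base` parameter, are hypothetical, and the sketch mirrors the reported behavior of comparing absolute paths against the working directory.

```python
import os
from typing import Optional


def _inside(base: str, path: str) -> bool:
    """Return True if `path`, made absolute relative to `base`, is `base` or lies under it."""
    base_abs = os.path.abspath(base)
    # os.path.join keeps `path` unchanged when it is already absolute.
    target = os.path.abspath(os.path.join(base_abs, path))
    try:
        return os.path.commonpath([base_abs, target]) == base_abs
    except ValueError:
        # Raised on Windows when the two paths are on different drives.
        return False


def check_input_path(path: str, base: Optional[str] = None) -> bool:
    """Inputs must be .pdf files inside the base directory (cwd by default)."""
    base = os.getcwd() if base is None else base
    return path.lower().endswith(".pdf") and _inside(base, path)


def check_output_path(path: str, base: Optional[str] = None) -> bool:
    """Outputs must be .json files inside the base directory (cwd by default)."""
    base = os.getcwd() if base is None else base
    return path.lower().endswith(".json") and _inside(base, path)
```

Keeping the two checks separate means a `.json` path is never accepted as input and a `.pdf` path is never accepted as output, which matches the split described in the v1.0.3 changelog entry.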
Capability assessment
Purpose & Capability
Name/description match the files and instructions. The code uses PyMuPDF (fitz) to open PDFs, extract text and metadata, and produce JSON — exactly what the description promises. No extraneous binaries, credentials, or services are requested.
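For orientation, a minimal sketch of this kind of extraction with PyMuPDF is shown below. It is not the skill's actual script: the function names and the JSON layout (`metadata`, `page_count`, `pages`) are assumptions made for illustration, and the real output schema of `pdf_reader.py` may differ.

```python
import json


def build_result(metadata: dict, page_texts: list) -> dict:
    """Assemble a JSON-serializable result (hypothetical layout)."""
    return {
        "metadata": metadata,
        "page_count": len(page_texts),
        "pages": [
            {"page": number, "text": text}
            for number, text in enumerate(page_texts, start=1)
        ],
    }


def extract_pdf(path: str) -> dict:
    """Read metadata and per-page text from a PDF using PyMuPDF."""
    import fitz  # PyMuPDF; imported lazily so the helpers above work without it

    with fitz.open(path) as doc:
        # Iterating a Document yields its pages in order.
        texts = [page.get_text() for page in doc]
        return build_result(dict(doc.metadata or {}), texts)


def to_json(result: dict) -> str:
    """Serialize the result, keeping non-ASCII text readable."""
    return json.dumps(result, ensure_ascii=False, indent=2)
```

Collecting text page by page keeps each page's content addressable in the output, which can help when a downstream reader only needs part of a large document.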
Instruction Scope
SKILL.md usage aligns with the script's behavior (pip install pymupdf; run python pdf_reader.py ...). The SKILL.md states files must be 'within the current working directory' and forbids '../' traversal; the script enforces that by checking absolute paths are inside os.getcwd(). However, the script allows files in subdirectories of the current working directory (contrary to an implication that only the top-level cwd is allowed) and uses os.path.abspath rather than realpath, so a symlink inside the cwd that points outside could bypass the directory restriction. This is an implementation caveat rather than evidence of malicious behavior.
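The difference between `os.path.abspath` and `os.path.realpath` noted here can be demonstrated with a small sketch. The helper `is_within` and the `demo` function are hypothetical and written only to illustrate the caveat; they are not taken from the skill. The demo creates a symlink, which on Windows may require elevated privileges.

```python
import os
import tempfile


def is_within(base: str, path: str, resolve_symlinks: bool = True) -> bool:
    """Check that `path` stays under `base`, optionally resolving symlinks first."""
    normalize = os.path.realpath if resolve_symlinks else os.path.abspath
    base_n = normalize(base)
    target = normalize(os.path.join(base, path))
    try:
        return os.path.commonpath([base_n, target]) == base_n
    except ValueError:
        # Different drives on Windows: treat as outside.
        return False


def demo() -> tuple:
    """Return (abspath_result, realpath_result) for a symlink that points outside."""
    with tempfile.TemporaryDirectory() as outside, tempfile.TemporaryDirectory() as work:
        secret = os.path.join(outside, "secret.pdf")
        open(secret, "w").close()
        # A link that lives inside the working directory but targets a file outside it.
        os.symlink(secret, os.path.join(work, "link.pdf"))
        return (
            is_within(work, "link.pdf", resolve_symlinks=False),
            is_within(work, "link.pdf", resolve_symlinks=True),
        )
```

With `resolve_symlinks=False` the link is accepted because only the textual path is compared; with `resolve_symlinks=True` the resolved target is compared and the link is rejected. Passing `base` explicitly also gives the configurable safe directory suggested in the security recommendations.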
Install Mechanism
No install spec is embedded (instruction-only install guidance in SKILL.md recommends 'pip install pymupdf'). That is low-risk from the skill bundle perspective. Note: installing PyMuPDF via pip will run compiled extension code from PyPI — treat pip installs from unknown sources with standard care.
Credentials
The skill requests no environment variables, credentials, or config paths. The functionality does not require additional secrets. The code does not read environment variables or access unrelated system configuration.
Persistence & Privilege
always is false and the skill does not request persistent/autoincluded privileges. It does not modify other skills or system-wide settings. Autonomous invocation remains the platform default but is not combined with other concerning privileges here.
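How to use
  1. Make sure OpenClaw is installed (locally or via Docker)
  2. Enter the install command in the chat box: /install pdfreader
  3. Once installation finishes, invoke the Skill by name or trigger it with /pdfreader
  4. Provide the required inputs according to the Skill's parameter documentation to get structured output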
Version history
v1.0.3
Fixed instruction mismatch: Separated input (.pdf) and output (.json) validation. Added security documentation to SKILL.md
v1.0.2
Security fix: Added .pdf extension validation to prevent arbitrary file read (CVE-like vulnerability)
v1.0.1
Security fix: Added path validation to prevent arbitrary file write (CVE-like vulnerability)
v1.0.0
Initial release of PDF Reader Skill for OpenClaw:
  - Extracts text from any PDF using PyMuPDF.
  - Supports large and multi-page PDF files.
  - Outputs extracted content in JSON for AI reading compatibility.
  - Handles text encoding issues.
  - Displays PDF metadata (title, author, etc.).
  - Includes clear installation and usage instructions.
Metadata
Slug pdfreader
Version 1.0.3
License
Total installs 4
Current installs 4
Versions 4
FAQ

What is Pdfreader?

Pdfreader extracts text and metadata from PDF files using PyMuPDF, supports large files, and outputs results in JSON format. It is an AI Agent Skill plugin for Claude Code / OpenClaw, with 643 total downloads to date.

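How do I install Pdfreader?

Run the command "/install pdfreader" in the OpenClaw or Claude Code chat box for a one-step install. The skill itself needs no further configuration, although its SKILL.md recommends running pip install pymupdf to provide the PyMuPDF dependency.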

Is Pdfreader free?

Yes. Pdfreader is completely free (free and open source) to download, install, and use.

Which platforms does Pdfreader support?

Pdfreader is cross-platform and runs in any environment where OpenClaw / Claude Code is deployed.

Who developed Pdfreader?

Pdfreader is developed and maintained by Ivan Cetta (@nantes). The current version is v1.0.3.
