← 返回 Skills 市场
382
总下载
2
收藏
0
当前安装
1
版本数
在 OpenClaw 中安装
/install pdfagent
功能描述
Self-hosted PDF operations and conversions with metered usage output.
安全使用建议
This package implements a comprehensive self-hosted PDF CLI (merging, splitting, OCR, conversions, redaction, an 'agent' mode) and the code mostly matches that purpose — but pay attention to these issues before running it:
- Dependency mismatch: The registry lists no required binaries/env but SKILL.md and code expect many external tools (gs, qpdf, pdftoppm, soffice, ocrmypdf, wkhtmltopdf/Chrome, and optionally ollama). Ensure those are installed intentionally.
- Network and external execution: html_to_pdf can fetch remote URLs; core.llm can run arbitrary commands or call 'ollama' (a local LLM runner). Running the CLI with remote sources or LLM provider=command may cause the tool to access the network or execute untrusted commands. Treat any use that passes URLs or enables an external LLM/command as potentially exfiltrative.
- Undeclared env usage: The code reads PDFAGENT_SOFFICE_TIMEOUT (and subprocess code supports custom env). Review environment variables and avoid exposing secrets to the runtime environment you use for this tool.
- Run in isolation first: Test the tool in a sandbox / disposable VM, with non-sensitive PDFs, and confirm behavior (doctor command reports available binaries). Inspect CLI flags (especially anything enabling LLM/agent mode or remote fetching) before using on private data.
- Origin and trust: The source 'homepage' and origin are unknown. If you need to run this in production or on sensitive documents, consider auditing the remaining omitted files, or prefer a vetted implementation from a known source.
If you want, I can: (1) list every place the code can perform network I/O or spawn external processes, (2) locate where the CLI accepts LLM provider/command options, or (3) highlight any remaining omitted files for further review.
功能分析
Type: OpenClaw Skill
Name: pdfagent
Version: 0.1.0
The skill bundle provides extensive PDF manipulation capabilities but contains high-risk features that could be exploited via prompt injection. Specifically, the 'agent' and 'translate' commands in 'pdfagent/cli.py' allow for arbitrary command execution through the '--llm-cmd' parameter (processed in 'pdfagent/core/llm.py'), which is intended for local LLM integration but lacks sanitization against malicious instructions. Additionally, 'pdfagent/tools/html_to_pdf.py' uses 'urllib.request.urlopen' to fetch content from user-provided URLs, introducing a potential Server-Side Request Forgery (SSRF) risk.
能力评估
Purpose & Capability
Name/description promise self-hosted PDF operations and the repo code implements that. However the skill metadata declares no required binaries or env vars while SKILL.md and the code require/expect uv, Ghostscript (gs), qpdf, poppler (pdftoppm), soffice (LibreOffice), ocrmypdf, wkhtmltopdf/Chrome, and optionally ollama and other Python libs. The registry declarations (no requirements) are inconsistent with the actual capabilities and dependencies.
Instruction Scope
SKILL.md focuses on local disk-based PDF processing, but the code can fetch remote HTML (urllib.request.urlopen in html_to_pdf) and can invoke external commands/LLM providers (core.llm uses arbitrary commands or 'ollama' via subprocess). Those behaviors allow network I/O and arbitrary process execution that go beyond simple file manipulation; the documentation does mention some of these tools but the risk/implications are not made explicit in the SKILL.md.
Install Mechanism
No install spec is provided (instruction-only for running via 'uv run'), so nothing is downloaded or installed automatically by the registry. The presence of source files means code will execute locally when run, but there is no remote installer or archive URL to review.
Credentials
The registry declares no required env vars, but code reads at least one env var (PDFAGENT_SOFFICE_TIMEOUT) and the subprocess execution paths allow passing custom env to commands. The tool also exposes options to call external LLMs or arbitrary commands; those uses can require secrets or expose sensitive data if misconfigured. Overall requested/used environment access is under-declared relative to what the code can leverage.
Persistence & Privilege
The skill is not always-enabled, does not request to modify other skills, and has no install hook. It writes usage logs optionally to a --usage-file, creates per-command output files and local LibreOffice profile directories, which is normal for a CLI tool.
如何使用
- 确保已安装 OpenClaw(本地或 Docker 部署)
- 在对话框中输入安装命令:
/install pdfagent - 安装完成后,直接呼叫该 Skill 的名称或使用
/pdfagent触发 - 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v0.1.0
Initial release of pdfagent: self-hosted PDF operations with usage metering.
- Supports PDF merge, split, compression, conversion (including image to PDF), and OCR.
- Designed for local file processing; inputs and outputs remain on disk.
- Provides detailed, machine-readable output with usage statistics via --json.
- Flexible agent mode for multi-step PDF instruction execution.
- Includes dependency and system binary checking for robust setup.
- Runs standalone from source with `uv run`; no PyPI publishing required.
元数据
常见问题
PR's PDF Agent 是什么?
Self-hosted PDF operations and conversions with metered usage output. 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 382 次。
如何安装 PR's PDF Agent?
在 OpenClaw 或 Claude Code 对话框中运行命令「/install pdfagent」即可一键安装,无需额外配置。
PR's PDF Agent 是免费的吗?
是的,PR's PDF Agent 完全免费(开源免费),可自由下载、安装和使用。
PR's PDF Agent 支持哪些平台?
PR's PDF Agent 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。
谁开发了 PR's PDF Agent?
由 cap-txt(@cap-txt)开发并维护,当前版本 v0.1.0。
推荐 Skills