← 返回 Skills 市场
142
总下载
0
收藏
0
当前安装
1
版本数
在 OpenClaw 中安装
/install pdf-invoice-parser
功能描述
Extract structured data from PDF invoices and documents. Handles scanned PDFs (OCR) and digital PDFs. Outputs clean CSV/Excel with vendor, invoice number, da...
安全使用建议
The skill appears coherent and limited to local PDF parsing, but follow best practices before running it on sensitive data: (1) Run in a virtualenv or container to isolate pip-installed packages; (2) review and pin dependency versions if you will install them on production systems; (3) install tesseract from your OS package manager as instructed (verify the source); (4) test on non-sensitive sample invoices to confirm parsing quality; and (5) if you need network or cloud integration later, prefer adding explicit, minimal credentials and review any new code for unexpected network activity. Overall this skill is fit-for-purpose but exercise standard supply-chain caution when installing third-party Python packages.
功能分析
Type: OpenClaw Skill
Name: pdf-invoice-parser
Version: 1.0.0
The skill bundle is a legitimate tool for extracting structured data from PDF invoices using standard libraries such as PyMuPDF, PyPDF2, and Tesseract OCR. The scripts (parse-invoice.py and parse-invoices.py) perform local file processing and regex-based data extraction as described, with no evidence of data exfiltration, malicious command execution, or prompt injection attacks.
能力标签
能力评估
Purpose & Capability
Name/description match the included scripts and declared functionality: parsing searchable PDFs, optional OCR via pytesseract, and writing CSV/JSON/Excel-ready output. Required libraries (PyMuPDF, PyPDF2, Pillow, pytesseract, openpyxl) are appropriate for the stated purpose.
Instruction Scope
SKILL.md instructs the agent/user to install dependencies and run the provided scripts on local PDF files or directories. The runtime instructions and the scripts operate only on user-provided PDFs and produce local output files; they do not attempt to read unrelated system files, environment variables, or contact external endpoints.
Install Mechanism
This is an instruction-only skill (no automated install spec). SKILL.md asks the user to pip install third-party packages and to install the tesseract system package via apt/brew. Installing packages via pip can execute arbitrary code during installation (normal for Python packages) — recommend using a virtualenv/container and verifying package sources. The pip flag --break-system-packages appears in the example; it's not harmful in itself but is uncommon and may be unnecessary for many users.
Credentials
The skill requests no environment variables, no credentials, and no config paths. All data access is limited to PDF files supplied by the user. There are no hidden credential usages in the code.
Persistence & Privilege
always is false and the skill does not modify other skills or system-wide agent settings. It does not persist credentials or enable itself automatically.
如何使用
- 确保已安装 OpenClaw(本地或 Docker 部署)
- 在对话框中输入安装命令:
/install pdf-invoice-parser - 安装完成后,直接呼叫该 Skill 的名称或使用
/pdf-invoice-parser触发 - 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.0
Initial release of pdf-invoice-parser.
- Extracts structured data from digital and scanned PDF invoices.
- Supports OCR for scanned/image-based PDFs.
- Outputs invoice data as CSV, JSON, or Excel-ready TSV formats.
- Captures key fields: vendor, invoice number, dates, line items, totals, and currency.
- Supports batch processing of invoice directories.
- Includes CLI usage examples and required dependencies for setup.
元数据
常见问题
Pdf Invoice Parser 是什么?
Extract structured data from PDF invoices and documents. Handles scanned PDFs (OCR) and digital PDFs. Outputs clean CSV/Excel with vendor, invoice number, da... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 142 次。
如何安装 Pdf Invoice Parser?
在 OpenClaw 或 Claude Code 对话框中运行命令「/install pdf-invoice-parser」即可一键安装,无需额外配置。
Pdf Invoice Parser 是免费的吗?
是的,Pdf Invoice Parser 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。
Pdf Invoice Parser 支持哪些平台?
Pdf Invoice Parser 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。
谁开发了 Pdf Invoice Parser?
由 tktk-ai(@tktk-ai)开发并维护,当前版本 v1.0.0。
推荐 Skills