pdf-miner

Name: pdf-miner
Author: baichenwzj

By baichen · GitHub · v1.0.2 · MIT-0

cross-platform ⚠ suspicious

Total downloads: 116 · Current installs: 0 · Versions: 3

Install in OpenClaw

/install pdf-miner

Description

Extract text and tables from PDF files with robust support for global market data formats (currencies, percentages, units). Use when: (1) User asks to read/e...

Security recommendations

What to consider before installing or using this skill:
- Documentation mismatch: the SKILL.md both says OCR is not supported and includes full OCR support. Treat that as poor documentation and verify behavior before trusting it with sensitive PDFs.
- Data exfiltration risk: OCR sends page images (base64) to a configured vision API (default base URL: https://openrouter.ai/api/v1). If you provide an API key or a base URL, pages may be uploaded to an external service. Do not supply credentials or run auto-OCR on sensitive documents unless you trust the provider and understand its data retention and privacy terms.
- Undeclared env vars/config: the skill expects OCR_API_KEY / OPENROUTER_API_KEY, OCR_MODEL, OCR_BASE_URL, or a config.json storing a vision_api_key. These were not listed in the registry metadata; they are optional for plain text/table extraction but required if you need OCR. Storing API keys in config.json puts secrets on disk in plain text; prefer environment variables.
- If you do not want OCR behavior: either avoid installing PyMuPDF/openai and do not provide an API key, or run the script with --no-auto-ocr and avoid --ocr. Test the tool on non-sensitive PDFs first to confirm it behaves as you expect.
- If you need assurance: ask the publisher why the README contradicts OCR support, and request a minimal build without OCR if you only need text/tables. Review the included scripts locally to confirm endpoints and behavior before running them on confidential documents.
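As a quick pre-flight check before pointing the skill at a sensitive PDF, something like the following sketch can report which OCR-related environment variables are set. The variable names and the default base URL come from the notes above; how the skill itself combines them (precedence, fallbacks) is an assumption here, and the function name `ocr_exposure_report` is purely illustrative. It does not look at config.json, so a negative result only covers the environment.

```python
import os
from typing import Mapping

# Names and default URL are quoted from the listing; the way the skill
# actually resolves them is NOT confirmed and is assumed for illustration.
KEY_VARS = ("OCR_API_KEY", "OPENROUTER_API_KEY")
DEFAULT_BASE_URL = "https://openrouter.ai/api/v1"


def ocr_exposure_report(env: Mapping[str, str]) -> dict:
    """Summarize whether OCR uploads look possible given `env`."""
    keys_present = [name for name in KEY_VARS if env.get(name)]
    return {
        # Only variable names are reported, never the secret values.
        "key_vars_set": keys_present,
        "base_url": env.get("OCR_BASE_URL") or DEFAULT_BASE_URL,
        "model": env.get("OCR_MODEL"),
        "uploads_possible": bool(keys_present),
    }


if __name__ == "__main__":
    print(ocr_exposure_report(os.environ))
```

If `uploads_possible` is true, passing --no-auto-ocr (as mentioned above) or unsetting the variables for that shell session is the conservative choice.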

Functional analysis

Type: OpenClaw Skill
Name: pdf-miner
Version: 1.0.2

The pdf-miner skill is a legitimate PDF extraction utility that uses pdfplumber for text/table parsing and an external vision API (via the openai library) for OCR. The code in extract_pdf.py and ocr_engine.py implements standard features like keyword search, metrics extraction, and table of contents detection without any signs of malicious intent, obfuscation, or unauthorized data exfiltration. While SKILL.md contains a minor documentation contradiction regarding OCR support, the overall bundle is well-structured, follows standard security practices for API key management (env vars/config.json), and lacks any dangerous execution patterns like shell injection or persistence mechanisms.

Capability assessment

⚠ Purpose & Capability

The top-level SKILL.md README states the skill is NOT for OCR on scanned/image PDFs, yet the package includes two OCR-capable scripts, explicit OCR instructions, automatic OCR behavior, and guidance to configure a vision API. This is an internal contradiction: OCR-related code and instructions clearly belong to the skill, but the description contradicts that capability.

⚠ Instruction Scope

Runtime instructions and the scripts will (by default) automatically OCR pages with low extractable text and send base64-encoded page images to a configurable third-party vision API. The SKILL.md tells the agent/user to set OCR_API_KEY, OCR_MODEL, OCR_BASE_URL or create a config.json — these environment/config values are not declared in the skill metadata. Automatic OCR behavior may send sensitive PDF contents to an external endpoint unless the user disables auto-OCR or withholds credentials.
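To make the auto-OCR behavior described here concrete, the sketch below shows the general shape of a text-length trigger and a base64 payload. It is not the skill's code: the 100-character default is the one quoted in the v1.0.1 release notes, but whether the skill strips whitespace or uses a strict comparison is assumed, and both function names are hypothetical.

```python
import base64

DEFAULT_THRESHOLD = 100  # characters; default quoted in the v1.0.1 notes


def pages_needing_ocr(page_texts, threshold=DEFAULT_THRESHOLD):
    """Return 1-based page numbers whose extracted text is below `threshold`.

    `page_texts` is a list with one entry per page; None means the
    extractor found no text at all on that page.
    """
    return [
        number
        for number, text in enumerate(page_texts, start=1)
        if len((text or "").strip()) < threshold  # assumed comparison
    ]


def encode_page_image(image_bytes):
    """Base64-encode rendered page bytes, as an upload payload would carry them."""
    return base64.b64encode(image_bytes).decode("ascii")
```

Running a check like `pages_needing_ocr` over locally extracted text first gives a rough idea of which pages would leave the machine if auto-OCR and credentials were both enabled.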

✓ Install Mechanism

No install spec is provided (instruction-only install), and dependencies are standard Python packages (pdfplumber, pymupdf, openai). No remote downloads or unusual installers are used. This is low install risk.
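Because the OCR path depends on optional packages, one way to see what is available in a given environment is to check which modules are importable. The sketch below uses only the standard library; the import names (`pdfplumber`, `fitz` for PyMuPDF, `openai`) are the usual ones for those packages, and treating their absence as "OCR unavailable" is an assumption about how the scripts behave, not something the listing confirms.

```python
from importlib.util import find_spec


def installed_modules(names):
    """Map each top-level module name to whether it can be imported."""
    return {name: find_spec(name) is not None for name in names}


if __name__ == "__main__":
    # "fitz" is the classic import name for the PyMuPDF package.
    print(installed_modules(["pdfplumber", "fitz", "openai"]))
```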

⚠ Credentials

Registry metadata lists no required env vars, but SKILL.md and scripts expect OCR_API_KEY / OPENROUTER_API_KEY, OCR_MODEL, OCR_BASE_URL and allow storing keys in config.json. Requesting a vision API key is proportionate to OCR functionality, but the metadata omission and the presence of persistent config-file credential guidance (config.json) are inconsistent and worth flagging.
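Since config.json may hold a key in plain text, a small audit along these lines can flag it before the file ends up in a backup or a shared folder. The field name `vision_api_key` is the one mentioned in the security notes; the rest of the file's layout is unknown, so this sketch simply scans top-level fields whose names contain "api_key". The function names are hypothetical.

```python
import json
from pathlib import Path


def plaintext_key_fields(config):
    """Return top-level field names that look like stored API keys."""
    return sorted(
        name
        for name, value in config.items()
        if "api_key" in name.lower() and isinstance(value, str) and value
    )


def audit_config_file(path):
    """Return suspicious field names in a JSON config, or [] if none apply."""
    path = Path(path)
    if not path.is_file():
        return []
    data = json.loads(path.read_text(encoding="utf-8"))
    if not isinstance(data, dict):  # layout is assumed to be a JSON object
        return []
    return plaintext_key_fields(data)
```

A non-empty result suggests moving the value into an environment variable and deleting it from the file.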

✓ Persistence & Privilege

The skill is not always-included, does not request elevated platform privileges, and does not modify other skills. It only reads a skill-scoped config.json (or env vars) and writes output files; no suspicious persistence or privilege escalation behavior is present.

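How to use

Make sure OpenClaw is installed (locally or via Docker).
Enter the install command in the chat box: /install pdf-miner
Once installed, invoke the Skill by name or trigger it with /pdf-miner
Provide the inputs described in the Skill's parameter documentation to get structured output.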

Version history

v1.0.2

No user-facing changes were detected in this release.
- No file changes or updates to documentation, features, or behavior.

v1.0.1

v1.0.1 adds robust OCR support and flexible global configuration options:
- Added automatic OCR for scanned/image-based PDFs, triggered when extracted text is minimal (default threshold: 100 characters).
- New configuration via `config.json`, environment variables, or command-line for vision API credentials and model selection.
- Introduced new script: `scripts/ocr_engine.py` to power OCR features.
- Added advanced OCR CLI options (e.g., page selection, adjustable threshold, DPI, model override, and disabling auto-OCR).
- Updated documentation to cover OCR setup, troubleshooting, and workflow.

v1.0.0

Initial release of pdf-miner, a robust PDF text/table extraction tool with advanced features.
- Extracts text and tables from PDFs, with strong support for Chinese (CJK) documents.
- Supports keyword search, metrics extraction, TOC detection, PDF diff/comparison, LLM chunk splitting, batch processing, and header/footer cleanup.
- Multiple output and extraction modes available via command-line options.
- Not designed for image-based (scanned) PDFs or PDF editing/creation.
- Comparison table with other tools provided; requires installation of pdfplumber.

Metadata

Slug pdf-miner

Version 1.0.2

License MIT-0

Total installs 0

Current installs 0

Versions 3

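FAQ

What is pdf-miner?

Extract text and tables from PDF files with robust support for global market data formats (currencies, percentages, units). Use when: (1) User asks to read/e... It is an AI Agent Skill plugin for Claude Code / OpenClaw, with 116 total downloads to date.

How do I install pdf-miner?

Run the command "/install pdf-miner" in the OpenClaw or Claude Code chat box to install it in one step, with no additional configuration required.

Is pdf-miner free?

Yes. pdf-miner is completely free and released under the MIT-0 license; you can download, install, and use it freely.

Which platforms does pdf-miner support?

pdf-miner is cross-platform and can be used in any environment where OpenClaw / Claude Code is deployed.

Who develops pdf-miner?

It is developed and maintained by baichen (@baichenwzj); the current version is v1.0.2.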