← 返回 Skills 市场
116
总下载
1
收藏
0
当前安装
3
版本数
在 OpenClaw 中安装
/install pdf-miner
功能描述
Extract text and tables from PDF files with robust support for global market data formats (currencies, percentages, units). Use when: (1) User asks to read/e...
安全使用建议
What to consider before installing/using this skill:
- Documentation mismatch: the SKILL.md both says OCR is not supported and also includes full OCR support. Treat that as poor documentation and verify behavior before trusting it with sensitive PDFs.
- Data exfiltration risk: OCR sends page images (base64) to a configured vision API (default base URL: https://openrouter.ai/api/v1). If you provide an API key or a base URL, pages may be uploaded to an external service. Do not supply credentials or run auto-OCR on sensitive documents unless you trust the provider and understand their data retention/privacy.
- Undeclared env vars/config: The skill expects OCR_API_KEY / OPENROUTER_API_KEY, OCR_MODEL, OCR_BASE_URL or a config.json storing a vision_api_key. These were not listed in the registry metadata; treat them as optional but required if you need OCR. Storing API keys in config.json will put secrets on disk in plain text — prefer environment variables.
- If you do not want OCR behavior: either avoid installing PyMuPDF/openai and do not provide an API key, or run the script with --no-auto-ocr / avoid --ocr. Test the tool on non-sensitive PDFs first to confirm it behaves as you expect.
- If you need assurance: ask the publisher why the README contradicts OCR support and request a minimal build without OCR if you only need text/tables. Review the included scripts locally to confirm endpoints and behavior before running them on confidential documents.
功能分析
Type: OpenClaw Skill
Name: pdf-miner
Version: 1.0.2
The pdf-miner skill is a legitimate PDF extraction utility that uses pdfplumber for text/table parsing and an external vision API (via the openai library) for OCR. The code in extract_pdf.py and ocr_engine.py implements standard features like keyword search, metrics extraction, and table of contents detection without any signs of malicious intent, obfuscation, or unauthorized data exfiltration. While SKILL.md contains a minor documentation contradiction regarding OCR support, the overall bundle is well-structured, follows standard security practices for API key management (env vars/config.json), and lacks any dangerous execution patterns like shell injection or persistence mechanisms.
能力评估
Purpose & Capability
The top-level SKILL.md README states the skill is NOT for OCR on scanned/image PDFs, yet the package includes two OCR-capable scripts, explicit OCR instructions, automatic OCR behavior, and guidance to configure a vision API. This is an internal contradiction: OCR-related code and instructions clearly belong to the skill, but the description contradicts that capability.
Instruction Scope
Runtime instructions and the scripts will (by default) automatically OCR pages with low extractable text and send base64-encoded page images to a configurable third-party vision API. The SKILL.md tells the agent/user to set OCR_API_KEY, OCR_MODEL, OCR_BASE_URL or create a config.json — these environment/config values are not declared in the skill metadata. Automatic OCR behavior may send sensitive PDF contents to an external endpoint unless the user disables auto-OCR or withholds credentials.
Install Mechanism
No install spec is provided (instruction-only install), and dependencies are standard Python packages (pdfplumber, pymupdf, openai). No remote downloads or unusual installers are used. This is low install risk.
Credentials
Registry metadata lists no required env vars, but SKILL.md and scripts expect OCR_API_KEY / OPENROUTER_API_KEY, OCR_MODEL, OCR_BASE_URL and allow storing keys in config.json. Requesting a vision API key is proportionate to OCR functionality, but the metadata omission and the presence of persistent config-file credential guidance (config.json) are inconsistent and worth flagging.
Persistence & Privilege
The skill is not always-included, does not request elevated platform privileges, and does not modify other skills. It only reads a skill-scoped config.json (or env vars) and writes output files; no suspicious persistence or privilege escalation behavior is present.
如何使用
- 确保已安装 OpenClaw(本地或 Docker 部署)
- 在对话框中输入安装命令:
/install pdf-miner - 安装完成后,直接呼叫该 Skill 的名称或使用
/pdf-miner触发 - 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.2
No user-facing changes were detected in this release.
- No file changes or updates to documentation, features, or behavior.
v1.0.1
v1.0.1 adds robust OCR support and flexible global configuration options:
- Added automatic OCR for scanned/image-based PDFs, triggered when extracted text is minimal (default threshold: 100 characters).
- New configuration via `config.json`, environment variables, or command-line for vision API credentials and model selection.
- Introduced new script: `scripts/ocr_engine.py` to power OCR features.
- Added advanced OCR CLI options (e.g., page selection, adjustable threshold, DPI, model override, and disabling auto-OCR).
- Updated documentation to cover OCR setup, troubleshooting, and workflow.
v1.0.0
Initial release of pdf-miner — a robust PDF text/table extraction tool with advanced features.
- Extracts text and tables from PDFs, with strong support for Chinese (CJK) documents.
- Supports keyword search, metrics extraction, TOC detection, PDF diff/comparison, LLM chunk splitting, batch processing, and header/footer cleanup.
- Multiple output and extraction modes available by command line options.
- Not designed for image-based (scanned) PDFs or PDF editing/creation.
- Comparison table with other tools provided; requires installation of pdfplumber.
元数据
常见问题
pdf-miner 是什么?
Extract text and tables from PDF files with robust support for global market data formats (currencies, percentages, units). Use when: (1) User asks to read/e... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 116 次。
如何安装 pdf-miner?
在 OpenClaw 或 Claude Code 对话框中运行命令「/install pdf-miner」即可一键安装,无需额外配置。
pdf-miner 是免费的吗?
是的,pdf-miner 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。
pdf-miner 支持哪些平台?
pdf-miner 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。
谁开发了 pdf-miner?
由 baichen(@baichenwzj)开发并维护,当前版本 v1.0.2。
推荐 Skills