emptyguo

opendataloader-pdf

Author: empty_4399 · GitHub · v1.0.0 · MIT-0
cross-platform ✓ Security check passed
Total downloads: 240 · Favorites: 0 · Current installs: 1 · Versions: 1
Install in OpenClaw
/install opendataloader-pdf
Description
Use when parsing PDFs for RAG pipelines, extracting structured data from PDFs, or converting PDFs to Markdown/JSON with bounding boxes for AI processing
Security recommendations
This skill appears coherent and focused on local PDF extraction. Before installing:
  1. Verify the opendataloader-pdf package on PyPI/npm, and confirm the upstream GitHub source and release integrity.
  2. Be aware that hybrid mode or any server mode may change data flows (it could call external services or require models); read the hybrid-mode docs and any config for remote endpoints or API keys before enabling it.
  3. Run installations in an isolated environment (virtualenv/container) and test on non-sensitive documents first.
  4. Ensure Java 11+ and any OCR dependencies are installed from trusted sources.
  5. If you need guarantees about data staying local, confirm implementation details for hybrid/OCR modes in the project's docs or source code.
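The Java 11+ prerequisite mentioned above can be checked before installing anything. The following is a minimal sketch, assuming a `java` executable is on PATH and that it prints the usual `version "..."` banner; the function names `parse_java_major` and `installed_java_major` are illustrative and are not part of opendataloader-pdf.

```python
import re
import subprocess
from typing import Optional


def parse_java_major(version_output: str) -> Optional[int]:
    """Extract the major Java version from `java -version` output."""
    match = re.search(r'version "(\d+)(?:\.(\d+))?', version_output)
    if match is None:
        return None
    major = int(match.group(1))
    # Legacy numbering: Java 8 and older report themselves as "1.8.0_xxx".
    if major == 1 and match.group(2) is not None:
        return int(match.group(2))
    return major


def installed_java_major() -> Optional[int]:
    """Return the major version of the `java` on PATH, or None if unavailable."""
    try:
        result = subprocess.run(
            ["java", "-version"], capture_output=True, text=True, check=False
        )
    except OSError:
        # Covers a missing or non-executable `java` binary.
        return None
    # `java -version` normally writes its banner to stderr.
    return parse_java_major(result.stderr or result.stdout)


if __name__ == "__main__":
    major = installed_java_major()
    if major is None:
        print("No usable Java runtime found on PATH")
    elif major < 11:
        print(f"Java {major} found; 11 or newer is expected")
    else:
        print(f"Java {major} found")
```

Running this inside the same virtualenv or container that will host the package confirms that the isolated environment, rather than the host, provides the runtime.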
Functional analysis
Type: OpenClaw Skill · Name: opendataloader-pdf · Version: 1.0.0
The skill bundle provides documentation and instructions for 'opendataloader-pdf', a tool designed for parsing PDFs into structured formats like Markdown and JSON for RAG pipelines. The content in SKILL.md and _meta.json consists of standard installation commands (pip/npm), usage examples, and feature descriptions without any evidence of malicious intent, data exfiltration, or prompt injection attacks.
Capability assessment
Purpose & Capability
Name/description (PDF parsing for RAG, bounding boxes, Markdown/JSON output) align with the SKILL.md: it documents CLI/Python/Node APIs, supported modes (fast/hybrid/OCR), and expected outputs. Required system dependencies (Java, Python/Node) are reasonable for PDF parsing/OCR pipelines.
Instruction Scope
SKILL.md only instructs installing the package(s), running conversion commands, and configuring mode/ocr/languages. It references input file paths and output directories (expected for this purpose). It does not instruct reading unrelated system files, exporting secrets, or sending data to unexpected external endpoints. The only potential scope caveat: 'hybrid' mode and 'start server' are mentioned but not detailed — those could change data flows depending on implementation, so users should verify hybrid behavior before enabling.
Install Mechanism
This is an instruction-only skill with no install spec. The SKILL.md recommends pip/npm installs (standard registries). No embedded download URLs or archive extraction steps in the skill itself. Installing from PyPI/npm is a common, low-risk approach — verify package provenance when installing.
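One way to act on the provenance advice is to compare the SHA-256 digest of a downloaded artifact against the digest published by the registry. The sketch below uses only the Python standard library; the helper names and any file name passed to them are hypothetical, and the expected digest is assumed to come from a source you already trust, such as the registry's file listing.

```python
import hashlib
import hmac
from pathlib import Path


def sha256_of_file(path: Path, chunk_size: int = 1 << 20) -> str:
    """Hash a file in chunks so large archives are not loaded into memory at once."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_digest(path: Path, expected_hex: str) -> bool:
    """Return True when the file's SHA-256 equals the published hex digest."""
    return hmac.compare_digest(sha256_of_file(path), expected_hex.strip().lower())
```

For pip-based installs, pip's hash-checking mode (`--require-hashes` with pinned hashes in a requirements file) enforces the same comparison at install time.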
Credentials
The skill declares no required environment variables, credentials, or config paths. The SKILL.md does not reference secret env vars. This is proportionate for a local PDF-extraction tool.
Persistence & Privilege
The 'always' flag is set to false, and the skill does not request persistent system presence or modify other skills. It does not require elevated privileges or access to other agents' configs.
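How to use
  1. Make sure OpenClaw is installed (locally or via Docker).
  2. Enter the install command in the chat box: /install opendataloader-pdf
  3. Once installation completes, invoke the Skill by name or trigger it with /opendataloader-pdf
  4. Provide the required inputs as described in the Skill's parameter documentation to get structured output.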
Version history
v1.0.0
- Initial release of opendataloader-pdf skill.
- Enables PDF parsing for RAG pipelines, extracting structured data, and converting PDFs to Markdown/JSON/HTML with bounding boxes.
- Supports Python and Node.js, with both fast local and hybrid AI (OCR, advanced extraction) modes.
- Provides element-level data (types, content, bounding boxes, page numbers) and robust table/extraction features.
- Includes LangChain integration and offers detailed troubleshooting guidance.
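To illustrate how element-level data (types, content, bounding boxes, page numbers) might feed a RAG pipeline, here is a minimal sketch that groups extracted text by page. The field names `type`, `content`, `page`, and `bbox` are assumptions made for this example and are not taken from the tool's documented JSON schema; check the actual output and adapt the keys before reusing it.

```python
from collections import defaultdict
from typing import Dict, List, TypedDict


class Element(TypedDict):
    # Hypothetical element shape; the real output keys may differ.
    type: str
    content: str
    page: int
    bbox: List[float]  # assumed order: [left, bottom, right, top]


def chunks_by_page(elements: List[Element]) -> Dict[int, str]:
    """Join text content per page, in input order, skipping empty elements."""
    pages: Dict[int, List[str]] = defaultdict(list)
    for element in elements:
        text = element["content"].strip()
        if text:
            pages[element["page"]].append(text)
    return {page: "\n".join(parts) for page, parts in sorted(pages.items())}


sample: List[Element] = [
    {"type": "heading", "content": "Title", "page": 1, "bbox": [72, 700, 300, 720]},
    {"type": "paragraph", "content": "First paragraph.", "page": 1, "bbox": [72, 640, 500, 690]},
    {"type": "image", "content": "", "page": 1, "bbox": [72, 400, 500, 630]},
    {"type": "paragraph", "content": "Second page text.", "page": 2, "bbox": [72, 640, 500, 690]},
]

page_chunks = chunks_by_page(sample)
```

Keeping the page number as the chunk key lets a retrieved passage be cited back to its page; the bounding boxes could be carried along in the same way if finer-grained source highlighting is needed.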
Metadata
Slug: opendataloader-pdf
Version: 1.0.0
License: MIT-0
Total installs: 1
Current installs: 1
Versions: 1
FAQ

What is opendataloader-pdf?

opendataloader-pdf is an AI Agent Skill plugin for Claude Code / OpenClaw. Use it when parsing PDFs for RAG pipelines, extracting structured data from PDFs, or converting PDFs to Markdown/JSON with bounding boxes for AI processing. It has been downloaded 240 times to date.

How do I install opendataloader-pdf?

Run the command "/install opendataloader-pdf" in the OpenClaw or Claude Code chat box for a one-step install; no additional configuration is required.

Is opendataloader-pdf free?

Yes. opendataloader-pdf is completely free and released under the MIT-0 license; you can download, install, and use it freely.

Which platforms does opendataloader-pdf support?

opendataloader-pdf is cross-platform and can be used in any environment where OpenClaw / Claude Code is deployed.

Who develops opendataloader-pdf?

It is developed and maintained by empty_4399 (@emptyguo). The current version is v1.0.0.
