← 返回 Skills 市场
datadrivenconstruction

Pdf To Structured

作者 datadrivenconstruction · GitHub ↗ · v2.0.0
cross-platform ✓ 安全检测通过
4472
总下载
9
收藏
36
当前安装
2
版本数
在 OpenClaw 中安装
/install pdf-to-structured
功能描述
Extract structured data from construction PDFs. Convert specifications, BOMs, schedules, and reports from PDF to Excel/CSV/JSON. Use OCR for scanned documents and pdfplumber for native PDFs.
安全使用建议
Install only if you are comfortable letting the skill read the PDFs you choose and write extracted data to local files. Construction PDFs and exported CSV/Excel/JSON files may contain sensitive project, commercial, or personal information, so store outputs in approved locations and review them before sharing.
功能分析
Type: OpenClaw Skill Name: pdf-to-structured Version: 2.0.0 The skill is designed for extracting structured data from PDFs using local libraries like pdfplumber and pytesseract. All code examples and instructions align with this stated purpose. The `claw.json` requests 'filesystem' permission, which is necessary for reading PDFs and writing output files. Crucially, `instructions.md` explicitly states 'No network access required for basic extraction', and no network calls or attempts to exfiltrate data are found in the provided code. There are no signs of prompt injection, obfuscation, or other malicious behaviors.
能力评估
Purpose & Capability
The instructions and examples consistently support extracting tables/text from construction PDFs and exporting Excel, CSV, JSON, or text outputs.
Instruction Scope
Filesystem access is necessary for reading input PDFs and writing extracted outputs; one line mentions a cloud OCR API option, but the documented default is local pdfplumber/Tesseract and no network is required for basic extraction.
Install Mechanism
The documented installs are normal Python/OCR dependencies such as pdfplumber, pandas, openpyxl, pytesseract, pdf2image, and pypdf; no executable payloads are included in the artifact.
Credentials
Requested filesystem permission is proportionate to PDF processing, and no credentials, tokens, broad local indexing, or external service configuration are required for the core workflow.
Persistence & Privilege
No artifact evidence shows background workers, startup persistence, privilege escalation, credential/session access, deletion routines, or obfuscated behavior.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install pdf-to-structured
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /pdf-to-structured 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v2.0.0
Major update with enhanced extraction features and expanded documentation. - Added detailed README covering ETL workflow, quick start, and advanced use cases. - Now supports both native (pdfplumber) and scanned (OCR) PDF extraction. - Includes construction-specific examples for specifications, BOMs, and schedules. - Provides code samples for extracting tables, text, and specific areas from PDFs. - Outlines recommended packages and installation instructions for various extraction scenarios.
v1.0.0
PDF To Structured 1.0.0 - Initial release. - Extracts structured data (tables, text) from construction-related PDF documents. - Supports both native PDFs (via pdfplumber) and scanned PDFs (via OCR with pytesseract/pdf2image). - Provides utilities to save extracted data to Excel, CSV, or JSON. - Includes targeted extraction examples for specifications, BOMs, schedules, and custom table areas. - Installation and usage instructions included for all processing modes.
元数据
Slug pdf-to-structured
版本 2.0.0
许可证
累计安装 36
当前安装数 36
历史版本数 2
常见问题

Pdf To Structured 是什么?

Extract structured data from construction PDFs. Convert specifications, BOMs, schedules, and reports from PDF to Excel/CSV/JSON. Use OCR for scanned documents and pdfplumber for native PDFs. 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 4472 次。

如何安装 Pdf To Structured?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install pdf-to-structured」即可一键安装,无需额外配置。

Pdf To Structured 是免费的吗?

是的,Pdf To Structured 完全免费(开源免费),可自由下载、安装和使用。

Pdf To Structured 支持哪些平台?

Pdf To Structured 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 Pdf To Structured?

由 datadrivenconstruction(@datadrivenconstruction)开发并维护,当前版本 v2.0.0。

💬 留言讨论