← Back to Skills Marketplace
datadrivenconstruction

Pdf To Structured

cross-platform ✓ Security Clean
4472
Downloads
9
Stars
36
Active Installs
2
Versions
Install in OpenClaw
/install pdf-to-structured
Description
Extract structured data from construction PDFs. Convert specifications, BOMs, schedules, and reports from PDF to Excel/CSV/JSON. Use OCR for scanned documents and pdfplumber for native PDFs.
Usage Guidance
Install only if you are comfortable letting the skill read the PDFs you choose and write extracted data to local files. Construction PDFs and exported CSV/Excel/JSON files may contain sensitive project, commercial, or personal information, so store outputs in approved locations and review them before sharing.
Capability Analysis
Type: OpenClaw Skill Name: pdf-to-structured Version: 2.0.0 The skill is designed for extracting structured data from PDFs using local libraries like pdfplumber and pytesseract. All code examples and instructions align with this stated purpose. The `claw.json` requests 'filesystem' permission, which is necessary for reading PDFs and writing output files. Crucially, `instructions.md` explicitly states 'No network access required for basic extraction', and no network calls or attempts to exfiltrate data are found in the provided code. There are no signs of prompt injection, obfuscation, or other malicious behaviors.
Capability Assessment
Purpose & Capability
The instructions and examples consistently support extracting tables/text from construction PDFs and exporting Excel, CSV, JSON, or text outputs.
Instruction Scope
Filesystem access is necessary for reading input PDFs and writing extracted outputs; one line mentions a cloud OCR API option, but the documented default is local pdfplumber/Tesseract and no network is required for basic extraction.
Install Mechanism
The documented installs are normal Python/OCR dependencies such as pdfplumber, pandas, openpyxl, pytesseract, pdf2image, and pypdf; no executable payloads are included in the artifact.
Credentials
Requested filesystem permission is proportionate to PDF processing, and no credentials, tokens, broad local indexing, or external service configuration are required for the core workflow.
Persistence & Privilege
No artifact evidence shows background workers, startup persistence, privilege escalation, credential/session access, deletion routines, or obfuscated behavior.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install pdf-to-structured
  3. After installation, invoke the skill by name or use /pdf-to-structured
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v2.0.0
Major update with enhanced extraction features and expanded documentation. - Added detailed README covering ETL workflow, quick start, and advanced use cases. - Now supports both native (pdfplumber) and scanned (OCR) PDF extraction. - Includes construction-specific examples for specifications, BOMs, and schedules. - Provides code samples for extracting tables, text, and specific areas from PDFs. - Outlines recommended packages and installation instructions for various extraction scenarios.
v1.0.0
PDF To Structured 1.0.0 - Initial release. - Extracts structured data (tables, text) from construction-related PDF documents. - Supports both native PDFs (via pdfplumber) and scanned PDFs (via OCR with pytesseract/pdf2image). - Provides utilities to save extracted data to Excel, CSV, or JSON. - Includes targeted extraction examples for specifications, BOMs, schedules, and custom table areas. - Installation and usage instructions included for all processing modes.
Metadata
Slug pdf-to-structured
Version 2.0.0
License
All-time Installs 36
Active Installs 36
Total Versions 2
Frequently Asked Questions

What is Pdf To Structured?

Extract structured data from construction PDFs. Convert specifications, BOMs, schedules, and reports from PDF to Excel/CSV/JSON. Use OCR for scanned documents and pdfplumber for native PDFs. It is an AI Agent Skill for Claude Code / OpenClaw, with 4472 downloads so far.

How do I install Pdf To Structured?

Run "/install pdf-to-structured" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Pdf To Structured free?

Yes, Pdf To Structured is completely free (open-source). You can download, install and use it at no cost.

Which platforms does Pdf To Structured support?

Pdf To Structured is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Pdf To Structured?

It is built and maintained by datadrivenconstruction (@datadrivenconstruction); the current version is v2.0.0.

💬 Comments