← Back to Skills Marketplace
rishabhdugar

PDF to Excel

by Rishabh Dugar · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ✓ Security Clean
74
Downloads
0
Stars
1
Active Installs
1
Versions
Install in OpenClaw
/install pdf-to-xlsx
Description
Extract tables and text from a PDF into an Excel workbook (XLSX). Each page becomes a separate sheet.
README (SKILL.md)

PDF to Excel

What It Does

Extracts tables and text from a PDF into an Excel workbook. Each page becomes a separate sheet. Uses PyMuPDF table detection with fallback to line-by-line text extraction.

When to Use

  • Extract tabular data from PDF invoices or reports
  • Convert PDF financial statements to Excel
  • Pull structured data from PDF tables

Required Inputs

Provide one of:

  • url — public URL to a PDF
  • file — base64-encoded PDF
  • Multipart upload with file field

Authentication

Send your API key in the CLIENT-API-KEY header.

Get your free API key at https://pdfapihub.com. Full API documentation is available at https://pdfapihub.com/docs.

Use Cases

  • Invoice Data Extraction — Pull line items and totals from PDF invoices into Excel for accounting
  • Financial Statement Analysis — Convert PDF bank statements to Excel for analysis and reconciliation
  • Report Data Mining — Extract tabular data from PDF reports for further processing
  • Procurement — Convert PDF purchase orders into spreadsheets for tracking
  • Tax Preparation — Extract financial data from PDF tax documents into Excel

Aliases

  • /v1/convert/pdf/excel is an alias

Example Usage

curl -X POST https://pdfapihub.com/api/v1/convert/pdf/xlsx \
  -H "CLIENT-API-KEY: your_api_key" \
  -H "Content-Type: application/json" \
  -d '{ "url": "https://pdfapihub.com/sample-pdfinvoice-with-image.pdf", "output": "url" }'
Usage Guidance
This skill delegates conversion to an external service (https://pdfapihub.com) and requires you to provide an API key. Before using it: (1) Do not upload confidential or regulated documents unless you trust the provider and have checked their privacy/retention policy; (2) Verify the provider's reputation and pricing (the skill's source/homepage is unknown); (3) Be aware the SKILL.md mentions PyMuPDF (likely describing the provider's implementation) but the agent will send your PDF to the remote API for processing; (4) If you need on‑device processing for privacy reasons, consider local tools that avoid sending files off your machine.
Capability Analysis
Type: OpenClaw Skill Name: pdf-to-xlsx Version: 1.0.0 The skill is a standard API wrapper for a third-party service (pdfapihub.com) designed to convert PDF files to Excel workbooks. The documentation in SKILL.md and the technical definitions in skill.json and example.json are consistent with the stated purpose, and there is no evidence of malicious execution, data exfiltration beyond the intended PDF processing, or prompt-injection attacks.
Capability Tags
cryptocan-make-purchasesrequires-sensitive-credentials
Capability Assessment
Purpose & Capability
The name/description (convert PDF tables/text to XLSX) matches the instructions and the skill.json (POST to pdfapihub API). The only minor inconsistency is a claim in SKILL.md that it "Uses PyMuPDF table detection" — the skill is instruction-only and actually calls an external API, so that line appears to describe the remote service implementation (or is misleading) rather than local behavior.
Instruction Scope
Runtime instructions are limited and explicit: supply a PDF (URL/base64/multipart) and an API key via the CLIENT-API-KEY header, POST to the documented endpoint, and receive XLSX output. This will send the full PDF contents to a third‑party endpoint (pdfapihub.com), which is expected for a hosted conversion service but is a privacy consideration and should be understood before uploading sensitive documents.
Install Mechanism
No install spec and no code files — the skill is instruction-only, so nothing is written to disk and no external packages are pulled in by the skill itself.
Credentials
The skill does not request environment variables or local credentials. It requires an API key in the CLIENT-API-KEY header (declared in skill.json), which is proportional to calling an external API. There are no unrelated credentials or config paths requested.
Persistence & Privilege
The skill does not request always:true, does not install or modify agent configuration, and has no persistent/system privileges. Autonomous invocation is allowed by platform default but is not combined with broad privileges here.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install pdf-to-xlsx
  3. After installation, invoke the skill by name or use /pdf-to-xlsx
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
Extract tables and text from PDF into Excel workbook (XLSX). Each page becomes a separate sheet. Uses PyMuPDF table detection with fallback to line-by-line extraction. Great for invoices, financial statements, and reports.
Metadata
Slug pdf-to-xlsx
Version 1.0.0
License MIT-0
All-time Installs 1
Active Installs 1
Total Versions 1
Frequently Asked Questions

What is PDF to Excel?

Extract tables and text from a PDF into an Excel workbook (XLSX). Each page becomes a separate sheet. It is an AI Agent Skill for Claude Code / OpenClaw, with 74 downloads so far.

How do I install PDF to Excel?

Run "/install pdf-to-xlsx" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is PDF to Excel free?

Yes, PDF to Excel is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does PDF to Excel support?

PDF to Excel is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created PDF to Excel?

It is built and maintained by Rishabh Dugar (@rishabhdugar); the current version is v1.0.0.

💬 Comments