← Back to Skills Marketplace
techtonicllc

frompdf

by techtonicllc · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ✓ Security Clean
160
Downloads
0
Stars
0
Active Installs
1
Versions
Install in OpenClaw
/install frompdf-api
Description
PDF extraction API for AI agents and LLM pipelines. Converts any PDF into semantic AST, markdown, HTML, plain text, or LLM-ready chunks — no page limit. Also...
README (SKILL.md)

frompdf

Convert any PDF into structured, LLM-ready content via a single API call. Returns a semantic AST with every element — headings, paragraphs, tables, lists, metadata — properly typed and nested. No page limit. Handles encrypted PDFs, complex layouts, and multi-hundred-page documents.

Quick start

# Register (10 free credits, no credit card)
curl -s -X POST https://api.frompdf.dev/register \
  -H "Content-Type: application/json" \
  -d '{"email": "[email protected]", "password": "yourpassword"}'
# → {"api_key": "frompdf_..."}

# Extract a PDF (returns JSON semantic AST by default)
curl -s -X POST https://api.frompdf.dev/v1/extract \
  -H "Authorization: Bearer $FROMPDF_API_KEY" \
  -F "[email protected]"

Output formats

# Semantic AST — typed elements: headings, paragraphs, tables, lists (default)
-F "format=json"

# Markdown — structure preserved, human-readable
-F "format=markdown"

# HTML — full document with tags intact
-F "format=html"

# Plain text — clean extraction, no markup
-F "format=text"

# LLM-ready chunks — pre-split for RAG / vector store ingestion
-F "format=chunks"

All endpoints

# Extract content from a PDF (1 credit)
curl -s -X POST https://api.frompdf.dev/v1/extract \
  -H "Authorization: Bearer $FROMPDF_API_KEY" \
  -F "[email protected]" \
  -F "format=chunks"

# Encrypted PDF
curl -s -X POST https://api.frompdf.dev/v1/extract \
  -H "Authorization: Bearer $FROMPDF_API_KEY" \
  -F "[email protected]" \
  -F "password=secret"

# Semantic diff — compare two PDFs, get structured changes (2 credits)
curl -s -X POST https://api.frompdf.dev/v1/diff \
  -H "Authorization: Bearer $FROMPDF_API_KEY" \
  -F "[email protected]" \
  -F "[email protected]"

# Readability score — returns 0-100 score for a PDF (1 credit)
curl -s -X POST https://api.frompdf.dev/v1/score \
  -H "Authorization: Bearer $FROMPDF_API_KEY" \
  -F "[email protected]"

# Check credits and subscription status (free)
curl -s https://api.frompdf.dev/v1/usage \
  -H "Authorization: Bearer $FROMPDF_API_KEY"

Example output (JSON)

{
  "title": "AWS Lambda Developer Guide",
  "pages": 87,
  "sections": [
    { "type": "heading", "level": 1, "text": "Getting Started" },
    { "type": "paragraph", "text": "AWS Lambda is a serverless compute service..." },
    {
      "type": "table",
      "headers": ["Runtime", "Version", "Status"],
      "rows": [["Node.js 20", "20.x", "Active"], ["Python 3.12", "3.12", "Active"]]
    },
    { "type": "list", "items": ["Function", "Trigger", "Execution role"] }
  ],
  "metadata": { "author": "Amazon Web Services", "created": "2024-01-15" }
}

Pricing

$0.01/credit — extract (1), diff (2), score (1). First 10 credits free, no credit card required.

Data & privacy

PDF contents are uploaded to api.frompdf.dev for processing. Do not use with confidential documents unless you have reviewed the privacy policy. Requires FROMPDF_API_KEY env var — register free at /register.

Usage Guidance
This skill appears to do what it claims: it uploads PDFs to api.frompdf.dev and returns structured output. Before installing or using it: (1) review the provider's privacy policy and terms to confirm how long files are retained and whether data may be used for training; (2) do not upload sensitive or confidential documents unless you trust the service or have an enterprise/data-processing agreement; (3) treat FROMPDF_API_KEY like any API secret—store it securely, rotate it if compromised, and limit its scope if the provider supports scoped keys; (4) verify the service's legitimacy (homepage, HTTPS/TLS, community feedback) if you intend to process many or highly sensitive files.
Capability Analysis
Type: OpenClaw Skill Name: frompdf-api Version: 1.0.0 The frompdf-api skill is a standard API wrapper for a PDF processing service (api.frompdf.dev). It provides instructions for an AI agent to extract structured content from PDFs using curl commands. The documentation (SKILL.md) is transparent about its functionality, the requirement for an API key, and the fact that PDF files are uploaded to an external endpoint for processing. No evidence of malicious intent, obfuscation, or prompt injection was found.
Capability Assessment
Purpose & Capability
Name, description, and runtime instructions all describe a hosted PDF-extraction API. The single required env var (FROMPDF_API_KEY) is appropriate for an external API. No unrelated credentials, binaries, or installs are requested.
Instruction Scope
SKILL.md explicitly instructs uploading PDF files to api.frompdf.dev and shows endpoints for extract/diff/score/usage. This is in-scope for the stated purpose. Note: uploading document contents to a third-party server is inherent to this skill — users should avoid confidential documents unless they've reviewed the provider's privacy policy and terms.
Install Mechanism
Instruction-only skill with no install spec and no code files. Lowest installation risk — nothing is written to disk by the skill itself.
Credentials
Only one required environment variable (FROMPDF_API_KEY), which matches the documented use of Bearer auth for the API. No extraneous secrets, config paths, or broad credential requests are present.
Persistence & Privilege
always is false and the skill does not request persistent system privileges or modifications to other skills or system settings. Autonomous invocation is allowed (platform default) but not combined with other red flags.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install frompdf-api
  3. After installation, invoke the skill by name or use /frompdf-api
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
Initial release of frompdf API skill. - Extracts structured content from any PDF: headings, paragraphs, tables, lists, metadata, and more. - Supports multiple output formats: semantic AST (JSON), Markdown, HTML, plain text, and LLM-ready chunks. - No page limit; handles encrypted and complex PDFs. - Provides semantic diff between two PDFs and PDF readability scoring. - Requires FROMPDF_API_KEY; first 10 credits free with registration.
Metadata
Slug frompdf-api
Version 1.0.0
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 1
Frequently Asked Questions

What is frompdf?

PDF extraction API for AI agents and LLM pipelines. Converts any PDF into semantic AST, markdown, HTML, plain text, or LLM-ready chunks — no page limit. Also... It is an AI Agent Skill for Claude Code / OpenClaw, with 160 downloads so far.

How do I install frompdf?

Run "/install frompdf-api" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is frompdf free?

Yes, frompdf is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does frompdf support?

frompdf is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created frompdf?

It is built and maintained by techtonicllc (@techtonicllc); the current version is v1.0.0.

💬 Comments