frompdf

by techtonicllc · v1.0.0 · MIT-0

cross-platform ✓ Security Clean

Downloads: 160

Install in OpenClaw

/install frompdf-api

Description

PDF extraction API for AI agents and LLM pipelines. Converts any PDF into semantic AST, markdown, HTML, plain text, or LLM-ready chunks — no page limit. Also...

README (SKILL.md)

frompdf

Convert any PDF into structured, LLM-ready content via a single API call. Returns a semantic AST with every element — headings, paragraphs, tables, lists, metadata — properly typed and nested. No page limit. Handles encrypted PDFs, complex layouts, and multi-hundred-page documents.

Quick start

# Register (10 free credits, no credit card)
curl -s -X POST https://api.frompdf.dev/register \
  -H "Content-Type: application/json" \
  -d '{"email": "you@example.com", "password": "yourpassword"}'
# → {"api_key": "frompdf_..."}

# Extract a PDF (returns JSON semantic AST by default)
curl -s -X POST https://api.frompdf.dev/v1/extract \
  -H "Authorization: Bearer $FROMPDF_API_KEY" \
  -F "[email protected]"

Output formats

# Semantic AST — typed elements: headings, paragraphs, tables, lists (default)
-F "format=json"

# Markdown — structure preserved, human-readable
-F "format=markdown"

# HTML — full document with tags intact
-F "format=html"

# Plain text — clean extraction, no markup
-F "format=text"

# LLM-ready chunks — pre-split for RAG / vector store ingestion
-F "format=chunks"
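
Since the format is passed as a free-form field, a small guard can catch a typo before a request is sent. The sketch below assumes the five values listed above are the only accepted ones; `is_valid_format` is a hypothetical helper, not part of the API.

```shell
# Hypothetical helper (not part of the frompdf API): succeed only for the
# five format values documented above.
is_valid_format() {
  case "$1" in
    json|markdown|html|text|chunks) return 0 ;;
    *) return 1 ;;
  esac
}

# Usage: fall back to the documented default when the value is unknown.
format="${FORMAT:-json}"
is_valid_format "$format" || format="json"
```

The resulting value can then be passed as -F "format=$format" in any of the extract calls.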

All endpoints

# Extract content from a PDF (1 credit)
curl -s -X POST https://api.frompdf.dev/v1/extract \
  -H "Authorization: Bearer $FROMPDF_API_KEY" \
  -F "[email protected]" \
  -F "format=chunks"

# Encrypted PDF
curl -s -X POST https://api.frompdf.dev/v1/extract \
  -H "Authorization: Bearer $FROMPDF_API_KEY" \
  -F "[email protected]" \
  -F "password=secret"

# Semantic diff — compare two PDFs, get structured changes (2 credits)
curl -s -X POST https://api.frompdf.dev/v1/diff \
  -H "Authorization: Bearer $FROMPDF_API_KEY" \
  -F "[email protected]" \
  -F "[email protected]"

# Readability score — returns 0-100 score for a PDF (1 credit)
curl -s -X POST https://api.frompdf.dev/v1/score \
  -H "Authorization: Bearer $FROMPDF_API_KEY" \
  -F "[email protected]"

# Check credits and subscription status (free)
curl -s https://api.frompdf.dev/v1/usage \
  -H "Authorization: Bearer $FROMPDF_API_KEY"

Example output (JSON)

{
  "title": "AWS Lambda Developer Guide",
  "pages": 87,
  "sections": [
    { "type": "heading", "level": 1, "text": "Getting Started" },
    { "type": "paragraph", "text": "AWS Lambda is a serverless compute service..." },
    {
      "type": "table",
      "headers": ["Runtime", "Version", "Status"],
      "rows": [["Node.js 20", "20.x", "Active"], ["Python 3.12", "3.12", "Active"]]
    },
    { "type": "list", "items": ["Function", "Trigger", "Execution role"] }
  ],
  "metadata": { "author": "Amazon Web Services", "created": "2024-01-15" }
}
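
Because every element carries a type field, the AST can be filtered with any JSON tool. The following is a minimal sketch using jq, which is assumed to be installed and is not mentioned by the skill itself. It relies only on the field names shown in the example above; real responses may contain more element types or nesting, and both function names are illustrative.

```shell
# list_headings FILE
# Print "level<TAB>text" for every heading element in a saved response.
list_headings() {
  jq -r '.sections[] | select(.type == "heading") | "\(.level)\t\(.text)"' "$1"
}

# table_rows_tsv FILE
# Print each table as tab-separated lines: the header row, then the data rows.
table_rows_tsv() {
  jq -r '.sections[] | select(.type == "table") | (.headers, .rows[]) | @tsv' "$1"
}
```

Saving the extract response to a file first (for example with curl's -o option) keeps the credit-consuming call separate from the local post-processing.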

Pricing

$0.01 per credit. An extract costs 1 credit, a diff 2 credits, and a score 1 credit. The first 10 credits are free, with no credit card required.
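
As a worked example of the arithmetic, the sketch below totals the credits for a batch of calls and converts them to dollars. It assumes the 10 free credits are a one-time allowance deducted before billing starts, which the listing does not spell out; `estimate_cost` is a hypothetical helper.

```shell
# estimate_cost EXTRACTS DIFFS SCORES
# Prints the credits used and the cost in dollars at $0.01 per credit.
# Assumption: the 10 free credits are deducted once, before billing starts.
estimate_cost() (
  extracts=${1:-0}; diffs=${2:-0}; scores=${3:-0}
  credits=$(( extracts + 2 * diffs + scores ))
  billable=$(( credits > 10 ? credits - 10 : 0 ))
  # One credit costs one cent, so billable credits equal the cost in cents.
  printf '%d credits, $%d.%02d\n' "$credits" $(( billable / 100 )) $(( billable % 100 ))
)

estimate_cost 100 20 5   # prints: 145 credits, $1.35
```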

Data & privacy

PDF contents are uploaded to api.frompdf.dev for processing. Do not use with confidential documents unless you have reviewed the privacy policy. Requires the FROMPDF_API_KEY environment variable; register for free at https://api.frompdf.dev/register.
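
Since every authenticated call depends on FROMPDF_API_KEY, a script can check for it up front instead of sending a request with an empty Bearer token. A minimal sketch, where `require_api_key` is a hypothetical helper and not something the skill ships:

```shell
# Hypothetical guard: fail early when FROMPDF_API_KEY is unset or empty,
# without ever printing the key itself.
require_api_key() {
  if [ -z "${FROMPDF_API_KEY:-}" ]; then
    echo "FROMPDF_API_KEY is not set; register at https://api.frompdf.dev/register" >&2
    return 1
  fi
}

# Usage: only call the free usage endpoint once the key is present.
# require_api_key && curl -s https://api.frompdf.dev/v1/usage \
#   -H "Authorization: Bearer $FROMPDF_API_KEY"
```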

Usage Guidance

This skill appears to do what it claims: it uploads PDFs to api.frompdf.dev and returns structured output. Before installing or using it:

(1) Review the provider's privacy policy and terms to confirm how long files are retained and whether data may be used for training.
(2) Do not upload sensitive or confidential documents unless you trust the service or have an enterprise/data-processing agreement.
(3) Treat FROMPDF_API_KEY like any API secret: store it securely, rotate it if compromised, and limit its scope if the provider supports scoped keys.
(4) Verify the service's legitimacy (homepage, HTTPS/TLS, community feedback) if you intend to process many or highly sensitive files.

Capability Analysis

Type: OpenClaw Skill
Name: frompdf-api
Version: 1.0.0

The frompdf-api skill is a standard API wrapper for a PDF processing service (api.frompdf.dev). It provides instructions for an AI agent to extract structured content from PDFs using curl commands. The documentation (SKILL.md) is transparent about its functionality, the requirement for an API key, and the fact that PDF files are uploaded to an external endpoint for processing. No evidence of malicious intent, obfuscation, or prompt injection was found.

Capability Assessment

✓ Purpose & Capability

Name, description, and runtime instructions all describe a hosted PDF-extraction API. The single required env var (FROMPDF_API_KEY) is appropriate for an external API. No unrelated credentials, binaries, or installs are requested.

ℹ Instruction Scope

SKILL.md explicitly instructs uploading PDF files to api.frompdf.dev and shows endpoints for extract/diff/score/usage. This is in-scope for the stated purpose. Note: uploading document contents to a third-party server is inherent to this skill — users should avoid confidential documents unless they've reviewed the provider's privacy policy and terms.

✓ Install Mechanism

Instruction-only skill with no install spec and no code files. Lowest installation risk — nothing is written to disk by the skill itself.

✓ Credentials

Only one required environment variable (FROMPDF_API_KEY), which matches the documented use of Bearer auth for the API. No extraneous secrets, config paths, or broad credential requests are present.

✓ Persistence & Privilege

The skill's always flag is set to false, and the skill does not request persistent system privileges or modifications to other skills or system settings. Autonomous invocation is allowed (the platform default) but is not combined with any other red flags.

How to Use

1. Make sure OpenClaw is installed (local or Docker).
2. Run the install command in chat: /install frompdf-api
3. After installation, invoke the skill by name or use /frompdf-api
4. Provide the required inputs per the skill's parameter spec and get structured output.

Version History

v1.0.0

Initial release of frompdf API skill.
- Extracts structured content from any PDF: headings, paragraphs, tables, lists, metadata, and more.
- Supports multiple output formats: semantic AST (JSON), Markdown, HTML, plain text, and LLM-ready chunks.
- No page limit; handles encrypted and complex PDFs.
- Provides semantic diff between two PDFs and PDF readability scoring.
- Requires FROMPDF_API_KEY; first 10 credits free with registration.

Metadata

Slug frompdf-api

Version 1.0.0

License MIT-0

All-time Installs 0

Active Installs 0

Total Versions 1

Frequently Asked Questions

What is frompdf?

PDF extraction API for AI agents and LLM pipelines. Converts any PDF into semantic AST, markdown, HTML, plain text, or LLM-ready chunks — no page limit. Also... It is an AI Agent Skill for Claude Code / OpenClaw, with 160 downloads so far.

How do I install frompdf?

Run "/install frompdf-api" in the OpenClaw or Claude Code chat to install it in one step. The skill itself needs no further installation, but its API calls require the FROMPDF_API_KEY environment variable.

Is frompdf free?

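The skill itself is licensed under MIT-0 and can be downloaded and installed at no cost. The API it calls is metered: the first 10 credits are free, and further credits cost $0.01 each (extract 1, diff 2, score 1).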

Which platforms does frompdf support?

frompdf is cross-platform and runs anywhere OpenClaw / Claude Code is available.

Who created frompdf?

It is built and maintained by techtonicllc (@techtonicllc); the current version is v1.0.0.
