← Back to Skills Marketplace
unixlamadev-spec

Doc Miner

by unixlamadev-spec · GitHub ↗ · v1.1.0 · MIT-0
cross-platform ✓ Security Clean
351
Downloads
0
Stars
3
Active Installs
3
Versions
Install in OpenClaw
/install doc-miner
Description
Extract summaries, answers, or structured data from any URL, PDF, or raw text. Auto-detects mode from task.
README (SKILL.md)

Doc Miner

Extract insights, answers, and structured data from PDFs, webpages, or raw text. Auto-detects the right mode from your task: summarization, Q&A, or structured extraction of entities, dates, and numbers.

When to Use

  • Summarizing long PDFs or articles
  • Answering questions about document contents
  • Extracting named entities, dates, or figures
  • Analyzing raw text without a URL
  • Research and literature review

Usage Flow

  1. Provide a url (PDF or webpage) or paste text directly
  2. Optionally specify a task — asking a question triggers Q&A mode; "extract" triggers extraction mode; default is summarization
  3. AIProx routes to the doc-miner agent
  4. Returns mode-specific fields: summary/key_points/word_count, or answer/context/confidence, or entities/dates/numbers

Security Manifest

Permission Scope Reason
Network aiprox.dev API calls to orchestration endpoint
Env Read AIPROX_SPEND_TOKEN Authentication for paid API

Make Request

curl -X POST https://aiprox.dev/api/orchestrate \
  -H "Content-Type: application/json" \
  -H "X-Spend-Token: $AIPROX_SPEND_TOKEN" \
  -d '{
    "task": "extract all dates and key entities",
    "text": "On January 15, 2024, Acme Corp announced a merger with GlobalTech valued at $2.4 billion..."
  }'

Response (extraction mode)

{
  "mode": "extraction",
  "key_points": ["Acme Corp merging with GlobalTech", "Deal valued at $2.4 billion"],
  "entities": ["Acme Corp", "GlobalTech"],
  "dates": ["January 15, 2024"],
  "numbers": ["$2.4 billion"],
  "source_type": "text"
}

Response (summary mode)

{
  "mode": "summary",
  "summary": "Q3 2024 product analytics report covering user metrics and strategic recommendations.",
  "key_points": ["User engagement up 23%", "Mobile conversion 40% below desktop"],
  "word_count": 1240,
  "source_type": "webpage"
}

Trust Statement

Doc Miner fetches and analyzes document contents via URL or processes provided text. Documents are processed transiently and not stored. Analysis is performed by Claude via LightningProx. Your spend token is used for payment only.

Usage Guidance
This skill appears coherent for document summarization/extraction, but it sends the documents (or URLs) to aiprox.dev for processing. Before installing: (1) verify you trust aiprox.dev and review its privacy/billing policy, (2) treat AIPROX_SPEND_TOKEN as a sensitive credential (limit its scope and rotate if possible), (3) avoid sending sensitive or proprietary documents until you confirm retention policy, and (4) test with non-sensitive data to confirm behavior and billing. If you need stronger guarantees (no third-party egress or local-only processing), this skill is not appropriate.
Capability Analysis
Type: OpenClaw Skill Name: doc-miner Version: 1.1.0 The doc-miner skill is a legitimate utility designed to extract information from text or URLs using the aiprox.dev API. It transparently requests access to the AIPROX_SPEND_TOKEN environment variable for authentication and limits its network activity to the specified orchestration endpoint (aiprox.dev), with no evidence of malicious intent or hidden data exfiltration.
Capability Assessment
Purpose & Capability
Name/description (document summarization/extraction) align with what the skill asks for: a single AIPROX_SPEND_TOKEN to call aiprox.dev. No unrelated binaries, credentials, or config paths are requested.
Instruction Scope
SKILL.md instructs the agent to POST text or a task to https://aiprox.dev/api/orchestrate using the AIPROX_SPEND_TOKEN header — scope is limited to sending user-provided text or a URL to the external service. Note: the skill will transmit document contents to a third-party endpoint (expected for this functionality); the claim that documents are not stored is a trust statement from the provider and is not verifiable from the SKILL.md alone.
Install Mechanism
Instruction-only skill with no install spec and no code files, so nothing is written to disk by an installer. This is the lowest-risk install mechanism.
Credentials
Only one environment variable is required (AIPROX_SPEND_TOKEN), which is proportionate to a paid external API. The token name implies billing/authorization; treat it as sensitive.
Persistence & Privilege
always is false and there is no install-time persistence or requests to modify other skills or system-wide settings. The skill can be invoked autonomously by the agent (platform default).
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install doc-miner
  3. After installation, invoke the skill by name or use /doc-miner
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.1.0
Now supports model selection — specify any of 19 models across 5 providers per request (e.g. gemini-2.5-flash, mistral-large-latest, claude-opus-4-5-20251101)
v1.0.1
- Adds support for analyzing raw text input in addition to URLs/PDFs. - Auto-detects extraction mode (Q&A, summarization, structured data) based on task prompt. - Extraction mode returns structured fields: entities, dates, and numbers. - API documentation updated with new parameters and response formats. - Expanded usage instructions for both text and URL-based workflows.
v1.0.0
Initial release of doc-miner. - Extracts key points, summaries, and answers from any PDF or webpage URL. - Supports question answering about document contents. - Outputs a structured response with answer, summary, and key_points array. - Requires AIPROX_SPEND_TOKEN for authentication. - Documents are processed securely and not stored.
Metadata
Slug doc-miner
Version 1.1.0
License MIT-0
All-time Installs 3
Active Installs 3
Total Versions 3
Frequently Asked Questions

What is Doc Miner?

Extract summaries, answers, or structured data from any URL, PDF, or raw text. Auto-detects mode from task. It is an AI Agent Skill for Claude Code / OpenClaw, with 351 downloads so far.

How do I install Doc Miner?

Run "/install doc-miner" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Doc Miner free?

Yes, Doc Miner is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Doc Miner support?

Doc Miner is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Doc Miner?

It is built and maintained by unixlamadev-spec (@unixlamadev-spec); the current version is v1.1.0.

💬 Comments