Data Spider
/install data-spider
Data Spider
Scrape and extract structured data from any webpage. Supports schema-guided extraction to match a specific data shape, or auto-detection of structure. Returns data as JSON object, table (columns + rows), or flat list depending on your chosen format.
When to Use
- Extracting product information or pricing from pages
- Gathering statistics and figures from articles
- Building datasets from web sources
- Schema-guided extraction to match your data model
- Research and competitive analysis
Usage Flow
- Provide a webpage
url - Optionally provide a
schemaobject — data will be extracted to match that exact shape - Optionally set
format:json(default),table, orlist - AIProx routes to the data-spider agent
- Returns structured data in the requested format, plus summary and source URL
Security Manifest
| Permission | Scope | Reason |
|---|---|---|
| Network | aiprox.dev | API calls to orchestration endpoint |
| Env Read | AIPROX_SPEND_TOKEN | Authentication for paid API |
Make Request — JSON with Schema
curl -X POST https://aiprox.dev/api/orchestrate \
-H "Content-Type: application/json" \
-H "X-Spend-Token: $AIPROX_SPEND_TOKEN" \
-d '{
"url": "https://example.com/pricing",
"schema": {"free_tier": null, "pro_price": null, "enterprise": null},
"format": "json"
}'
Response — JSON
{
"data": {"free_tier": "$0/month, 1000 API calls", "pro_price": "$29/month", "enterprise": "custom pricing"},
"summary": "SaaS pricing page with three tiers.",
"source": "https://example.com/pricing",
"format": "json"
}
Make Request — Table
curl -X POST https://aiprox.dev/api/orchestrate \
-H "Content-Type: application/json" \
-H "X-Spend-Token: $AIPROX_SPEND_TOKEN" \
-d '{
"task": "extract pricing tiers as a table",
"url": "https://example.com/pricing",
"format": "table"
}'
Response — Table
{
"columns": ["Plan", "Price", "API Calls"],
"rows": [
["Free", "$0/month", "1,000"],
["Pro", "$29/month", "50,000"],
["Enterprise", "Custom", "Unlimited"]
],
"summary": "Three-tier SaaS pricing.",
"source": "https://example.com/pricing",
"format": "table"
}
Response — List
{
"items": ["$0/month — Free tier, 1000 API calls", "$29/month — Pro, 50,000 calls", "Enterprise — custom pricing"],
"summary": "SaaS pricing tiers extracted as flat list.",
"source": "https://example.com/pricing",
"format": "list"
}
Trust Statement
Data Spider fetches and analyzes webpage contents via URL. Content is processed transiently and not stored. Analysis is performed by Claude via LightningProx. Respects robots.txt and rate limits. Your spend token is used for payment only.
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat:
/install data-spider - After installation, invoke the skill by name or use
/data-spider - Provide required inputs per the skill's parameter spec and get structured output
What is Data Spider?
Scrape any webpage and extract structured data as JSON, table, or list. Supports schema-guided extraction. It is an AI Agent Skill for Claude Code / OpenClaw, with 597 downloads so far.
How do I install Data Spider?
Run "/install data-spider" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.
Is Data Spider free?
Yes, Data Spider is completely free, licensed under MIT-0. You can download, install and use it at no cost.
Which platforms does Data Spider support?
Data Spider is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).
Who created Data Spider?
It is built and maintained by unixlamadev-spec (@unixlamadev-spec); the current version is v1.1.0.