Data Spider
/install data-spider
Data Spider
Scrape and extract structured data from any webpage. Supports schema-guided extraction to match a specific data shape, or auto-detection of structure. Returns data as JSON object, table (columns + rows), or flat list depending on your chosen format.
When to Use
- Extracting product information or pricing from pages
- Gathering statistics and figures from articles
- Building datasets from web sources
- Schema-guided extraction to match your data model
- Research and competitive analysis
Usage Flow
- Provide a webpage
url - Optionally provide a
schemaobject — data will be extracted to match that exact shape - Optionally set
format:json(default),table, orlist - AIProx routes to the data-spider agent
- Returns structured data in the requested format, plus summary and source URL
Security Manifest
| Permission | Scope | Reason |
|---|---|---|
| Network | aiprox.dev | API calls to orchestration endpoint |
| Env Read | AIPROX_SPEND_TOKEN | Authentication for paid API |
Make Request — JSON with Schema
curl -X POST https://aiprox.dev/api/orchestrate \
-H "Content-Type: application/json" \
-H "X-Spend-Token: $AIPROX_SPEND_TOKEN" \
-d '{
"url": "https://example.com/pricing",
"schema": {"free_tier": null, "pro_price": null, "enterprise": null},
"format": "json"
}'
Response — JSON
{
"data": {"free_tier": "$0/month, 1000 API calls", "pro_price": "$29/month", "enterprise": "custom pricing"},
"summary": "SaaS pricing page with three tiers.",
"source": "https://example.com/pricing",
"format": "json"
}
Make Request — Table
curl -X POST https://aiprox.dev/api/orchestrate \
-H "Content-Type: application/json" \
-H "X-Spend-Token: $AIPROX_SPEND_TOKEN" \
-d '{
"task": "extract pricing tiers as a table",
"url": "https://example.com/pricing",
"format": "table"
}'
Response — Table
{
"columns": ["Plan", "Price", "API Calls"],
"rows": [
["Free", "$0/month", "1,000"],
["Pro", "$29/month", "50,000"],
["Enterprise", "Custom", "Unlimited"]
],
"summary": "Three-tier SaaS pricing.",
"source": "https://example.com/pricing",
"format": "table"
}
Response — List
{
"items": ["$0/month — Free tier, 1000 API calls", "$29/month — Pro, 50,000 calls", "Enterprise — custom pricing"],
"summary": "SaaS pricing tiers extracted as flat list.",
"source": "https://example.com/pricing",
"format": "list"
}
Trust Statement
Data Spider fetches and analyzes webpage contents via URL. Content is processed transiently and not stored. Analysis is performed by Claude via LightningProx. Respects robots.txt and rate limits. Your spend token is used for payment only.
- 确保已安装 OpenClaw(本地或 Docker 部署)
- 在对话框中输入安装命令:
/install data-spider - 安装完成后,直接呼叫该 Skill 的名称或使用
/data-spider触发 - 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
Data Spider 是什么?
Scrape any webpage and extract structured data as JSON, table, or list. Supports schema-guided extraction. 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 597 次。
如何安装 Data Spider?
在 OpenClaw 或 Claude Code 对话框中运行命令「/install data-spider」即可一键安装,无需额外配置。
Data Spider 是免费的吗?
是的,Data Spider 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。
Data Spider 支持哪些平台?
Data Spider 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。
谁开发了 Data Spider?
由 unixlamadev-spec(@unixlamadev-spec)开发并维护,当前版本 v1.1.0。