Smart Data Harvester (智能数据采集器)

by dxg852621787 · GitHub ↗ · v1.0.1 · MIT-0
cross-platform ⚠ suspicious
Downloads: 55
Stars: 0
Active Installs: 0
Versions: 2
Install in OpenClaw
/install data-harvester
Description
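Smart Data Harvester: an automated tool for data collection, processing, and export. It supports web scraping, API calls, database queries, and file reading, and exports to JSON/CSV/Excel/PDF.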
README (SKILL.md)

智能数据采集器 - Smart Data Harvester

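An automated data-collection tool built for OpenClaw. It collects data from web pages, APIs, databases, and files, cleans and processes it, and exports it in the format you need.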

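🚀 Core Features

  • Multi-source adapters: web scraping, API calls, database queries, file reading
  • Data processing pipeline: cleaning, transformation, aggregation, analysis
  • Multiple export formats: JSON, CSV, Excel, SQLite, PDF reports
  • Scheduled tasks: run data-collection jobs automatically on a schedule
  • OpenClaw integration: a native OpenClaw skill with one-step install
  • Full Chinese support: Chinese-language interface and documentation, designed for users in China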
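
To make the "cleaning, transformation, aggregation" stages concrete, here is a minimal sketch of a staged pipeline over a list of dict records. The function names (`clean`, `transform`, `aggregate`, `run_pipeline`) and the `name`/`price` fields are hypothetical and chosen for illustration; they are not taken from the skill's source code.

```python
from collections import defaultdict

def clean(records):
    """Drop records with an empty 'name' and strip surrounding whitespace."""
    cleaned = []
    for rec in records:
        name = (rec.get("name") or "").strip()
        if name:
            cleaned.append({**rec, "name": name})
    return cleaned

def transform(records):
    """Convert the 'price' field from a string to a float."""
    return [{**rec, "price": float(rec["price"])} for rec in records]

def aggregate(records):
    """Sum prices per name."""
    totals = defaultdict(float)
    for rec in records:
        totals[rec["name"]] += rec["price"]
    return dict(totals)

def run_pipeline(records, stages):
    """Feed the output of each stage into the next one."""
    data = records
    for stage in stages:
        data = stage(data)
    return data

# Hypothetical sample data, not output from the skill.
raw = [
    {"name": " apple ", "price": "1.50"},
    {"name": "", "price": "9.99"},
    {"name": "apple", "price": "2.50"},
    {"name": "pear", "price": "3.00"},
]
summary = run_pipeline(raw, [clean, transform, aggregate])
print(summary)  # {'apple': 4.0, 'pear': 3.0}
```

Keeping each stage a plain function that takes and returns records makes it easy to reorder stages or test them one at a time.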

📦 Installation

clawhub install data-harvester

Or install manually:

git clone https://gitee.com/du-xuegong/openclaw-wealth-guide.git
cd openclaw-wealth-guide
uv pip install -r requirements.txt

🛠️ Quick Start

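In OpenClaw

/技能 数据采集器
采集网页 https://example.com 保存为 data.json
定时采集 https://api.example.com/data 每天 09:00
导出数据为 Excel 报表

In English, these four commands activate the skill, harvest the page https://example.com and save it as data.json, schedule a harvest of https://api.example.com/data every day at 09:00, and export the data as an Excel report.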
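
The README does not say how the "every day at 09:00" schedule is computed. As a rough illustration only, a daily schedule reduces to finding the next occurrence of a wall-clock time; `next_daily_run` below is a hypothetical helper, not part of the skill.

```python
from datetime import datetime, time, timedelta

def next_daily_run(now, run_at):
    """Return the next datetime at which a daily job set for `run_at` should fire."""
    candidate = datetime.combine(now.date(), run_at)
    if candidate <= now:
        # Today's slot has already passed, so schedule for tomorrow.
        candidate += timedelta(days=1)
    return candidate

# Before 09:00 the job runs later the same day; after 09:00 it runs the next day.
print(next_daily_run(datetime(2024, 1, 1, 8, 30), time(9, 0)))  # 2024-01-01 09:00:00
print(next_daily_run(datetime(2024, 1, 1, 9, 30), time(9, 0)))  # 2024-01-02 09:00:00
```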

Python API

from data_harvester import DataHarvester

harvester = DataHarvester()

# Harvest one web source and export the result as JSON.
result = harvester.harvest({
    "sources": [{"type": "web", "url": "https://example.com"}],
    "export": {"format": "json", "path": "output.json"}
})
print(f"Harvest complete: {result['stats']['total_records']} records")

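⚙️ Configuration

Supported data source types:

  • Web adapter: scraping with CSS selectors and XPath
  • API adapter: REST APIs, with authentication support
  • Database adapter: MySQL/PostgreSQL/SQLite
  • File adapter: CSV, Excel, JSON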
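
To show what a file adapter of this kind might do, the sketch below loads CSV and JSON files into a common list-of-dicts shape using only the standard library. `read_records` is a hypothetical name, and the skill's real adapter interface may look quite different. Excel is left out because reading it needs a third-party library.

```python
import csv
import json
from pathlib import Path

def read_records(path):
    """Load a CSV or JSON file into a list of dicts, chosen by file extension."""
    path = Path(path)
    suffix = path.suffix.lower()
    if suffix == ".csv":
        # newline="" lets the csv module handle line endings itself.
        with path.open(newline="", encoding="utf-8") as f:
            return list(csv.DictReader(f))
    if suffix == ".json":
        with path.open(encoding="utf-8") as f:
            data = json.load(f)
        # Wrap a single JSON object so callers always get a list.
        return data if isinstance(data, list) else [data]
    raise ValueError(f"unsupported file type: {suffix}")
```

Note that CSV values come back as strings, so numeric fields still need a conversion step afterwards.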

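❓ FAQ

Q: How do I activate the skill after installing it?
A: Use /技能 数据采集器 in an OpenClaw chat to activate it.

Q: Which data sources are supported?
A: Four types: web pages, API endpoints, databases, and files.

Q: Need custom development or deployment consulting?
A: Visit https://openclawx.asia for enterprise support services.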

📄 License

MIT-0: free to use, no attribution required.

📞 Contact

  • Author: dxg
  • Email: [email protected]
  • GitHub: https://github.com/dxg852621787
  • Enterprise services: https://openclawx.asia
Usage Guidance
Review carefully before installing. Only use this skill with least-privilege, read-only database/API credentials; avoid giving it OpenClaw admin or publishing tokens; configure exports to non-sensitive paths; and enable scheduled collection only when you understand what systems it will repeatedly access.
Capability Tags
requires-sensitive-credentials
Capability Assessment
Purpose & Capability
The core web/API/database/file collection, processing, scheduling, and export behavior matches the stated data-harvester purpose, but the bundled OpenClaw API client also includes skill registration, update, install, uninstall, arbitrary skill-operation execution, and package upload methods that are not needed for ordinary data harvesting.
Instruction Scope
User-facing docs disclose broad collection and scheduled export at a feature level, but they do not clearly scope sensitive data handling, credential use, recurring access, overwrite behavior, or the separate high-impact platform-management API methods.
Install Mechanism
Installation is a normal Python dependency install from requirements.txt, and both the VirusTotal and static scans were clean. Dependencies are specified as version ranges rather than pinned versions, which is a supply-chain hygiene risk rather than direct evidence of malice.
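
To check the range-versus-pinned point before installing, a small script can list the requirement lines that are not pinned with `==`. This is a simplified sketch: `unpinned_requirements` is a hypothetical helper, the sample text is made up and is not the skill's actual requirements.txt, and the regular expression covers only common cases rather than the full requirement-specifier grammar.

```python
import re

# A line counts as pinned if it is "name==exact_version", optionally followed
# by an environment marker; wildcard pins such as "==4.*" are treated as loose.
PINNED = re.compile(r"^[A-Za-z0-9._\-\[\],]+\s*==\s*[^\s;*]+\s*(;.*)?$")

def unpinned_requirements(text):
    """Return requirement lines that do not pin an exact version."""
    loose = []
    for line in text.splitlines():
        req = line.split("#", 1)[0].strip()  # drop comments
        if not req or req.startswith("-"):   # skip blanks and pip options
            continue
        if not PINNED.match(req):
            loose.append(req)
    return loose

# Hypothetical sample, for illustration only.
sample = """
requests>=2.28,<3
beautifulsoup4==4.12.3
pandas~=2.0  # compatible-release range
"""
print(unpinned_requirements(sample))  # ['requests>=2.28,<3', 'pandas~=2.0']
```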
Credentials
Network access, file reads/writes, database access, authenticated API calls, and scheduling are proportionate for a data harvester, but database queries are configuration-provided SQL and the OpenClaw client can act on platform state beyond the advertised data workflow.
Persistence & Privilege
The scheduler can run recurring background tasks and exports can write local files, which is expected for this purpose, but the artifacts do not provide strong user-facing warnings, retention guidance, or guardrails around ongoing collection and credential-backed access.
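
Because recurring exports write local files, one simple guardrail is to refuse to overwrite an existing file. The sketch below does this with Python's exclusive-creation mode (`"x"`); `export_json_no_overwrite` is a hypothetical helper and does not describe how the skill itself writes exports.

```python
import json
import tempfile
from pathlib import Path

def export_json_no_overwrite(records, path):
    """Write records as JSON, failing with FileExistsError if the file exists."""
    with open(path, "x", encoding="utf-8") as f:
        json.dump(records, f, ensure_ascii=False, indent=2)

out = Path(tempfile.mkdtemp()) / "output.json"
export_json_no_overwrite([{"id": 1}], out)      # first export succeeds
try:
    export_json_no_overwrite([{"id": 2}], out)  # second export is refused
    overwritten = True
except FileExistsError:
    overwritten = False

print(overwritten)  # False
```

A scheduled job using this pattern would typically put a timestamp in each file name so that every run produces a new file instead of failing.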
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install data-harvester
  3. After installation, invoke the skill by name or use /data-harvester
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.1
- Updated version in skill.json from 1.0.0 to 1.0.1.
- No other changes were made to documentation or functionality.
v1.0.0
- Initial release of data-harvester.
- Automates data collection from web, APIs, databases, and files.
- Supports smart data cleaning, conversion, aggregation, and analysis.
- Exports data in JSON, CSV, Excel, SQLite, and PDF formats.
- Includes task scheduling and full Chinese language support.
- Seamlessly integrates with OpenClaw for easy installation and use.
Metadata
Slug data-harvester
Version 1.0.1
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 2
Frequently Asked Questions

What is Smart Data Harvester?

Smart Data Harvester is an automated tool for data collection, processing, and export. It supports web scraping, API calls, database queries, and file reading, and exports to JSON/CSV/Excel/PDF. It is an AI Agent Skill for Claude Code / OpenClaw, with 55 downloads so far.

How do I install Smart Data Harvester?

Run "/install data-harvester" in the OpenClaw or Claude Code chat to install it in one step; no extra setup is required.

Is Smart Data Harvester free?

Yes, Smart Data Harvester is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Smart Data Harvester support?

Smart Data Harvester is cross-platform and runs anywhere OpenClaw / Claude Code is available.

Who created Smart Data Harvester?

It is built and maintained by dxg852621787 (@dxg852621787); the current version is v1.0.1.