← 返回 Skills 市场
careytian-ai

web-data-extractor(网页数据采集器,支持 CSS/XPath 选择器、批量抓取、自动分页、数据导出(CSV/JSON)。 适用于市场调研、竞品分析、内容聚合。)

作者 careytian · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ⚠ suspicious
179
总下载
0
收藏
1
当前安装
1
版本数
在 OpenClaw 中安装
/install web-data-extractor
功能描述
网页数据采集器,支持 CSS 选择器/XPath 提取、批量抓取、自动分页、数据导出(CSV/JSON/Markdown)。
安全使用建议
This skill appears to do what it says (web scraping and export) but has a few small mismatches you should verify before installing. Ask the author for source code or a homepage so you can confirm what the helper tools (web_fetch, read, write) actually do. Specifically: 1) Confirm why config.json lists an 'exec' capability and whether the skill will run shell commands. 2) Confirm the scope of the 'read' tool (which directories/files it can access). 3) If you proceed, run it in a sandboxed environment and do not supply any credentials. 4) Ensure its crawling respects robots.txt and target terms of service. If the author cannot provide a clear explanation or source, treat the skill as higher risk and avoid granting broad read/exec access.
功能分析
Type: OpenClaw Skill Name: web-data-extractor Version: 1.0.0 The skill bundle requests the 'exec' capability in config.json, which allows for arbitrary command execution, yet this high-risk permission is not listed in the SKILL.md requirements nor is it justified by the stated purpose of web scraping. While the provided documentation (SKILL.md) describes benign data extraction tasks, the discrepancy in requested permissions represents a potential security risk or over-privileging without a clear functional necessity.
能力评估
Purpose & Capability
The name/description (web scraping and export) align with requiring a fetch tool and write capability to save CSV/JSON. However config.json lists an additional 'exec' capability that is not declared in the SKILL.md's required bins and is not explained — running arbitrary commands via exec is more powerful than the described feature set and creates a mismatch.
Instruction Scope
SKILL.md uses high-level helper calls (web_fetch, extractData, exportToCSV) and expects 'read'/'write' tooling. The doc is vague about what files the 'read' tool can access or whether the skill will read arbitrary user files or agent state. The examples do not define these helpers (this is an instruction-only skill), giving the agent broad discretion to call local read/write/exec primitives in ways that aren't explicitly limited by the instructions.
Install Mechanism
No install spec and no code files — instruction-only — so nothing is being downloaded or written to disk by the skill itself. This is the lowest install risk.
Credentials
The skill requests no environment variables or external credentials (good). But it does require 'read' and 'write' binaries which, depending on the platform, can allow access to any file the agent can read/write. Exporting data justifies write access, but read access is not clearly justified beyond potentially reading input URL lists; that could be overbroad.
Persistence & Privilege
always is false and there's no indication the skill requests persistent global privileges or modifies other skills. Autonomous invocation is allowed (platform default) but not combined with other strong red flags here.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install web-data-extractor
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /web-data-extractor 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.0
Initial release of web-data-extractor - 支持通过 CSS 选择器和 XPath 从网页提取结构化数据 - 实现自动分页、URL 批量处理与并发抓取控制 - 提供多种数据导出格式,包括 CSV、JSON 和 Markdown 表格 - 适用于市场调研、内容聚合、数据分析、舆情监控与 SEO 分析等多场景 - 推荐遵守 robots.txt 及合理控制抓取频率
元数据
Slug web-data-extractor
版本 1.0.0
许可证 MIT-0
累计安装 1
当前安装数 1
历史版本数 1
常见问题

web-data-extractor(网页数据采集器,支持 CSS/XPath 选择器、批量抓取、自动分页、数据导出(CSV/JSON)。 适用于市场调研、竞品分析、内容聚合。) 是什么?

网页数据采集器,支持 CSS 选择器/XPath 提取、批量抓取、自动分页、数据导出(CSV/JSON/Markdown)。 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 179 次。

如何安装 web-data-extractor(网页数据采集器,支持 CSS/XPath 选择器、批量抓取、自动分页、数据导出(CSV/JSON)。 适用于市场调研、竞品分析、内容聚合。)?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install web-data-extractor」即可一键安装,无需额外配置。

web-data-extractor(网页数据采集器,支持 CSS/XPath 选择器、批量抓取、自动分页、数据导出(CSV/JSON)。 适用于市场调研、竞品分析、内容聚合。) 是免费的吗?

是的,web-data-extractor(网页数据采集器,支持 CSS/XPath 选择器、批量抓取、自动分页、数据导出(CSV/JSON)。 适用于市场调研、竞品分析、内容聚合。) 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。

web-data-extractor(网页数据采集器,支持 CSS/XPath 选择器、批量抓取、自动分页、数据导出(CSV/JSON)。 适用于市场调研、竞品分析、内容聚合。) 支持哪些平台?

web-data-extractor(网页数据采集器,支持 CSS/XPath 选择器、批量抓取、自动分页、数据导出(CSV/JSON)。 适用于市场调研、竞品分析、内容聚合。) 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 web-data-extractor(网页数据采集器,支持 CSS/XPath 选择器、批量抓取、自动分页、数据导出(CSV/JSON)。 适用于市场调研、竞品分析、内容聚合。)?

由 careytian(@careytian-ai)开发并维护,当前版本 v1.0.0。

💬 留言讨论