web-data-extractor（网页数据采集器，支持 CSS/XPath 选择器、批量抓取、自动分页、数据导出（CSV/JSON）。适用于市场调研、竞品分析、内容聚合。）

Name: web-data-extractor（网页数据采集器，支持 CSS/XPath 选择器、批量抓取、自动分页、数据导出（CSV/JSON）。 适用于市场调研、竞品分析、内容聚合。）
Author: careytian-ai

by careytian · GitHub ↗ · v1.0.0 · MIT-0

cross-platform ⚠ suspicious

179

Downloads

Stars

Active Installs

Versions

Install in OpenClaw

/install web-data-extractor

Description

网页数据采集器，支持 CSS 选择器/XPath 提取、批量抓取、自动分页、数据导出（CSV/JSON/Markdown）。

Usage Guidance

This skill appears to do what it says (web scraping and export) but has a few small mismatches you should verify before installing. Ask the author for source code or a homepage so you can confirm what the helper tools (web_fetch, read, write) actually do. Specifically: 1) Confirm why config.json lists an 'exec' capability and whether the skill will run shell commands. 2) Confirm the scope of the 'read' tool (which directories/files it can access). 3) If you proceed, run it in a sandboxed environment and do not supply any credentials. 4) Ensure its crawling respects robots.txt and target terms of service. If the author cannot provide a clear explanation or source, treat the skill as higher risk and avoid granting broad read/exec access.

Capability Analysis

Type: OpenClaw Skill Name: web-data-extractor Version: 1.0.0 The skill bundle requests the 'exec' capability in config.json, which allows for arbitrary command execution, yet this high-risk permission is not listed in the SKILL.md requirements nor is it justified by the stated purpose of web scraping. While the provided documentation (SKILL.md) describes benign data extraction tasks, the discrepancy in requested permissions represents a potential security risk or over-privileging without a clear functional necessity.

Capability Assessment

ℹ Purpose & Capability

The name/description (web scraping and export) align with requiring a fetch tool and write capability to save CSV/JSON. However config.json lists an additional 'exec' capability that is not declared in the SKILL.md's required bins and is not explained — running arbitrary commands via exec is more powerful than the described feature set and creates a mismatch.

⚠ Instruction Scope

SKILL.md uses high-level helper calls (web_fetch, extractData, exportToCSV) and expects 'read'/'write' tooling. The doc is vague about what files the 'read' tool can access or whether the skill will read arbitrary user files or agent state. The examples do not define these helpers (this is an instruction-only skill), giving the agent broad discretion to call local read/write/exec primitives in ways that aren't explicitly limited by the instructions.

✓ Install Mechanism

No install spec and no code files — instruction-only — so nothing is being downloaded or written to disk by the skill itself. This is the lowest install risk.

ℹ Credentials

The skill requests no environment variables or external credentials (good). But it does require 'read' and 'write' binaries which, depending on the platform, can allow access to any file the agent can read/write. Exporting data justifies write access, but read access is not clearly justified beyond potentially reading input URL lists; that could be overbroad.

✓ Persistence & Privilege

always is false and there's no indication the skill requests persistent global privileges or modifies other skills. Autonomous invocation is allowed (platform default) but not combined with other strong red flags here.

How to Use

Make sure OpenClaw is installed (local or Docker)
Run the install command in chat: /install web-data-extractor
After installation, invoke the skill by name or use /web-data-extractor
Provide required inputs per the skill's parameter spec and get structured output

Version History

v1.0.0

Initial release of web-data-extractor - 支持通过 CSS 选择器和 XPath 从网页提取结构化数据 - 实现自动分页、URL 批量处理与并发抓取控制 - 提供多种数据导出格式，包括 CSV、JSON 和 Markdown 表格 - 适用于市场调研、内容聚合、数据分析、舆情监控与 SEO 分析等多场景 - 推荐遵守 robots.txt 及合理控制抓取频率

Metadata

Slug web-data-extractor

Version 1.0.0

License MIT-0

All-time Installs 1

Active Installs 1

Total Versions 1

Frequently Asked Questions

What is web-data-extractor（网页数据采集器，支持 CSS/XPath 选择器、批量抓取、自动分页、数据导出（CSV/JSON）。适用于市场调研、竞品分析、内容聚合。）?

网页数据采集器，支持 CSS 选择器/XPath 提取、批量抓取、自动分页、数据导出（CSV/JSON/Markdown）。 It is an AI Agent Skill for Claude Code / OpenClaw, with 179 downloads so far.

How do I install web-data-extractor（网页数据采集器，支持 CSS/XPath 选择器、批量抓取、自动分页、数据导出（CSV/JSON）。适用于市场调研、竞品分析、内容聚合。）?

Run "/install web-data-extractor" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is web-data-extractor（网页数据采集器，支持 CSS/XPath 选择器、批量抓取、自动分页、数据导出（CSV/JSON）。适用于市场调研、竞品分析、内容聚合。） free?

Yes, web-data-extractor（网页数据采集器，支持 CSS/XPath 选择器、批量抓取、自动分页、数据导出（CSV/JSON）。适用于市场调研、竞品分析、内容聚合。） is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does web-data-extractor（网页数据采集器，支持 CSS/XPath 选择器、批量抓取、自动分页、数据导出（CSV/JSON）。适用于市场调研、竞品分析、内容聚合。） support?

web-data-extractor（网页数据采集器，支持 CSS/XPath 选择器、批量抓取、自动分页、数据导出（CSV/JSON）。适用于市场调研、竞品分析、内容聚合。） is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created web-data-extractor（网页数据采集器，支持 CSS/XPath 选择器、批量抓取、自动分页、数据导出（CSV/JSON）。适用于市场调研、竞品分析、内容聚合。）?

It is built and maintained by careytian (@careytian-ai); the current version is v1.0.0.

More Skills

web-data-extractor（网页数据采集器，支持 CSS/XPath 选择器、批量抓取、自动分页、数据导出（CSV/JSON）。 适用于市场调研、竞品分析、内容聚合。）

What is web-data-extractor（网页数据采集器，支持 CSS/XPath 选择器、批量抓取、自动分页、数据导出（CSV/JSON）。 适用于市场调研、竞品分析、内容聚合。）?

How do I install web-data-extractor（网页数据采集器，支持 CSS/XPath 选择器、批量抓取、自动分页、数据导出（CSV/JSON）。 适用于市场调研、竞品分析、内容聚合。）?

Is web-data-extractor（网页数据采集器，支持 CSS/XPath 选择器、批量抓取、自动分页、数据导出（CSV/JSON）。 适用于市场调研、竞品分析、内容聚合。） free?

Which platforms does web-data-extractor（网页数据采集器，支持 CSS/XPath 选择器、批量抓取、自动分页、数据导出（CSV/JSON）。 适用于市场调研、竞品分析、内容聚合。） support?

Who created web-data-extractor（网页数据采集器，支持 CSS/XPath 选择器、批量抓取、自动分页、数据导出（CSV/JSON）。 适用于市场调研、竞品分析、内容聚合。）?

💬 Comments