← 返回 Skills 市场

Web Scraper as a Service

Name: Web Scraper as a Service
Author: seanwyngaard

作者 Sean Wyngaard · GitHub ↗ · v1.0.0

cross-platform ⚠ suspicious

3042

总下载

当前安装

版本数

在 OpenClaw 中安装

/install web-scraper-as-a-service

功能描述

Build client-ready web scrapers with clean data output. Use when creating scrapers for clients, extracting data from websites, or delivering scraping projects.

安全使用建议

This skill appears coherent and focused on building scrapers, but consider these practical precautions before using it: (1) The skill will fetch arbitrary target URLs — ensure you (and your client) are authorized to scrape the target and that scraping does not violate site terms or laws. (2) Some sites use CAPTCHAs or anti-bot protections; the guidance mentions detecting these but does not instruct bypassing them — avoid attempting evasive or unauthorized bypass techniques. (3) If you execute generated scrapers, run them in an isolated environment (container/VM) and watch network usage and stored data to avoid accidental leakage of client or system secrets. (4) The skill has no stated source or homepage — if you need long-term support or guarantees, prefer vetted libraries or an authored/published implementation. (5) If scraping requires authenticated access to a target, expect you will need to supply credentials separately; the skill does not request or manage credentials itself.

功能分析

Type: OpenClaw Skill Name: web-scraper-as-a-service Version: 1.0.0 The skill is classified as suspicious due to the broad permissions granted to the AI agent, specifically the `Bash` and `WebFetch` tools listed in `SKILL.md`. While these tools are plausibly necessary for the stated purpose of building and running web scrapers, the ability to execute arbitrary shell commands (`Bash`) and make network requests to external endpoints (`WebFetch`) without explicit constraints in the instructions presents a significant vulnerability. There is no clear evidence of intentional malicious behavior or prompt injection attempts within the provided files, but the powerful capabilities could be exploited if the agent is given a malicious target URL or an adversarial prompt.

能力评估

✓ Purpose & Capability

The name/description (web scraper service) match the SKILL.md: it enumerates scraping templates, generation of a Python scraper, data cleaning, and delivery packaging. There are no unrelated environment variables, binaries, or config paths requested that would be disproportionate to building scrapers.

✓ Instruction Scope

The runtime instructions are focused on collecting page structure, generating scraper code, running scrapes, cleaning data, and packaging results. They explicitly recommend respecting robots.txt and ToS and avoiding personal data unless authorized. The instructions do require fetching target URLs (expected for scraping) but do not instruct reading unrelated local files or exfiltrating secrets.

✓ Install Mechanism

This is an instruction-only skill with no install spec and no code files included. That minimizes code-on-disk risk; the instructions recommend standard Python packages (requests, BeautifulSoup, playwright) which is proportionate for the stated purpose.

✓ Credentials

No environment variables, credentials, or config paths are requested; the requirement set is minimal and appropriate for a scraper template/instruction skill. The skill does not ask for unrelated secrets or broad system access.

✓ Persistence & Privilege

The skill does not request always: true and is user-invocable only. It does not instruct modifying other skills or system-wide agent settings. Autonomous invocation is enabled by default but not combined with other elevated privileges.

如何使用

确保已安装 OpenClaw（本地或 Docker 部署）
在对话框中输入安装命令：/install web-scraper-as-a-service
安装完成后，直接呼叫该 Skill 的名称或使用 /web-scraper-as-a-service 触发
根据 Skill 的参数说明提供必要输入，即可获得结构化输出

版本历史

v1.0.0

Initial release of web-scraper-as-a-service. - Enables rapid generation of custom web scrapers from client briefs or URLs. - Scraper pipeline includes site analysis, tool selection, Python script generation, data cleaning, and packaging for client delivery. - Builds scrapers with best practices: rate limiting, user-agent rotation, retry/error handling, progress tracking, resume, and output in CSV/JSON. - Delivers clean, normalized data and data quality report in a ready-to-hand-off client package. - Includes template field sets for common target types (e-commerce, real estate, jobs, directories, news). - Enforces ethical scraping (robots.txt, rate limits, ToS checks).

元数据

Slug web-scraper-as-a-service

版本 1.0.0

许可证 —

累计安装 22

当前安装数 20

历史版本数 1

常见问题

Web Scraper as a Service 是什么？

Build client-ready web scrapers with clean data output. Use when creating scrapers for clients, extracting data from websites, or delivering scraping projects. 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件，目前累计下载 3042 次。

如何安装 Web Scraper as a Service？

在 OpenClaw 或 Claude Code 对话框中运行命令「/install web-scraper-as-a-service」即可一键安装，无需额外配置。

Web Scraper as a Service 是免费的吗？

是的，Web Scraper as a Service 完全免费（开源免费），可自由下载、安装和使用。

Web Scraper as a Service 支持哪些平台？

Web Scraper as a Service 跨平台运行，可在任意部署了 OpenClaw / Claude Code 的环境中使用（cross-platform）。

谁开发了 Web Scraper as a Service？

由 Sean Wyngaard（@seanwyngaard）开发并维护，当前版本 v1.0.0。