← 返回 Skills 市场
Web Scraper as a Service
作者
Sean Wyngaard
· GitHub ↗
· v1.0.0
3042
总下载
3
收藏
20
当前安装
1
版本数
在 OpenClaw 中安装
/install web-scraper-as-a-service
功能描述
Build client-ready web scrapers with clean data output. Use when creating scrapers for clients, extracting data from websites, or delivering scraping projects.
安全使用建议
This skill appears coherent and focused on building scrapers, but consider these practical precautions before using it: (1) The skill will fetch arbitrary target URLs — ensure you (and your client) are authorized to scrape the target and that scraping does not violate site terms or laws. (2) Some sites use CAPTCHAs or anti-bot protections; the guidance mentions detecting these but does not instruct bypassing them — avoid attempting evasive or unauthorized bypass techniques. (3) If you execute generated scrapers, run them in an isolated environment (container/VM) and watch network usage and stored data to avoid accidental leakage of client or system secrets. (4) The skill has no stated source or homepage — if you need long-term support or guarantees, prefer vetted libraries or an authored/published implementation. (5) If scraping requires authenticated access to a target, expect you will need to supply credentials separately; the skill does not request or manage credentials itself.
功能分析
Type: OpenClaw Skill
Name: web-scraper-as-a-service
Version: 1.0.0
The skill is classified as suspicious due to the broad permissions granted to the AI agent, specifically the `Bash` and `WebFetch` tools listed in `SKILL.md`. While these tools are plausibly necessary for the stated purpose of building and running web scrapers, the ability to execute arbitrary shell commands (`Bash`) and make network requests to external endpoints (`WebFetch`) without explicit constraints in the instructions presents a significant vulnerability. There is no clear evidence of intentional malicious behavior or prompt injection attempts within the provided files, but the powerful capabilities could be exploited if the agent is given a malicious target URL or an adversarial prompt.
能力评估
Purpose & Capability
The name/description (web scraper service) match the SKILL.md: it enumerates scraping templates, generation of a Python scraper, data cleaning, and delivery packaging. There are no unrelated environment variables, binaries, or config paths requested that would be disproportionate to building scrapers.
Instruction Scope
The runtime instructions are focused on collecting page structure, generating scraper code, running scrapes, cleaning data, and packaging results. They explicitly recommend respecting robots.txt and ToS and avoiding personal data unless authorized. The instructions do require fetching target URLs (expected for scraping) but do not instruct reading unrelated local files or exfiltrating secrets.
Install Mechanism
This is an instruction-only skill with no install spec and no code files included. That minimizes code-on-disk risk; the instructions recommend standard Python packages (requests, BeautifulSoup, playwright) which is proportionate for the stated purpose.
Credentials
No environment variables, credentials, or config paths are requested; the requirement set is minimal and appropriate for a scraper template/instruction skill. The skill does not ask for unrelated secrets or broad system access.
Persistence & Privilege
The skill does not request always: true and is user-invocable only. It does not instruct modifying other skills or system-wide agent settings. Autonomous invocation is enabled by default but not combined with other elevated privileges.
如何使用
- 确保已安装 OpenClaw(本地或 Docker 部署)
- 在对话框中输入安装命令:
/install web-scraper-as-a-service - 安装完成后,直接呼叫该 Skill 的名称或使用
/web-scraper-as-a-service触发 - 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.0
Initial release of web-scraper-as-a-service.
- Enables rapid generation of custom web scrapers from client briefs or URLs.
- Scraper pipeline includes site analysis, tool selection, Python script generation, data cleaning, and packaging for client delivery.
- Builds scrapers with best practices: rate limiting, user-agent rotation, retry/error handling, progress tracking, resume, and output in CSV/JSON.
- Delivers clean, normalized data and data quality report in a ready-to-hand-off client package.
- Includes template field sets for common target types (e-commerce, real estate, jobs, directories, news).
- Enforces ethical scraping (robots.txt, rate limits, ToS checks).
元数据
常见问题
Web Scraper as a Service 是什么?
Build client-ready web scrapers with clean data output. Use when creating scrapers for clients, extracting data from websites, or delivering scraping projects. 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 3042 次。
如何安装 Web Scraper as a Service?
在 OpenClaw 或 Claude Code 对话框中运行命令「/install web-scraper-as-a-service」即可一键安装,无需额外配置。
Web Scraper as a Service 是免费的吗?
是的,Web Scraper as a Service 完全免费(开源免费),可自由下载、安装和使用。
Web Scraper as a Service 支持哪些平台?
Web Scraper as a Service 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。
谁开发了 Web Scraper as a Service?
由 Sean Wyngaard(@seanwyngaard)开发并维护,当前版本 v1.0.0。
推荐 Skills