← 返回 Skills 市场
Web Scraper
作者
rupertnt034
· GitHub ↗
· v1.0.0
· MIT-0
838
总下载
1
收藏
2
当前安装
1
版本数
在 OpenClaw 中安装
/install rupert-web-scraper
功能描述
Extract and export structured web data like text, tables, and images in JSON, CSV, Markdown, or SQL formats while respecting ethical and legal scraping pract...
安全使用建议
This skill is an instruction-only web-scraper and appears coherent and low-risk from the manifest (no installs, no credentials). Before using it, consider: 1) legal/ethical risk — ensure you have rights to scrape a target and respect robots.txt and TOS; 2) scope control — restrict crawling to specific allowlists and set conservative depth/rate limits to avoid accidental broad crawls or DOS; 3) sensitive targets — do not allow the agent to crawl internal/private URLs or sites requiring authentication unless you trust the outcome; 4) data exfiltration risk — although the skill does not request credentials, scraped data could still be transmitted elsewhere by the agent/platform, so review platform network/output policies; and 5) testing — try on a small, public test site first. If you want stronger guarantees, require explicit user confirmation for any multi-page crawl and add an allowlist/denylist enforcement step in the SKILL.md or platform policy.
功能分析
Type: OpenClaw Skill
Name: rupert-web-scraper
Version: 1.0.0
The skill bundle contains only metadata and documentation for a web scraping tool. The instructions in SKILL.md describe standard data extraction capabilities and emphasize ethical practices such as respecting robots.txt and implementing rate limiting. No executable code or malicious instructions were found.
能力评估
Purpose & Capability
Name and description match the SKILL.md. Requested capabilities (data extraction, formats, rate-limiting, proxy support, robots.txt) are appropriate for a web scraper and there are no unrelated environment variables, binaries, or install steps.
Instruction Scope
The SKILL.md is high-level and describes typical scraping commands and best practices (respect robots.txt, rate limits, UA rotation). It does not instruct reading local files or environment variables, nor does it point to unexpected external endpoints. However, it is intentionally permissive (e.g., 'crawl [website] depth [n]') and lacks enforcement of safeguards—so an agent using this skill could be directed to crawl wide ranges of sites unless additional runtime limits or confirmations are imposed.
Install Mechanism
No install spec and no code files are present, so nothing is written to disk and there is no third-party package download risk.
Credentials
No environment variables, credentials, or config paths are requested. The lack of declared secrets is proportional to an instruction-only scraper.
Persistence & Privilege
always is false and the skill does not request persistent or elevated privileges. It does not attempt to modify other skills or system-wide settings.
如何使用
- 确保已安装 OpenClaw(本地或 Docker 部署)
- 在对话框中输入安装命令:
/install rupert-web-scraper - 安装完成后,直接呼叫该 Skill 的名称或使用
/rupert-web-scraper触发 - 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.0
Initial release of the Web Scraper Skill.
- Extracts text, tables, and media from websites.
- Supports output to JSON, CSV, Markdown, and SQL.
- Features rate limiting, caching, retries, proxy support, and robust error handling.
- Adheres to ethical scraping standards, including robots.txt and legal compliance.
- Includes simple commands for scraping and exporting data in various formats.
元数据
常见问题
Web Scraper 是什么?
Extract and export structured web data like text, tables, and images in JSON, CSV, Markdown, or SQL formats while respecting ethical and legal scraping pract... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 838 次。
如何安装 Web Scraper?
在 OpenClaw 或 Claude Code 对话框中运行命令「/install rupert-web-scraper」即可一键安装,无需额外配置。
Web Scraper 是免费的吗?
是的,Web Scraper 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。
Web Scraper 支持哪些平台?
Web Scraper 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。
谁开发了 Web Scraper?
由 rupertnt034(@rupertnt034)开发并维护,当前版本 v1.0.0。
推荐 Skills