← 返回 Skills 市场
332
总下载
0
收藏
1
当前安装
1
版本数
在 OpenClaw 中安装
/install yinan-web-scraper
功能描述
Extract structured data from websites using browser automation. Use when scraping product listings, articles, contact info, prices, or any web content. Suppo...
安全使用建议
This package looks like a normal web-scraper but has important gaps you should resolve before installing or running it: (1) The Python scripts call a local binary 'agent-browser' but the skill metadata does not declare this dependency — ask the author which binary/executable is required and ensure you trust it. (2) SKILL.md references additional scripts (infinite_scroll, dynamic) that are not included; confirm whether those features exist or are intentionally omitted. (3) Because the scripts invoke an external browser automation binary via subprocess, that binary will carry network and execution privileges; verify its provenance and inspect it for unwanted behavior. (4) If you plan to follow SKILL.md guidance to use proxies or CAPTCHA-solving services, expect to supply credentials/configuration not declared here — only provide such secrets to trusted code and services. If these points are not clarified by the skill author, treat the skill as incomplete/untrusted and avoid running it on sensitive systems or with privileged credentials.
功能分析
Type: OpenClaw Skill
Name: yinan-web-scraper
Version: 1.0.0
The skill bundle provides a functional web scraping toolset using browser automation. The Python scripts (scrape_page.py, scrape_paginated.py) safely invoke the 'agent-browser' utility via subprocess.run using list-based arguments to prevent shell injection, and the documentation (SKILL.md) contains standard instructions for data extraction without any evidence of prompt injection or malicious intent.
能力评估
Purpose & Capability
The skill claims to use browser automation (agent-browser) but the registry metadata lists no required binaries; the Python scripts call a local binary named 'agent-browser' via subprocess.run, which is not declared. SKILL.md also documents additional scripts (scrape_infinite_scroll.py, scrape_dynamic.py) that are referenced but not included in the file manifest. These mismatches mean the bundle is incomplete or undeclared dependencies exist.
Instruction Scope
The SKILL.md and included scripts remain focused on scraping tasks, but SKILL.md instructs using additional scripts and features (infinite scroll, dynamic interaction) that are not provided. The code executes a local binary ('agent-browser') to perform page actions; that binary will have significant control but is outside the skill bundle. The instructions also advise techniques (proxies, CAPTCHA services) that would require external services or credentials not declared here.
Install Mechanism
There is no install spec (instruction-only), which minimizes automatic installation risk. However, the presence of runnable scripts means the agent (or user) will execute local Python files that call an external binary. Because no install step fetches code, nothing is auto-downloaded by the skill, but the skill depends on external tooling that is not described.
Credentials
The skill declares no required environment variables or credentials, which is consistent with the provided scripts that only save data locally. However, SKILL.md recommends using proxies/CAPTCHA services and rotating IPs — policies that normally require credentials or configuration but none are declared. The missing declaration of the 'agent-browser' binary is the primary proportionality issue.
Persistence & Privilege
The skill does not request 'always: true' and does not declare persistence or modifications to other skills. It can be invoked by the agent autonomously (default), which is expected behavior for skills; no extra privileges are requested.
如何使用
- 确保已安装 OpenClaw(本地或 Docker 部署)
- 在对话框中输入安装命令:
/install yinan-web-scraper - 安装完成后,直接呼叫该 Skill 的名称或使用
/yinan-web-scraper触发 - 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.0
Initial release of web-scraper.
- Extracts structured data from websites using browser automation.
- Supports scraping of single pages, paginated lists, infinite scroll, and dynamic (JavaScript-heavy) content.
- Flexible field selection with CSS selectors; handles complex sites.
- Outputs data to CSV, JSON, or Excel (XLSX) formats.
- Includes ready-to-use scripts and documentation for common use cases.
- Provides guidance on anti-scraping measures, legal considerations, and troubleshooting.
元数据
常见问题
Web Scraper 是什么?
Extract structured data from websites using browser automation. Use when scraping product listings, articles, contact info, prices, or any web content. Suppo... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 332 次。
如何安装 Web Scraper?
在 OpenClaw 或 Claude Code 对话框中运行命令「/install yinan-web-scraper」即可一键安装,无需额外配置。
Web Scraper 是免费的吗?
是的,Web Scraper 完全免费(开源免费),可自由下载、安装和使用。
Web Scraper 支持哪些平台?
Web Scraper 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。
谁开发了 Web Scraper?
由 Yinanping(@yinanping-cpu)开发并维护,当前版本 v1.0.0。
推荐 Skills