
Web Scraper

Name: Web Scraper
Author: yinanping-cpu

By Yinanping · GitHub · v1.0.0

cross-platform ⚠ suspicious

Total downloads: 332

Current installs: 1

Versions: 1

Install in OpenClaw

/install yinan-web-scraper

Description

Extract structured data from websites using browser automation. Use when scraping product listings, articles, contact info, prices, or any web content. Suppo...

Security Recommendations

This package looks like a normal web scraper but has important gaps you should resolve before installing or running it:

(1) The Python scripts call a local binary 'agent-browser', but the skill metadata does not declare this dependency. Ask the author which binary/executable is required and ensure you trust it.
(2) SKILL.md references additional scripts (infinite_scroll, dynamic) that are not included. Confirm whether those features exist or are intentionally omitted.
(3) Because the scripts invoke an external browser automation binary via subprocess, that binary will carry network and execution privileges. Verify its provenance and inspect it for unwanted behavior.
(4) If you plan to follow SKILL.md guidance to use proxies or CAPTCHA-solving services, expect to supply credentials/configuration not declared here. Only provide such secrets to trusted code and services.

If these points are not clarified by the skill author, treat the skill as incomplete/untrusted and avoid running it on sensitive systems or with privileged credentials.

Functional Analysis

Type: OpenClaw Skill
Name: yinan-web-scraper
Version: 1.0.0

The skill bundle provides a functional web scraping toolset using browser automation. The Python scripts (scrape_page.py, scrape_paginated.py) safely invoke the 'agent-browser' utility via subprocess.run using list-based arguments to prevent shell injection, and the documentation (SKILL.md) contains standard instructions for data extraction without any evidence of prompt injection or malicious intent.
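The list-based invocation mentioned above can be illustrated with a minimal sketch. The command-line interface of 'agent-browser' is not documented in this listing, so the example uses the Python interpreter as a stand-in executable that simply echoes its arguments back; run_tool and demo are hypothetical names, not functions from the skill.

```python
import subprocess
import sys


def run_tool(executable, args):
    """Run a command with list-based arguments; no shell is involved."""
    result = subprocess.run(
        [executable, *args],   # each list item is passed as one argv entry
        capture_output=True,
        text=True,
        check=True,            # raise CalledProcessError on a non-zero exit
    )
    return result.stdout


# Stand-in for the real binary (assumption): a one-liner that prints its arguments.
ECHO_ARGS = ["-c", "import sys; print(sys.argv[1:])"]


def demo(url):
    """Show that shell metacharacters in `url` arrive as plain text."""
    return run_tool(sys.executable, [*ECHO_ARGS, url]).strip()


if __name__ == "__main__":
    # The "; echo injected" part is never interpreted by a shell.
    print(demo("https://example.com/?q=1; echo injected"))
```

Because no shell parses the command line, the whole URL string reaches the child process as a single argument. This protects against shell injection only; it says nothing about what the invoked binary itself does with that argument.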

Capability Assessment

⚠ Purpose & Capability

The skill claims to use browser automation (agent-browser) but the registry metadata lists no required binaries; the Python scripts call a local binary named 'agent-browser' via subprocess.run, which is not declared. SKILL.md also documents additional scripts (scrape_infinite_scroll.py, scrape_dynamic.py) that are referenced but not included in the file manifest. These mismatches mean the bundle is incomplete or undeclared dependencies exist.
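One way to check these mismatches before running anything is a small pre-flight script. The sketch below is an assumption about how such a check could look, not part of the skill: find_missing is a hypothetical helper, the binary name and script names are taken from the assessment above, and the layout of the bundle on disk is unknown, so the search is recursive.

```python
import shutil
from pathlib import Path

# Names taken from the assessment above; the bundle layout is an assumption.
REQUIRED_BINARIES = ["agent-browser"]
REFERENCED_SCRIPTS = [
    "scrape_page.py",
    "scrape_paginated.py",
    "scrape_infinite_scroll.py",
    "scrape_dynamic.py",
]


def find_missing(skill_dir, binaries=REQUIRED_BINARIES, scripts=REFERENCED_SCRIPTS):
    """Report binaries not on PATH and scripts not found under skill_dir."""
    root = Path(skill_dir)
    # shutil.which returns None when the executable is not found on PATH.
    missing_bins = [b for b in binaries if shutil.which(b) is None]
    # rglob searches subdirectories too, since the script location is unknown.
    missing_scripts = [s for s in scripts if not any(root.rglob(s))]
    return {"binaries": missing_bins, "scripts": missing_scripts}


if __name__ == "__main__":
    report = find_missing(".")
    print("Missing binaries:", report["binaries"])
    print("Missing scripts:", report["scripts"])
```

Finding the binary on PATH only shows that something with that name is installed. It does not establish where the binary came from or whether it can be trusted.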

⚠ Instruction Scope

The SKILL.md and included scripts remain focused on scraping tasks, but SKILL.md instructs using additional scripts and features (infinite scroll, dynamic interaction) that are not provided. The code executes a local binary ('agent-browser') to perform page actions; that binary will have significant control but is outside the skill bundle. The instructions also advise techniques (proxies, CAPTCHA services) that would require external services or credentials not declared here.

ℹ Install Mechanism

There is no install spec (instruction-only), which minimizes automatic installation risk. However, the presence of runnable scripts means the agent (or user) will execute local Python files that call an external binary. Because no install step fetches code, nothing is auto-downloaded by the skill, but the skill depends on external tooling that is not described.

ℹ Credentials

The skill declares no required environment variables or credentials, which is consistent with the provided scripts that only save data locally. However, SKILL.md recommends using proxies/CAPTCHA services and rotating IPs, practices that normally require credentials or configuration, yet none are declared. The missing declaration of the 'agent-browser' binary is the primary proportionality issue.
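If proxy settings are supplied at all, keeping them out of the scripts and reading them from the environment limits where the secret lives. The following is a minimal sketch under stated assumptions: the variable name SCRAPER_PROXY_URL and the helper load_proxy are hypothetical and are not declared or read by the skill.

```python
import os
from urllib.parse import urlsplit


def load_proxy(env=None):
    """Read an optional proxy URL from the environment (hypothetical variable)."""
    env = os.environ if env is None else env
    raw = env.get("SCRAPER_PROXY_URL")  # hypothetical name, not used by the skill
    if not raw:
        return None  # no proxy configured
    parts = urlsplit(raw)
    if parts.scheme not in ("http", "https", "socks5") or not parts.hostname:
        raise ValueError("SCRAPER_PROXY_URL must look like scheme://[user:pass@]host:port")
    # Build a credential-free form that is safe to print or log.
    display = f"{parts.scheme}://{parts.hostname}"
    if parts.port:
        display += f":{parts.port}"
    return {"url": raw, "display": display}


if __name__ == "__main__":
    proxy = load_proxy()
    print("Proxy:", proxy["display"] if proxy else "none configured")
```

Only the display form should appear in logs or output; the full URL, which may contain a username and password, stays in memory and in the environment.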

✓ Persistence & Privilege

The skill does not request 'always: true' and does not declare persistence or modifications to other skills. It can be invoked by the agent autonomously (default), which is expected behavior for skills; no extra privileges are requested.

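How to Use

Make sure OpenClaw is installed (locally or via Docker).
Enter the install command in the chat box: /install yinan-web-scraper
Once installation completes, call the Skill by name or trigger it with /yinan-web-scraper.
Provide the required inputs according to the Skill's parameter description to get structured output.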

Version History

v1.0.0

Initial release of web-scraper.
- Extracts structured data from websites using browser automation.
- Supports scraping of single pages, paginated lists, infinite scroll, and dynamic (JavaScript-heavy) content.
- Flexible field selection with CSS selectors; handles complex sites.
- Outputs data to CSV, JSON, or Excel (XLSX) formats.
- Includes ready-to-use scripts and documentation for common use cases.
- Provides guidance on anti-scraping measures, legal considerations, and troubleshooting.
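To make the output formats concrete, here is a minimal sketch of writing scraped records to CSV or JSON with the Python standard library. It is an illustration only and is not taken from the skill's scripts: to_csv_text, to_json_text, save_records, and the sample records are hypothetical. XLSX is left out because it needs a third-party library.

```python
import csv
import io
import json
from pathlib import Path


def to_csv_text(records):
    """Serialize a list of dicts to CSV text, using the union of all keys as columns."""
    fieldnames = list(dict.fromkeys(key for rec in records for key in rec))
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=fieldnames)  # missing keys become ""
    writer.writeheader()
    writer.writerows(records)
    return buffer.getvalue()


def to_json_text(records):
    """Serialize a list of dicts to indented JSON, keeping non-ASCII text readable."""
    return json.dumps(records, ensure_ascii=False, indent=2)


def save_records(records, path):
    """Write records to `path`, choosing the format from the file extension."""
    suffix = Path(path).suffix.lower()
    if suffix == ".csv":
        # newline="" stops Python from translating the csv module's line endings.
        Path(path).write_text(to_csv_text(records), encoding="utf-8", newline="")
    elif suffix == ".json":
        Path(path).write_text(to_json_text(records), encoding="utf-8")
    else:
        raise ValueError(f"Unsupported output format: {suffix!r}")


if __name__ == "__main__":
    sample = [  # made-up records for illustration
        {"name": "Widget", "price": "9.99"},
        {"name": "Gadget", "url": "https://example.com/g"},
    ]
    print(to_csv_text(sample))
```

Taking the column set from the union of all record keys keeps rows aligned when some pages lack a field, which is common in scraped data.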

Metadata

Slug: yinan-web-scraper

Version: 1.0.0

License: (none listed)

Total installs: 1

Current installs: 1

Historical versions: 1

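FAQ

What is Web Scraper?

Extract structured data from websites using browser automation. Use when scraping product listings, articles, contact info, prices, or any web content. Suppo... It is an AI Agent Skill plugin for Claude Code / OpenClaw, with 332 total downloads to date.

How do I install Web Scraper?

Run the command "/install yinan-web-scraper" in the OpenClaw or Claude Code chat box to install it in one step; no additional configuration is required.

Is Web Scraper free?

Yes. Web Scraper is completely free (free and open source) and can be freely downloaded, installed, and used.

Which platforms does Web Scraper support?

Web Scraper is cross-platform and can be used in any environment where OpenClaw / Claude Code is deployed.

Who developed Web Scraper?

It is developed and maintained by Yinanping (@yinanping-cpu); the current version is v1.0.0.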