agistack

Scraper

Author: AGIstack · GitHub · v1.0.0 · MIT-0
cross-platform ✓ Security check passed
Total downloads: 738
Favorites: 0
Current installs: 9
Versions: 1
Install in OpenClaw
/install scraper
Description
Structured extraction and cleanup for public, user-authorized web pages. Use when the user wants to collect, clean, summarize, or transform content from acce...
Security recommendations
This skill appears to do what it says: fetch public pages, extract text, and save results locally. Before installing or enabling it for autonomous use, consider: (1) the scripts will fetch any URL you or the agent give them — add URL validation or an allowlist if you need to block internal/IP ranges (SSRF risk); (2) there is no enforcement of 'public/user-authorized' rules — rely on agent policies or operator oversight to prevent misuse (paywall/login bypass, private endpoints); (3) outputs are stored at ~/.openclaw/workspace/memory/scraper — check and clean that directory if sensitive data might be saved. If you only plan manual, user-initiated runs and trust the callers, the skill is coherent and appropriate.
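The URL validation suggested in point (1) could look roughly like the sketch below. It is not part of the skill: `validate_url`, `resolve_host`, `is_blocked_ip`, and the `allowlist` parameter are hypothetical names chosen for illustration, and a check like this does not protect against DNS rebinding between validation and fetch.

```python
import ipaddress
import socket
from urllib.parse import urlsplit

ALLOWED_SCHEMES = {"http", "https"}


def is_blocked_ip(ip_text):
    """Return True for addresses that should not be reachable from a scraper."""
    # Drop an IPv6 zone identifier such as "%eth0" before parsing.
    ip = ipaddress.ip_address(ip_text.split("%")[0])
    return (
        ip.is_private
        or ip.is_loopback
        or ip.is_link_local
        or ip.is_reserved
        or ip.is_multicast
        or ip.is_unspecified
    )


def resolve_host(host):
    """Resolve a hostname to the set of IP addresses it currently maps to."""
    infos = socket.getaddrinfo(host, None)
    return sorted({info[4][0] for info in infos})


def validate_url(url, allowlist=None, resolver=resolve_host):
    """Raise ValueError if the URL points at a disallowed scheme, host, or IP.

    allowlist (hypothetical): optional set of lowercase hostnames; when given,
    any other host is rejected.
    """
    parts = urlsplit(url)
    if parts.scheme not in ALLOWED_SCHEMES:
        raise ValueError(f"scheme not allowed: {parts.scheme!r}")
    host = parts.hostname
    if not host:
        raise ValueError("URL has no host")
    if allowlist is not None and host not in allowlist:
        raise ValueError(f"host not in allowlist: {host}")
    try:
        # The host may already be a literal IP address.
        addresses = [str(ipaddress.ip_address(host))]
    except ValueError:
        addresses = resolver(host)
    for address in addresses:
        if is_blocked_ip(address):
            raise ValueError(f"{host} resolves to blocked address {address}")
    return url
```

Calling `validate_url(url)` before handing a URL to the fetch script would reject `http://127.0.0.1/`, `http://169.254.169.254/`, and `file:///etc/passwd`, while letting ordinary public hosts through.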
Functional analysis
Type: OpenClaw Skill · Name: scraper · Version: 1.0.0
The scraper skill is a standard utility for fetching and cleaning public web content, with all operations restricted to local storage (~/.openclaw/workspace/). The Python scripts (fetch_page.py, extract_text.py, save_output.py) use the standard urllib library and basic regex for HTML stripping, and the SKILL.md instructions include explicit safety boundaries against bypassing access controls or collecting credentials.
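To illustrate what "basic regex for HTML stripping" usually amounts to, here is a minimal sketch. It is an assumption about the general technique, not the contents of extract_text.py; the function name `strip_html` is hypothetical. Regex stripping is approximate and will mishandle malformed markup, which is acceptable for rough text cleanup but not for security filtering.

```python
import html
import re

# Remove <script> and <style> elements together with their contents.
_DROP_BLOCKS = re.compile(
    r"<(script|style)\b[^>]*>.*?</\1\s*>", re.IGNORECASE | re.DOTALL
)
_COMMENTS = re.compile(r"<!--.*?-->", re.DOTALL)
_TAGS = re.compile(r"<[^>]+>")
_WHITESPACE = re.compile(r"\s+")


def strip_html(markup):
    """Return the visible text of an HTML string, with whitespace collapsed."""
    text = _DROP_BLOCKS.sub(" ", markup)
    text = _COMMENTS.sub(" ", text)
    text = _TAGS.sub(" ", text)
    text = html.unescape(text)  # turn entities such as &amp; into characters
    return _WHITESPACE.sub(" ", text).strip()
```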
Capability assessment
Purpose & Capability
Name/description match the included scripts: fetching pages, extracting text, saving outputs locally. No unrelated credentials, binaries, or installs are requested.
Instruction Scope
SKILL.md and scripts restrict work to public/user-authorized pages and local-only storage. However, there is no runtime enforcement of those rules: the scripts will fetch any URL provided (including internal IPs/localhost), and there is no robots/paywall/captcha checking, rate limiting, or URL validation. That is expected for a small helper but is a security consideration rather than an incoherence.
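Two of the missing safeguards, robots.txt checking and rate limiting, can be sketched with the standard library alone. The names `robots_allows` and `MinIntervalLimiter` are hypothetical and not part of the skill; the robots.txt body is passed in as text so the check itself needs no network access.

```python
import time
from urllib.robotparser import RobotFileParser


def robots_allows(robots_txt, url, user_agent="*"):
    """Check a URL against robots.txt rules supplied as a string."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


class MinIntervalLimiter:
    """Enforce a minimum delay, in seconds, between consecutive requests."""

    def __init__(self, min_interval, clock=time.monotonic, sleep=time.sleep):
        self._min_interval = min_interval
        self._clock = clock
        self._sleep = sleep
        self._last = None  # time of the previous request, if any

    def wait(self):
        """Block until at least min_interval has passed since the last call."""
        now = self._clock()
        if self._last is not None:
            remaining = self._min_interval - (now - self._last)
            if remaining > 0:
                self._sleep(remaining)
                now = self._clock()
        self._last = now
```

A caller would fetch the site's robots.txt once, call `robots_allows(...)` for each target URL, and call `limiter.wait()` immediately before each request.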
Install Mechanism
No install spec and no remote downloads; the skill is instruction-only with bundled Python scripts, which minimizes install risk.
Credentials
The skill requires no environment variables or credentials and only writes under ~/.openclaw/workspace/memory/scraper, consistent with the declared purpose.
Persistence & Privilege
The skill is not always-enabled and can be invoked by the user. It does create persistent local state (jobs.json and output files) under the user's home — this is coherent but users should be aware of stored files and cleanup policy.
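Since the skill defines no cleanup policy, an operator could prune old outputs with a small script like the following sketch. The directory path comes from the review above; the function name `prune_outputs`, the age threshold, and the choice to keep jobs.json are assumptions made for illustration.

```python
import time
from pathlib import Path

# Storage location named in the review above.
DEFAULT_ROOT = Path.home() / ".openclaw" / "workspace" / "memory" / "scraper"


def prune_outputs(root=DEFAULT_ROOT, max_age_days=30, dry_run=True, now=None):
    """List (and optionally delete) output files older than max_age_days.

    jobs.json is skipped on the assumption that it is the job index and
    should outlive individual outputs.
    """
    root = Path(root)
    if not root.is_dir():
        return []
    now = time.time() if now is None else now
    cutoff = now - max_age_days * 86400
    stale = sorted(
        path
        for path in root.rglob("*")
        if path.is_file()
        and path.name != "jobs.json"
        and path.stat().st_mtime < cutoff
    )
    if not dry_run:
        for path in stale:
            path.unlink()
    return stale
```

Running it with the default `dry_run=True` only reports what would be removed, which makes it easy to review the list before deleting anything.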
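How to use
  1. Make sure OpenClaw is installed (local or Docker deployment).
  2. Enter the install command in the chat box: /install scraper
  3. Once installed, invoke the Skill by name or trigger it with /scraper.
  4. Provide the inputs described in the Skill's parameter notes to get structured output.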
Version history
v1.0.0
Local-first extraction for public, user-authorized pages. Added page fetch, text extraction, local output saving, and job tracking.
Metadata
Slug: scraper
Version: 1.0.0
License: MIT-0
Total installs: 9
Current installs: 9
Historical versions: 1
FAQ

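What is Scraper?

Structured extraction and cleanup for public, user-authorized web pages. Use when the user wants to collect, clean, summarize, or transform content from acce... It is an AI Agent Skill plugin for Claude Code / OpenClaw, with 738 total downloads to date.

How do I install Scraper?

Run the command "/install scraper" in the OpenClaw or Claude Code chat box to install it in one step. No additional configuration is needed.

Is Scraper free?

Yes. Scraper is completely free and released under the MIT-0 license; you can download, install, and use it freely.

Which platforms does Scraper support?

Scraper is cross-platform and can be used in any environment where OpenClaw / Claude Code is deployed.

Who develops Scraper?

Scraper is developed and maintained by AGIstack (@agistack). The current version is v1.0.0.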
