← 返回 Skills 市场
theshadowrose

WebClip Save & Summarize Web Pages

作者 Shadow Rose · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ✓ 安全检测通过
522
总下载
0
收藏
3
当前安装
1
版本数
在 OpenClaw 中安装
/install web-clip
功能描述
Fetch web pages, strip to clean readable text, summarize into agent-ready markdown. Research assistant foundation. No browser required.
使用说明 (SKILL.md)

WebClip Save & Summarize Web Pages

Fetch web pages, strip to clean readable text, summarize into agent-ready markdown. Research assistant foundation. No browser required.


Fetch any web page, strip the junk, extract clean readable text, and optionally summarize it. Perfect for research tasks.

Usage

const { WebClip } = require('./src/web-clip');
const clip = new WebClip();

// Fetch and clean
const page = await clip.fetch('https://example.com/article');
console.log(page.title);
console.log(page.text);      // Clean text, no HTML
console.log(page.markdown);  // Formatted markdown

// Fetch and summarize
const summary = await clip.summarize('https://example.com/article', {
  maxLength: 200,
  model: 'llama3.1:8b'
});

Features

  • HTML stripping — removes scripts, styles, nav, ads, footers
  • Readability extraction — finds main content automatically
  • Markdown conversion — preserves headings, lists, links, code blocks
  • Batch fetching — multiple URLs in parallel
  • Caching — don't re-fetch pages you've already clipped
  • Offline archive — save pages as local markdown files

Output Formats

Format Use Case
.text Raw clean text for agent context
.markdown Formatted for reading or storage
.summary Condensed version (requires model)
.metadata Title, author, date, word count

Zero Dependencies

Uses only Node.js built-in https module. No Puppeteer, no headless browser.

⚠️ Disclaimer

This software is provided "AS IS", without warranty of any kind, express or implied.

USE AT YOUR OWN RISK.

  • The author(s) are NOT liable for any damages, losses, or consequences arising from the use or misuse of this software — including but not limited to financial loss, data loss, security breaches, business interruption, or any indirect/consequential damages.
  • This software does NOT constitute financial, legal, trading, or professional advice.
  • Users are solely responsible for evaluating whether this software is suitable for their use case, environment, and risk tolerance.
  • No guarantee is made regarding accuracy, reliability, completeness, or fitness for any particular purpose.
  • The author(s) are not responsible for how third parties use, modify, or distribute this software after purchase.

By downloading, installing, or using this software, you acknowledge that you have read this disclaimer and agree to use the software entirely at your own risk.

DATA DISCLAIMER: This software processes and stores data locally on your system. The author(s) are not responsible for data loss, corruption, or unauthorized access resulting from software bugs, system failures, or user error. Always maintain independent backups of important data. This software does not transmit data externally unless explicitly configured by the user.


Support & Links

🐛 Bug Reports [email protected]
Ko-fi ko-fi.com/theshadowrose
🛒 Gumroad shadowyrose.gumroad.com
🐦 Twitter @TheShadowyRose
🐙 GitHub github.com/TheShadowRose
🧠 PromptBase promptbase.com/profile/shadowrose

Built with OpenClaw — thank you for making this possible.


🛠️ Need something custom? Custom OpenClaw agents & skills starting at $500. If you can describe it, I can build it. → Hire me on Fiverr

安全使用建议
This skill appears coherent and does what it claims: fetch pages, remove junk, produce markdown, and save locally. Before installing or enabling it: 1) Review and (if needed) run the code in a sandboxed environment since it performs network fetches and writes files locally. 2) Note the advertised 'caching' behavior isn't implemented (fetch() always downloads); if you rely on caching, modify the code to check cacheDir. 3) save(filename) accepts a caller-supplied filename — consider restricting or sanitizing filenames to avoid path traversal (the code sanitizes generated slugs but will join any provided filename to cacheDir). 4) The fetcher blocks many internal IP ranges, limits redirects, and caps response size, which reduces SSRF/internal network risk, but you should still not expose this skill to untrusted agents or inputs. If you need stronger guarantees, run it in an isolated container, set cacheDir to a safe path, and add explicit filename validation and a real cache lookup.
功能分析
Type: OpenClaw Skill Name: web-clip Version: 1.0.0 The WebClip skill is a utility designed to fetch web pages and convert them into clean markdown for AI agent consumption. The implementation in `src/web-clip.js` includes proactive security measures, such as an SSRF blocklist to prevent access to internal network resources and limits on response size and redirects. While there is a minor discrepancy where the `summarize` method mentioned in the documentation is missing from the source code, the overall package is well-structured, lacks suspicious dependencies, and shows no signs of malicious intent or data exfiltration.
能力评估
Purpose & Capability
Overall the code matches the described purpose (fetch, clean, convert, batch, save). Minor mismatch: SKILL.md/README advertise a caching feature (“Caching — don't re-fetch pages you've already clipped”), but the implementation creates a cache directory and a save() method without implementing a read/cache lookup in fetch(); so 'caching' is not actually performed before fetching.
Instruction Scope
Runtime instructions do exactly what is expected: fetch arbitrary URLs, strip HTML, produce markdown, and optionally save files locally. The code explicitly blocks internal/metadata IP address ranges and limits response size and redirects. It writes files to a local cacheDir (default './web-cache'), which is expected behavior for an offline archive feature.
Install Mechanism
No install spec and the code uses only Node built-ins (https/http/fs/path). No remote downloads or third-party packages are introduced, so installation risk is low.
Credentials
No environment variables, credentials, or external service tokens are requested. The skill's filesystem writes (cache/archive) are proportionate to its stated functionality.
Persistence & Privilege
always:false and the skill does not request persistent platform privileges or modify other skills. It can be invoked autonomously (default), which is normal — no additional privileged behavior observed.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install web-clip
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /web-clip 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.0
Initial upload
元数据
Slug web-clip
版本 1.0.0
许可证 MIT-0
累计安装 3
当前安装数 3
历史版本数 1
常见问题

WebClip Save & Summarize Web Pages 是什么?

Fetch web pages, strip to clean readable text, summarize into agent-ready markdown. Research assistant foundation. No browser required. 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 522 次。

如何安装 WebClip Save & Summarize Web Pages?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install web-clip」即可一键安装,无需额外配置。

WebClip Save & Summarize Web Pages 是免费的吗?

是的,WebClip Save & Summarize Web Pages 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。

WebClip Save & Summarize Web Pages 支持哪些平台?

WebClip Save & Summarize Web Pages 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 WebClip Save & Summarize Web Pages?

由 Shadow Rose(@theshadowrose)开发并维护,当前版本 v1.0.0。

💬 留言讨论