← 返回 Skills 市场
ernestyu

Claw Web Fetch

作者 Ernestyu · GitHub ↗ · v0.1.8 · MIT-0
cross-platform ✓ 安全检测通过
388
总下载
0
收藏
1
当前安装
7
版本数
在 OpenClaw 中安装
/install clawfetch
功能描述
Web page → Markdown scraper skill that wraps the clawfetch Node CLI to fetch articles, GitHub READMEs, and Reddit threads into normalized markdown with metad...
使用说明 (SKILL.md)

clawfetch (OpenClaw Skill)

Use the published clawfetch npm CLI to convert a single web page into normalized markdown with a small metadata header.

This skill is a thin wrapper around the public clawfetch package:

  • It does not vendor the clawfetch source code.
  • It does not perform any hidden git clones.
  • It only installs the clawfetch npm package in this skill directory via a separate, explicit bootstrap step.

Typical use cases for agents:

  • Fetch a WeChat / blog / docs article into markdown for later indexing.
  • Fetch a GitHub repository README (raw) and store it into a knowledge base.
  • Fetch a Reddit thread as a structured "post + comments" markdown document.

The CLI emits a header like:

--- METADATA ---
Title: ...
Author: ...
Site: ...
FinalURL: ...
Extraction: readability|fallback-container|body-innerText|github-raw-fast-path|reddit-rss
FallbackSelector: ...   # only when not readability
--- MARKDOWN ---
\x3Cmarkdown>

Installation (skill-level)

This skill assumes:

  • node / npm are available in the OpenClaw container.
  • Network access to npm to install the clawfetch package and its dependencies.

1. Install the skill into your workspace

Use the OpenClaw CLI to pull the skill from ClawHub into your active workspace:

openclaw skills install clawfetch

This creates a directory similar to:

~/.openclaw/workspace/skills/clawfetch

At this point only the skill wrapper files exist (SKILL.md, README, manifest, bootstrap_deps.sh, etc.). The actual npm package with the scraper logic is not installed yet.

2. Bootstrap the npm CLI once (required)

From the skill directory, run the bootstrap script to install the npm package locally:

cd ~/.openclaw/workspace/skills/clawfetch
bash bootstrap_deps.sh

This script is small and reviewable; it only calls:

npm install [email protected]

Key points:

  • OpenClaw's skills install does not currently auto-run this bootstrap step. Agents or humans must call bootstrap_deps.sh explicitly.
  • The install happens inside the skill directory. No global npm state is modified.

After this completes, the CLI entrypoint is available at:

~/.openclaw/workspace/skills/clawfetch/node_modules/clawfetch/clawfetch.js

Runtime usage (for agents)

After the bootstrap step, agents can invoke the CLI from this directory as:

node node_modules/clawfetch/clawfetch.js \x3Curl> [--max-comments N] [--no-reddit-rss]

Recommended patterns:

  • General articles / docs
    • Use the default mode, which launches headless Chromium via Playwright, then uses Readability + Turndown to extract the main article body.
  • GitHub repositories
    • Pass the repository URL (e.g. https://github.com/owner/repo).
    • clawfetch will attempt a raw README fast-path from raw.githubusercontent.com before falling back to browser mode.
  • Reddit threads
    • Pass the thread URL (e.g. https://www.reddit.com/r/.../comments/.../).
    • By default, clawfetch will:
      • Convert the URL to .rss and fetch the Atom feed using a normal desktop browser User-Agent.
      • Render the first entry as the main post (## Post: ...).
      • Render subsequent entries as comments (### Comment by ...), limited by --max-comments (default 50; 0 means no limit).
    • Use --no-reddit-rss to force browser scraping instead of RSS.

Safety / non-suspicious behaviour

  • This skill does not:
    • Clone any git repositories at runtime.
    • Download arbitrary source trees into the skill directory.
    • Run hidden package managers beyond the explicit npm install in the bootstrap step.
  • All heavy work (Playwright, Readability, Turndown) comes from the published clawfetch package and its declared npm dependencies.

Agents should treat this skill as a read-mostly web → markdown tool and avoid using it for arbitrary scripting or filesystem operations beyond its intended CLI interface.

安全使用建议
This skill is a small, auditable wrapper that installs the public clawfetch npm package into the skill directory and runs its CLI. Before installing: (1) be aware npm packages carry normal supply-chain risk—review the upstream clawfetch package and its dependencies if you need higher assurance; (2) Playwright/Chromium may download browser binaries during install or runtime, and Playwright executes site JS as part of rendering (this is inherent to browser-based scraping); (3) only set FLARESOLVERR_URL to a trusted FlareSolverr service if you need bot-challenge support; (4) run the one-time bootstrap script manually in an isolated / controlled environment if you prefer, and inspect node_modules/clawfetch if you want full assurance of runtime behavior.
功能分析
Type: OpenClaw Skill Name: clawfetch Version: 0.1.8 The clawfetch skill is a transparent wrapper for a Node.js web-to-markdown scraper utility. It uses a simple bootstrap_deps.sh script to install the public 'clawfetch' npm package and requires network access consistent with its stated purpose of fetching web content. The documentation (SKILL.md, README.md) is detailed and lacks any indicators of prompt injection, data exfiltration, or malicious execution.
能力评估
Purpose & Capability
The name and description match the requested binaries (node, npm), the network permission, and the single one-time npm install. Nothing requested (env vars, config paths, binaries) appears unrelated to a web→markdown scraper CLI.
Instruction Scope
SKILL.md and the README limit runtime behavior to invoking the installed clawfetch CLI in the skill directory. There are no instructions to read unrelated host files, access unrelated credentials, or post data to unexpected endpoints. The only optional external endpoint mentioned is FLARESOLVERR_URL (documented as optional for bot-challenge handling).
Install Mechanism
Installation is a one-line npm install ([email protected]) performed locally in the skill directory via bootstrap_deps.sh. This uses the public npm registry (a normal, expected mechanism) but does carry supply-chain risk inherent to installing packages from npm (transitive dependencies, native/browser downloads like Playwright's browsers). The bootstrap script is small and reviewable and does not download code from arbitrary URLs.
Credentials
No required environment variables or credentials are declared. The README documents an optional FLARESOLVERR_URL for Cloudflare/anti-bot handling; that is reasonable and optional. The skill does not request unrelated secrets or config paths.
Persistence & Privilege
Skill is not always-enabled and is user-invocable; it does not request elevated or persistent privileges nor does it modify other skills' configurations. Runtime autonomy (disable-model-invocation=false) is the platform default and is not by itself concerning here.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install clawfetch
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /clawfetch 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v0.1.8
clawfetch v0.1.8 - Updated version to 0.1.8 in SKILL.md and manifest. - Removed unused ENV_EXAMPLE.md. - Minor documentation and metadata updates for clarity.
v0.1.7
- Bumped clawfetch npm package dependency to version 0.1.7 in bootstrap and documentation. - Updated SKILL.md, README.md, and README_zh.md to reference the new version. - No feature changes; this is a maintenance update to align with the latest upstream release. - Manifest and scripts adjusted to use [email protected].
v0.1.6
- Bumped npm dependency to use [email protected] (was 0.1.5). - Updated documentation to reflect the new dependency version in SKILL.md, README.md, and bootstrap_deps.sh. - No logic or API changes—this release is a dependency and docs update only.
v0.1.5
clawfetch 0.1.5 - Updated to use and install clawfetch npm package version 0.1.5. - Improved installation instructions, clarifying the explicit bootstrap step required to install dependencies. - Documentation now highlights that the npm package is not installed until running bootstrap_deps.sh manually. - Revised and expanded README files for clarity and explicit install guidance. - No changes to runtime CLI usage or core scraping logic.
v0.1.2
clawfetch 0.1.2 - Added ENV_EXAMPLE.md to provide sample environment configuration. - Updated documentation files (README.md, README_zh.md, SKILL.md) for clarity and minor improvements. - No functional changes to the skill's behavior or installation process.
v0.1.1
clawfetch v0.1.1 - Updated documentation in SKILL.md and READMEs for improved clarity and instructions. - Removed mention of the deprecated --auto-install flag from usage examples. - Incremented version metadata to 0.1.1. - No functional or interface changes to the skill logic.
v0.1.0
Initial release: wraps the clawfetch Node CLI to fetch and convert web content to normalized markdown for OpenClaw agents. - Fetches web pages, GitHub READMEs, and Reddit threads into markdown with structured metadata. - Installs the clawfetch npm package in the skill directory (no vendored code or hidden clones). - CLI supports article, README, and Reddit RSS extraction with user-customizable options. - No extraneous source downloads or file operations outside the explicit npm install step. - Designed for agent use: web content → markdown with safety and transparency.
元数据
Slug clawfetch
版本 0.1.8
许可证 MIT-0
累计安装 1
当前安装数 1
历史版本数 7
常见问题

Claw Web Fetch 是什么?

Web page → Markdown scraper skill that wraps the clawfetch Node CLI to fetch articles, GitHub READMEs, and Reddit threads into normalized markdown with metad... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 388 次。

如何安装 Claw Web Fetch?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install clawfetch」即可一键安装,无需额外配置。

Claw Web Fetch 是免费的吗?

是的,Claw Web Fetch 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。

Claw Web Fetch 支持哪些平台?

Claw Web Fetch 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 Claw Web Fetch?

由 Ernestyu(@ernestyu)开发并维护,当前版本 v0.1.8。

💬 留言讨论