← Back to Skills Marketplace

Scrapling Web Fetch

Name: Scrapling Web Fetch
Author: jllyzzd2023

by 晨冬 · GitHub ↗ · v1.0.2

cross-platform ⚠ suspicious

947

Downloads

Stars

Active Installs

Versions

Install in OpenClaw

/install scrapling-web-fetch

Description

使用 Scrapling + html2text 获取现代网页正文内容，支持微信公众号文章抓取与尾部噪音清洗，减少无用信息与 token 消耗；适合抓取博客、新闻、公告及许多普通 fetch 不稳定、存在反爬或动态渲染干扰的网页。Supports WeChat article cleanup, markdown...

Usage Guidance

This skill appears to do what it says: fetch pages, extract body text, convert to Markdown, and clean WeChat noise. Before installing or running it, review the PyPI package 'scrapling' (and its homepage/source) because the script depends on it; a malicious or overly-permissive package could perform network calls or execute code. Run pip installs in a virtualenv or sandbox, inspect installed package source, and avoid passing sensitive local file paths to --batch or --selectors (the script will read those files). If you need stronger assurance, ask for the 'scrapling' package source or use an alternative extractor implemented with well-known libraries (requests + readability / newspaper / browser automation) whose behavior you can audit.

Capability Analysis

Type: OpenClaw Skill Name: scrapling-web-fetch Version: 1.0.2 The skill is a legitimate web content extraction tool designed to fetch and convert web pages into Markdown using the 'scrapling' and 'html2text' libraries. It features specific logic in 'scripts/scrapling_fetch.py' to clean up UI noise from WeChat articles, which helps reduce token usage for LLM processing. There is no evidence of data exfiltration, malicious execution, or prompt injection; the hardcoded file paths in 'SKILL.md' and 'usage.md' appear to be artifacts of the developer's local environment rather than malicious indicators.

Capability Assessment

✓ Purpose & Capability

Name/description match the code and runtime instructions: the script fetches pages, selects likely article containers, converts to Markdown, cleans WeChat noise, supports batch mode and site overrides. No unrelated credentials, binaries, or paths are required.

✓ Instruction Scope

SKILL.md instructs running the included Python script and describes inputs/outputs. The script only reads files explicitly passed by the user (--batch, --selectors) and fetches the provided URLs. It does not attempt to read arbitrary system files or environment variables.

ℹ Install Mechanism

There is no install spec (instruction-only), which is low-risk. However the skill recommends installing two PyPI packages (scrapling, html2text) via pip. Installing unknown third-party packages can execute arbitrary code at install/run time; the package 'scrapling' is not further documented here, so evaluate that package before installing.

✓ Credentials

The skill requests no environment variables, no credentials, and no config paths. The script only reads user-supplied batch or selectors files and writes output to stdout/stderr as expected.

✓ Persistence & Privilege

The skill does not request persistent or privileged presence (always:false). It does not modify other skills or system-wide configuration.

How to Use

Make sure OpenClaw is installed (local or Docker)
Run the install command in chat: /install scrapling-web-fetch
After installation, invoke the skill by name or use /scrapling-web-fetch
Provide required inputs per the skill's parameter spec and get structured output

Version History

v1.0.2

Improve storefront description to highlight WeChat article cleanup, token savings, markdown output, and support for hard-to-fetch modern pages.

v1.0.1

Fix Scrapling auto_match deprecation usage and improve release notes for WeChat cleanup, token savings, and anti-bot page support.

v1.0.0

Initial release: Scrapling + html2text web content extraction with WeChat tail noise cleanup, batch mode, selector overrides, and JSON output.

Metadata

Slug scrapling-web-fetch

Version 1.0.2

License —

All-time Installs 10

Active Installs 10

Total Versions 3

Frequently Asked Questions

What is Scrapling Web Fetch?

使用 Scrapling + html2text 获取现代网页正文内容，支持微信公众号文章抓取与尾部噪音清洗，减少无用信息与 token 消耗；适合抓取博客、新闻、公告及许多普通 fetch 不稳定、存在反爬或动态渲染干扰的网页。Supports WeChat article cleanup, markdown... It is an AI Agent Skill for Claude Code / OpenClaw, with 947 downloads so far.

How do I install Scrapling Web Fetch?

Run "/install scrapling-web-fetch" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Scrapling Web Fetch free?

Yes, Scrapling Web Fetch is completely free (open-source). You can download, install and use it at no cost.

Which platforms does Scrapling Web Fetch support?

Scrapling Web Fetch is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Scrapling Web Fetch?

It is built and maintained by 晨冬 (@jllyzzd2023); the current version is v1.0.2.

More Skills