← Back to Skills Marketplace
shuxiangfanclaw

Scrapling Fetch Pro

by shuxiangfanclaw · GitHub ↗ · v1.2.0 · MIT-0
cross-platform ⚠ suspicious
131
Downloads
0
Stars
0
Active Installs
2
Versions
Install in OpenClaw
/install scrapling-fetch-pro
Description
专业网页抓取工具,完整支持微信公众号文章爬取、自动模式检测、噪音清理。适合抓取博客、新闻、公告及各类有反爬保护的网站。
Usage Guidance
Things to consider before installing/using this skill: - Provenance: the package has no homepage and an unknown source/owner. Prefer code from known sources. - Version/metadata mismatch: SKILL.md claims v1.2.0 while _meta.json and the script header show v1.1.0 — this could indicate sloppy packaging or partial updates. - Promised "Cloudflare Turnstile" bypass is not implemented in the visible code; stealth behavior is delegated to scrapling.fetchers.StealthyFetcher. Inspect that external library before trusting the bypass claim. - Dependencies: Playwright will download browser binaries at runtime and executes page JavaScript (normal for stealth scraping). Run in a sandboxed environment and be aware of large network/download side-effects. - Legal/ethical risk: scraping WeChat and sites protected by anti-bot measures may violate terms of service or local law. Ensure you have the right to scrape target sites. - Recommended actions: review the scrapling package (scrapling.fetchers) source, verify the StealthyFetcher implementation, run the tool in an isolated environment (container/VM), and only provide it access to target URLs you control or are permitted to scrape.
Capability Analysis
Type: OpenClaw Skill Name: scrapling-fetch-pro Version: 1.2.0 The skill bundle is a specialized web scraping tool designed to extract content from websites, with specific optimizations for WeChat Official Accounts and anti-bot bypass (Cloudflare). The Python script `scripts/scrapling_fetch.py` uses legitimate libraries like `scrapling`, `playwright`, and `beautifulsoup4` to perform its stated functions. There is no evidence of data exfiltration, unauthorized execution, or malicious intent; the stealth and evasion capabilities are transparently documented as features for legitimate scraping purposes.
Capability Assessment
Purpose & Capability
Name/description claim a professional scraper with WeChat and anti-bot bypass features; the included script implements selector-based scraping, WeChat noise removal, basic/stealth modes and Markdown output, which is coherent with the stated purpose. However the README claims automatic Cloudflare Turnstile bypass and other advanced anti-bot techniques while the script delegates stealth behavior to an external StealthyFetcher (scrapling.fetchers) — the bypass behavior is not visible in the shipped code. Also metadata files and the script show inconsistent version numbers (SKILL.md says 1.2.0, _meta.json and script header show 1.1.0), and source/homepage are unknown.
Instruction Scope
SKILL.md and references instruct running the included Python script and describe modes/flags; they do not direct the agent to read unrelated files, exfiltrate environment variables, or call unrelated external endpoints. The runtime instructions are narrowly scoped to scraping tasks. Note: they refer to Sessions and Cloudflare bypass in prose but do not include concrete config or credential usage in the included files.
Install Mechanism
There is no install spec (instruction-only + code file). That is lowest-install risk, but the script depends on several heavy packages (playwright, patchright, scrapling, html2text, beautifulsoup4, lxml). Playwright in particular typically downloads browser binaries at runtime which has additional network/file implications. Because there is no provided install or provenance, it's unclear how those dependencies should be installed or whether the 'scrapling' package (and its StealthyFetcher) is trustworthy.
Credentials
The skill declares no required environment variables, credentials, or config paths and the code does not read env vars. That is proportionate to the stated purpose. Note that stealth scraping may require cookies/sessions for logged-in pages (the docs mention Sessions) but no session-handling credentials are requested by the skill as packaged.
Persistence & Privilege
The skill is not marked always:true and does not request persistent system modifications. It is user-invocable and can be called autonomously by the agent (default behavior), which is normal. No code attempts to modify other skills or global agent settings.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install scrapling-fetch-pro
  3. After installation, invoke the skill by name or use /scrapling-fetch-pro
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.2.0
更改许可证从 MIT-0 到 MIT(需要署名)
v1.1.0
- 新增完整支持微信公众号文章抓取及专用标题选择器 - 引入自动模式检测,根据 URL 智能选择抓取模式(basic/stealth/auto) - 强化噪音清理,自动移除公众号文章中的广告、工具栏等无用内容 - 增加正文选择器至16个,提升各类网站适配能力 - 支持 Cloudflare Turnstile 绕过及浏览器指纹伪装,增强反爬能力
Metadata
Slug scrapling-fetch-pro
Version 1.2.0
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 2
Frequently Asked Questions

What is Scrapling Fetch Pro?

专业网页抓取工具,完整支持微信公众号文章爬取、自动模式检测、噪音清理。适合抓取博客、新闻、公告及各类有反爬保护的网站。 It is an AI Agent Skill for Claude Code / OpenClaw, with 131 downloads so far.

How do I install Scrapling Fetch Pro?

Run "/install scrapling-fetch-pro" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Scrapling Fetch Pro free?

Yes, Scrapling Fetch Pro is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Scrapling Fetch Pro support?

Scrapling Fetch Pro is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Scrapling Fetch Pro?

It is built and maintained by shuxiangfanclaw (@shuxiangfanclaw); the current version is v1.2.0.

💬 Comments