← Back to Skills Marketplace
Scrapling Fetch Basic
by
shuxiangfanclaw
· GitHub ↗
· v1.0.0
· MIT-0
126
Downloads
0
Stars
0
Active Installs
1
Versions
Install in OpenClaw
/install scrapling-fetch-basic
Description
基础网页抓取工具,支持绕过反爬系统、自动定位正文区域、HTML 转 Markdown。适合抓取博客、新闻、公告等静态页面。
Usage Guidance
This skill appears internally consistent, but take these precautions before using it:
- Source verification: the package has no homepage and an unknown owner; inspect the scrapling dependency source (PyPI/GitHub) before installing and prefer installing in an isolated environment (virtualenv/container).
- Install notes: playwright will usually download browser binaries when first used (run 'playwright install' or follow its docs). That will add sizeable executables to the host; be prepared for that.
- SSRF / network risk: the script fetches arbitrary URLs. If you run it on a server that can access internal resources, an attacker-supplied URL could cause server-side requests to internal endpoints. Only run with trusted URLs or in a network-isolated environment.
- Legal/ethical: stealth mode and Cloudflare bypass are intended to evade anti-bot protections — ensure you have the right to scrape targets and comply with terms of service and laws.
- Dependency hygiene: install dependencies from official registries or pinned releases, review the 'scrapling' package code because the skill relies on it for network access and stealth behavior.
- Runtime safety: run first with --debug and limited targets; consider timeouts/rate limits to avoid unintended heavy load.
If you want higher assurance, ask the author for a homepage or source repository and a release provenance (e.g., GitHub repo and PyPI package/version).
Capability Analysis
Type: OpenClaw Skill
Name: scrapling-fetch-basic
Version: 1.0.0
The skill is a legitimate web scraping tool designed to extract content from URLs and convert it to Markdown. It uses the 'scrapling' and 'html2text' libraries to provide basic and stealthy (browser-based) fetching modes as described in scripts/scrapling_fetch.py and SKILL.md, with no evidence of data exfiltration, malicious execution, or prompt injection.
Capability Assessment
Purpose & Capability
Name/description (web scraping, Cloudflare/stealth, HTML→Markdown) align with the provided script and declared deps (scrapling, html2text, playwright). No unrelated env vars, binaries, or config paths are required.
Instruction Scope
SKILL.md describes running the included Python script; the script only fetches the target URL, extracts content with a set of selectors, converts to Markdown, and prints output or JSON. It does not attempt to read local files, other env vars, or exfiltrate results to external endpoints.
Install Mechanism
There is no install spec (instruction-only) and a single Python script is included. Dependencies are listed but not installed automatically; the user environment must install scrapling, html2text, and playwright. Playwright typically requires downloading browser binaries (user should be aware).
Credentials
No credentials, secret environment variables, or config paths are requested. Required libraries are proportional to the functionality (HTML parsing and optional browser automation).
Persistence & Privilege
Skill does not request persistent always:true, does not modify other skills or system configs, and is user-invocable only. The script is executed on demand and does not persist credentials or install itself.
How to Use
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat:
/install scrapling-fetch-basic - After installation, invoke the skill by name or use
/scrapling-fetch-basic - Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
scrapling-fetch-basic 1.0.0 — 首发版
- 发布基础版网页抓取工具,支持静态页面内容提取与反爬绕过
- 自动定位正文区域,支持 11 种常用选择器
- 抓取结果可自动转换为 Markdown 格式
- 提供 basic 和 stealth 两种模式,满足常规与反爬页面需求
- 支持字符数自定义、JSON 输出等常用功能
Metadata
Frequently Asked Questions
What is Scrapling Fetch Basic?
基础网页抓取工具,支持绕过反爬系统、自动定位正文区域、HTML 转 Markdown。适合抓取博客、新闻、公告等静态页面。 It is an AI Agent Skill for Claude Code / OpenClaw, with 126 downloads so far.
How do I install Scrapling Fetch Basic?
Run "/install scrapling-fetch-basic" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.
Is Scrapling Fetch Basic free?
Yes, Scrapling Fetch Basic is completely free, licensed under MIT-0. You can download, install and use it at no cost.
Which platforms does Scrapling Fetch Basic support?
Scrapling Fetch Basic is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).
Who created Scrapling Fetch Basic?
It is built and maintained by shuxiangfanclaw (@shuxiangfanclaw); the current version is v1.0.0.
More Skills