FlowCrawl — Stealth Web Scraper That Bypasses Everything
/install flowcrawl
Scrape any website. Bypass any bot protection. Free.
Install Scrapling First
pip install scrapling
Scrapling installs Playwright automatically on first run. That's the only dependency.
Quick Usage
# Single URL — prints clean markdown to stdout
python3 ~/clawd/skills/flowcrawl/scripts/flowcrawl.py https://example.com
# Spider the whole site
python3 ~/clawd/skills/flowcrawl/scripts/flowcrawl.py https://example.com --deep
# Deep crawl with limits, save and combine
python3 ~/clawd/skills/flowcrawl/scripts/flowcrawl.py https://example.com --deep --limit 30 --combine
# JSON output — pipe into anything
python3 ~/clawd/skills/flowcrawl/scripts/flowcrawl.py https://example.com --json
Add Alias (Recommended)
echo 'alias flowcrawl="python3 ~/clawd/skills/flowcrawl/scripts/flowcrawl.py"' >> ~/.zshrc
source ~/.zshrc
Then just: flowcrawl https://example.com
How It Works
FlowCrawl uses a 3-tier fetcher cascade. It starts with the fastest method and escalates to the next tier only when a request is blocked:
| Tier | Method | Handles |
|---|---|---|
| 1 | Plain HTTP | Most sites, instant |
| 2 | Stealth + TLS spoof | Cloudflare, Imperva, basic WAFs |
| 3 | Full JS execution | SPAs, heavy JS, aggressive bot detection |
FlowCrawl auto-detects blocking (an HTTP 403 or 503 response, or a "Just a moment..." challenge page) and escalates to the next tier silently.
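The escalation logic described above can be sketched roughly as follows. This is a minimal illustration, not FlowCrawl's actual source: `Response`, `looks_blocked`, and `fetch_with_cascade` are hypothetical names, and the three tiers are stand-in stub functions so the example runs without network access. Only the block signals (403, 503, "Just a moment...") come from the description above.

```python
from dataclasses import dataclass
from typing import Callable, Sequence, Tuple


@dataclass
class Response:
    """Hypothetical minimal response: just a status code and a body."""
    status: int
    body: str


# Block signals named in the text above.
BLOCK_STATUSES = {403, 503}
CHALLENGE_MARKERS = ("Just a moment...",)

Fetcher = Callable[[str], Response]


def looks_blocked(resp: Response) -> bool:
    """True if the response has a block status or contains a challenge marker."""
    if resp.status in BLOCK_STATUSES:
        return True
    return any(marker in resp.body for marker in CHALLENGE_MARKERS)


def fetch_with_cascade(url: str, tiers: Sequence[Fetcher]) -> Tuple[int, Response]:
    """Try each tier in order; return (tier_number, response) for the first
    response that does not look blocked, or the last attempt if all are blocked."""
    if not tiers:
        raise ValueError("at least one fetcher is required")
    for tier_number, fetch in enumerate(tiers, start=1):
        resp = fetch(url)
        if not looks_blocked(resp):
            break
    return tier_number, resp


# Stub tiers standing in for plain HTTP, stealth fetch, and full JS execution.
def plain_http(url: str) -> Response:
    return Response(403, "Forbidden")


def stealth(url: str) -> Response:
    return Response(200, "<title>Just a moment...</title>")


def full_browser(url: str) -> Response:
    return Response(200, "<h1>Example Domain</h1>")


tier, resp = fetch_with_cascade(
    "https://example.com", [plain_http, stealth, full_browser]
)
print(tier, resp.status)  # 3 200
```

Passing the tiers in as plain callables keeps the cheap path first and makes the detection rule easy to test in isolation.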
All Options
| Flag | Description | Default |
|---|---|---|
| `--deep` | Spider whole site following internal links | off |
| `--depth N` | Max hop depth from start URL | 3 |
| `--limit N` | Max pages to crawl | 50 |
| `--combine` | Merge all pages into one file | off |
| `--format md\|txt` | Output format | md |
| `--output DIR` | Output directory | ./flowcrawl-output |
| `--json` | Structured JSON output | off |
| `--quiet` | Suppress progress logs | off |
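When driving FlowCrawl from another script, it can help to assemble the command line from the flags listed above rather than concatenating strings. The following is a small sketch under stated assumptions: `build_command` and its keyword names are hypothetical, the script path is the one used in the usage examples, and only the flags from the options table are emitted.

```python
import os
import shlex

# Script path as given in the usage examples; adjust if installed elsewhere.
SCRIPT = os.path.expanduser("~/clawd/skills/flowcrawl/scripts/flowcrawl.py")


def build_command(url, deep=False, depth=None, limit=None, combine=False,
                  fmt=None, output=None, as_json=False, quiet=False):
    """Hypothetical helper: turn keyword options into a flowcrawl argv list."""
    if fmt is not None and fmt not in ("md", "txt"):
        raise ValueError("fmt must be 'md' or 'txt'")
    cmd = ["python3", SCRIPT, url]
    # Boolean switches are added only when turned on (all default to off).
    for enabled, flag in ((deep, "--deep"), (combine, "--combine"),
                          (as_json, "--json"), (quiet, "--quiet")):
        if enabled:
            cmd.append(flag)
    # Valued flags are added only when set, so the tool's defaults apply otherwise.
    for value, flag in ((depth, "--depth"), (limit, "--limit"),
                        (fmt, "--format"), (output, "--output")):
        if value is not None:
            cmd += [flag, str(value)]
    return cmd


cmd = build_command("https://example.com", deep=True, limit=30, combine=True)
print(shlex.join(cmd))  # shell-quoted form of the command, for logging
```

The returned list can be passed directly to `subprocess.run(cmd, check=True)`, which avoids shell quoting problems with URLs that contain `&` or `?`.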
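- Make sure OpenClaw is installed (locally or via Docker).
- Enter the install command in the chat box: /install flowcrawl
- Once installation completes, invoke the Skill by name or trigger it with /flowcrawl.
- Provide the inputs described in the Skill's parameter documentation to get structured output.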
What is FlowCrawl?
Stealth web scraper. Give it any URL and it punches through Cloudflare, bot detection, and WAFs automatically using a 3-tier cascade (plain HTTP → TLS spoof... It is an AI Agent Skill plugin for Claude Code / OpenClaw, with 419 downloads to date.
How do I install FlowCrawl?
Run the command /install flowcrawl in the OpenClaw or Claude Code chat box for a one-step install; no extra configuration is needed.
Is FlowCrawl free?
Yes. FlowCrawl is completely free under the MIT-0 license and can be freely downloaded, installed, and used.
Which platforms does FlowCrawl support?
FlowCrawl is cross-platform and runs in any environment where OpenClaw or Claude Code is deployed.
Who develops FlowCrawl?
It is developed and maintained by windseeker1111 (@windseeker1111). The current version is v1.1.0.