← Back to Skills Marketplace

FlowCrawl — Stealth Web Scraper That Bypasses Everything

Name: FlowCrawl — Stealth Web Scraper That Bypasses Everything
Author: windseeker1111

by windseeker1111 · GitHub ↗ · v1.1.0 · MIT-0

cross-platform ⚠ suspicious

419

Downloads

Stars

Active Installs

Versions

Install in OpenClaw

/install flowcrawl

Description

Stealth web scraper. Give it any URL and it punches through Cloudflare, bot detection, and WAFs automatically using a 3-tier cascade (plain HTTP → TLS spoof...

README (SKILL.md)

FlowCrawl

Scrape any website. Bypass any bot protection. Free.

Install Scrapling First

pip install scrapling

Scrapling installs Playwright automatically on first run. That's the only dependency.

Quick Usage

# Single URL — prints clean markdown to stdout
python3 ~/clawd/skills/flowcrawl/scripts/flowcrawl.py https://example.com

# Spider the whole site
python3 ~/clawd/skills/flowcrawl/scripts/flowcrawl.py https://example.com --deep

# Deep crawl with limits, save and combine
python3 ~/clawd/skills/flowcrawl/scripts/flowcrawl.py https://example.com --deep --limit 30 --combine

# JSON output — pipe into anything
python3 ~/clawd/skills/flowcrawl/scripts/flowcrawl.py https://example.com --json

Add Alias (Recommended)

echo 'alias flowcrawl="python3 ~/clawd/skills/flowcrawl/scripts/flowcrawl.py"' >> ~/.zshrc
source ~/.zshrc

Then just: flowcrawl https://example.com

How It Works

FlowCrawl uses a 3-tier fetcher cascade. Starts fast, escalates only when blocked:

Tier	Method	Handles
1	Plain HTTP	Most sites, instant
2	Stealth + TLS spoof	Cloudflare, Imperva, basic WAFs
3	Full JS execution	SPAs, heavy JS, aggressive bot detection

Auto-detects blocking (403, 503, "Just a moment...") and escalates silently.

All Options

Flag	Description	Default
`--deep`	Spider whole site following internal links	off
`--depth N`	Max hop depth from start URL	3
`--limit N`	Max pages to crawl	50
`--combine`	Merge all pages into one file	off
`--format md\|txt`	Output format	md
`--output DIR`	Output directory	./flowcrawl-output
`--json`	Structured JSON output	off
`--quiet`	Suppress progress logs	off

Usage Guidance

This skill is coherent with its stated aim of bypassing bot protections, but that purpose is inherently risky and may violate site terms or laws. Before installing: 1) Decide whether evading WAFs/Cloudflare is appropriate and legal for your use case — don’t use on sites you don’t own or without permission. 2) Review the scrapling project source and trustworthiness (pip package + GitHub repo) because installing it will bring Playwright and download browser binaries. 3) Be aware the README suggests modifying ~/.zshrc (adds an alias); only do this if you want that persistent change. 4) Run in an isolated environment (VM/container) if you want to reduce risk of surprising downloads or side effects. 5) If you plan to use this in production or in an automated agent, consider legal/ethical review and logging/limits to avoid abusive scraping. If you want a lower-risk option, prefer tools that respect robots.txt and avoid active fingerprint spoofing.

Capability Analysis

Type: OpenClaw Skill Name: flowcrawl Version: 1.1.0 FlowCrawl is a web scraping utility that implements a three-tier escalation strategy (plain HTTP, TLS spoofing, and full JS execution) using the 'scrapling' library to bypass bot protections. The Python script in `scripts/flowcrawl.py` contains standard crawling logic, markdown extraction, and local file management without any evidence of data exfiltration, unauthorized network calls, or malicious execution. While `SKILL.md` suggests adding a shell alias to `~/.zshrc`, this is presented as a documented convenience for CLI usage rather than a hidden persistence mechanism.

Capability Assessment

ℹ Purpose & Capability

The name/description (stealth scraper that 'punches through Cloudflare/WAFs') align with the included code and SKILL.md: the CLI uses a three-tier escalation (plain HTTP → stealth/TLS spoof → full JS via Playwright). No unrelated credentials or config are requested. The claim 'No CDP Chrome' is potentially misleading because Playwright and stealth tooling are used—functionally this is a browser-automation based bypass stack, which matches the stated purpose but the marketing is aggressive and possibly inaccurate.

⚠ Instruction Scope

SKILL.md instructs the user to pip install scrapling (which will pull Playwright and stealth plugins) and to add an alias to the user's shell rc (~/.zshrc). The runtime instructions and code explicitly escalate to evasion techniques (TLS fingerprint spoofing, stealth plugins, full JS execution) to bypass protections — behavior that intentionally evades server-side defenses and could violate terms of service or laws. The skill does not attempt to read unrelated local files, nor does it exfiltrate data to external endpoints, but it does modify user shell config via the recommended alias and triggers external downloads when installed or run.

⚠ Install Mechanism

There is no registry install spec, but SKILL.md requires 'pip install scrapling'. Scrapling will install Playwright and (on first run) download browser binaries — a network-driven install that writes binaries to disk. The lack of a formal install spec in the registry plus the implicit heavy runtime dependency (Playwright/browser downloads) is a practical installation risk and should be made explicit to users. The pip/Playwright download is from public registries, not an unknown URL, but can be large and perform additional network activity.

✓ Credentials

The skill requests no environment variables, no credentials, and no special config paths. That is proportionate to a local scraper tool. There are no declared requirements for unrelated secrets or remote service keys.

ℹ Persistence & Privilege

The skill is user-invocable and not 'always: true' (no elevated persistent privilege). However SKILL.md recommends adding an alias to ~/.zshrc which writes to the user's shell config — a mild, user-visible persistence action. Playwright will also place browser artifacts on disk. The skill does not modify other skills or system-wide OpenClaw settings.

How to Use

Make sure OpenClaw is installed (local or Docker)
Run the install command in chat: /install flowcrawl
After installation, invoke the skill by name or use /flowcrawl
Provide required inputs per the skill's parameter spec and get structured output

Version History

v1.1.0

Stealth web scraper. Punches through Cloudflare, bot detection, and WAFs using a 3-tier cascade (plain HTTP, TLS spoof, full JS). No API keys, no proxies, no CDP Chrome. Free from the Flow team.

v1.0.2

Version 1.0.2 of FlowCrawl - No file changes were detected in this version. - Functionality, documentation, and options remain unchanged.

v1.0.1

- Updated SKILL.md with improved description and branding. - Clarified usage and description to emphasize FlowCrawl’s ability to bypass bot protection. - Adjusted skill name casing and authorship notes. - No code changes; documentation only.

v1.0.0

Initial release of FlowCrawl, a stealth web scraper that bypasses Cloudflare and bot protections. - Introduces a 3-tier cascade for web scraping: plain HTTP → TLS fingerprint spoofing → full JS execution. - Requires Scrapling (installs Playwright on first use) as the only dependency. - Offers CLI usage for scraping single URLs, deep site crawling, output in markdown or JSON, and output combining. - Includes flags for crawl depth, page limits, output format, and quiet mode. - Automatically detects and escalates around site blocks, supporting most modern anti-bot protections.

Metadata

Slug flowcrawl

Version 1.1.0

License MIT-0

All-time Installs 0

Active Installs 0

Total Versions 4

Frequently Asked Questions

What is FlowCrawl — Stealth Web Scraper That Bypasses Everything?

Stealth web scraper. Give it any URL and it punches through Cloudflare, bot detection, and WAFs automatically using a 3-tier cascade (plain HTTP → TLS spoof... It is an AI Agent Skill for Claude Code / OpenClaw, with 419 downloads so far.

How do I install FlowCrawl — Stealth Web Scraper That Bypasses Everything?

Run "/install flowcrawl" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is FlowCrawl — Stealth Web Scraper That Bypasses Everything free?

Yes, FlowCrawl — Stealth Web Scraper That Bypasses Everything is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does FlowCrawl — Stealth Web Scraper That Bypasses Everything support?

FlowCrawl — Stealth Web Scraper That Bypasses Everything is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created FlowCrawl — Stealth Web Scraper That Bypasses Everything?

It is built and maintained by windseeker1111 (@windseeker1111); the current version is v1.1.0.

More Skills