Crawl4ai

Name: Crawl4ai
Author: codylrn804

Description

AI-powered web scraping framework for extracting structured data from websites. Use when Codex needs to crawl, scrape, or extract data from web pages using AI-powered parsing, handle dynamic content, or work with complex HTML structures.

Usage Guidance

Install only if you need browser-based web scraping. Use it only on sites and pages you are authorized to access, avoid the Cloudflare/proxy bypass guidance, do not run gh auth login for this skill unless a separate GitHub task requires it, and treat screenshots, saved HTML, and extracted data as potentially sensitive.

Capability Analysis

Type: OpenClaw Skill Name: crawl4ai Version: 1.0.0 The OpenClaw AgentSkills skill bundle 'crawl4ai' is classified as suspicious. While its primary purpose is benign web scraping, it provides powerful capabilities that, if misused by a malicious prompt to the AI agent, could lead to harmful actions. Specifically, the `AsyncWebCrawler` allows arbitrary network requests to any URL and the `js_code` parameter enables the execution of arbitrary JavaScript within the browser context of the scraped page (as seen in `SKILL.md`, `references/api_reference.md`, and `references/examples.md`). Although there is no explicit malicious intent or instructions for the agent to perform harmful actions within the skill's own files, these broad capabilities present a significant attack surface for prompt injection or other forms of misuse against the agent, allowing it to potentially access sensitive data or perform unauthorized actions on external websites.

Capability Assessment

ℹ Purpose & Capability

The scraping, crawling, JavaScript rendering, screenshots, custom JavaScript, and local output files fit the stated Crawl4ai purpose, but they can collect sensitive page content if used on private or authenticated targets.

⚠ Instruction Scope

The instructions include some responsible scraping advice, but they also recommend residential proxies and user-agent variation for Cloudflare bot detection and show broad scraping examples without clear authorization or scope controls.

ℹ Install Mechanism

No automatic installer or startup hook is present; the scripts are user-run helpers that rely on the external crawl4ai Python package, with dependency setup not fully declared in the artifact.

⚠ Credentials

Outbound network access, browser automation, JavaScript execution, screenshots, file:// HTML rendering, and saved scraped output are proportionate for scraping, but the artifact under-discloses risks for internal URLs, authenticated pages, screenshots, and untrusted local HTML.

⚠ Persistence & Privilege

There is no hidden background persistence or destructive behavior, but the troubleshooting guide tells users to run gh auth login, creating an unrelated GitHub authenticated session for a scraping skill.

Version History

v1.0.0

Initial release of crawl4ai – an AI-powered web scraping framework for extracting structured data from websites. - Enables intelligent extraction and cleaning of data from complex or dynamic web pages. - Supports scraping with JavaScript rendering, main content extraction, and custom data fields (like products or articles). - Offers simple Python async interface with robust error handling and output as markdown, clean HTML, structured JSON, and screenshots. - Includes guidance for common scraping scenarios, custom JavaScript injection, session management, and batch/bulk scraping. - Provides best practices for responsible web scraping and includes sample scripts and documentation for quick onboarding.

Metadata

Slug crawl4ai

Version 1.0.0

License —

All-time Installs 32

Active Installs 32

Total Versions 1

Frequently Asked Questions

What is Crawl4ai?

AI-powered web scraping framework for extracting structured data from websites. Use when Codex needs to crawl, scrape, or extract data from web pages using AI-powered parsing, handle dynamic content, or work with complex HTML structures. It is an AI Agent Skill for Claude Code / OpenClaw, with 3262 downloads so far.

How do I install Crawl4ai?

Run "/install crawl4ai" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Crawl4ai free?

Yes, Crawl4ai is completely free (open-source). You can download, install and use it at no cost.

Which platforms does Crawl4ai support?

Crawl4ai is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Crawl4ai?

It is built and maintained by codylrn804 (@codylrn804); the current version is v1.0.0.

More Skills

What is Crawl4ai?

How do I install Crawl4ai?

Is Crawl4ai free?

Which platforms does Crawl4ai support?

Who created Crawl4ai?

💬 Comments