← Back to Skills Marketplace
codylrn804

Crawl4ai

by codylrn804 · GitHub ↗ · v1.0.0
cross-platform ⚠ suspicious
3262
Downloads
2
Stars
32
Active Installs
1
Versions
Install in OpenClaw
/install crawl4ai
Description
AI-powered web scraping framework for extracting structured data from websites. Use when Codex needs to crawl, scrape, or extract data from web pages using AI-powered parsing, handle dynamic content, or work with complex HTML structures.
Usage Guidance
Install only if you need browser-based web scraping. Use it only on sites and pages you are authorized to access, avoid the Cloudflare/proxy bypass guidance, do not run gh auth login for this skill unless a separate GitHub task requires it, and treat screenshots, saved HTML, and extracted data as potentially sensitive.
Capability Analysis
Type: OpenClaw Skill Name: crawl4ai Version: 1.0.0 The OpenClaw AgentSkills skill bundle 'crawl4ai' is classified as suspicious. While its primary purpose is benign web scraping, it provides powerful capabilities that, if misused by a malicious prompt to the AI agent, could lead to harmful actions. Specifically, the `AsyncWebCrawler` allows arbitrary network requests to any URL and the `js_code` parameter enables the execution of arbitrary JavaScript within the browser context of the scraped page (as seen in `SKILL.md`, `references/api_reference.md`, and `references/examples.md`). Although there is no explicit malicious intent or instructions for the agent to perform harmful actions within the skill's own files, these broad capabilities present a significant attack surface for prompt injection or other forms of misuse against the agent, allowing it to potentially access sensitive data or perform unauthorized actions on external websites.
Capability Assessment
Purpose & Capability
The scraping, crawling, JavaScript rendering, screenshots, custom JavaScript, and local output files fit the stated Crawl4ai purpose, but they can collect sensitive page content if used on private or authenticated targets.
Instruction Scope
The instructions include some responsible scraping advice, but they also recommend residential proxies and user-agent variation for Cloudflare bot detection and show broad scraping examples without clear authorization or scope controls.
Install Mechanism
No automatic installer or startup hook is present; the scripts are user-run helpers that rely on the external crawl4ai Python package, with dependency setup not fully declared in the artifact.
Credentials
Outbound network access, browser automation, JavaScript execution, screenshots, file:// HTML rendering, and saved scraped output are proportionate for scraping, but the artifact under-discloses risks for internal URLs, authenticated pages, screenshots, and untrusted local HTML.
Persistence & Privilege
There is no hidden background persistence or destructive behavior, but the troubleshooting guide tells users to run gh auth login, creating an unrelated GitHub authenticated session for a scraping skill.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install crawl4ai
  3. After installation, invoke the skill by name or use /crawl4ai
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
Initial release of crawl4ai – an AI-powered web scraping framework for extracting structured data from websites. - Enables intelligent extraction and cleaning of data from complex or dynamic web pages. - Supports scraping with JavaScript rendering, main content extraction, and custom data fields (like products or articles). - Offers simple Python async interface with robust error handling and output as markdown, clean HTML, structured JSON, and screenshots. - Includes guidance for common scraping scenarios, custom JavaScript injection, session management, and batch/bulk scraping. - Provides best practices for responsible web scraping and includes sample scripts and documentation for quick onboarding.
Metadata
Slug crawl4ai
Version 1.0.0
License
All-time Installs 32
Active Installs 32
Total Versions 1
Frequently Asked Questions

What is Crawl4ai?

AI-powered web scraping framework for extracting structured data from websites. Use when Codex needs to crawl, scrape, or extract data from web pages using AI-powered parsing, handle dynamic content, or work with complex HTML structures. It is an AI Agent Skill for Claude Code / OpenClaw, with 3262 downloads so far.

How do I install Crawl4ai?

Run "/install crawl4ai" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Crawl4ai free?

Yes, Crawl4ai is completely free (open-source). You can download, install and use it at no cost.

Which platforms does Crawl4ai support?

Crawl4ai is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Crawl4ai?

It is built and maintained by codylrn804 (@codylrn804); the current version is v1.0.0.

💬 Comments