codylrn804

Crawl4ai

Author: codylrn804 · GitHub · v1.0.0
cross-platform ⚠ suspicious
Total downloads: 3262
Favorites: 2
Current installs: 32
Versions: 1
Install in OpenClaw
/install crawl4ai
Description
AI-powered web scraping framework for extracting structured data from websites. Use when Codex needs to crawl, scrape, or extract data from web pages using AI-powered parsing, handle dynamic content, or work with complex HTML structures.
Safe usage recommendations
Install only if you need browser-based web scraping. Use it only on sites and pages you are authorized to access, avoid the Cloudflare/proxy bypass guidance, do not run gh auth login for this skill unless a separate GitHub task requires it, and treat screenshots, saved HTML, and extracted data as potentially sensitive.
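One way to act on the "authorized sites only" advice is to check every URL against an explicit allowlist before it reaches the crawler. The following is a minimal sketch using only the Python standard library; the `ALLOWED_HOSTS` set and the `is_authorized` helper are hypothetical names chosen for illustration and are not part of the skill.

```python
from urllib.parse import urlsplit

# Hypothetical allowlist: replace with hosts you are authorized to crawl.
ALLOWED_HOSTS = {"example.com", "docs.example.com"}


def is_authorized(url, allowed_hosts=ALLOWED_HOSTS):
    """Return True only for http(s) URLs whose host is explicitly allowlisted."""
    parts = urlsplit(url)
    if parts.scheme not in ("http", "https"):
        return False
    # hostname is already lowercased by urlsplit; it is None when missing.
    host = parts.hostname or ""
    return host in allowed_hosts
```

Exact host matching is deliberately strict: a lookalike such as `example.com.evil.io` does not pass, and neither does a subdomain that was not listed.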
Functional analysis
Type: OpenClaw Skill
Name: crawl4ai
Version: 1.0.0
The OpenClaw AgentSkills skill bundle 'crawl4ai' is classified as suspicious. While its primary purpose is benign web scraping, it provides powerful capabilities that, if misused by a malicious prompt to the AI agent, could lead to harmful actions. Specifically, the `AsyncWebCrawler` allows arbitrary network requests to any URL and the `js_code` parameter enables the execution of arbitrary JavaScript within the browser context of the scraped page (as seen in `SKILL.md`, `references/api_reference.md`, and `references/examples.md`). Although there is no explicit malicious intent or instructions for the agent to perform harmful actions within the skill's own files, these broad capabilities present a significant attack surface for prompt injection or other forms of misuse against the agent, allowing it to potentially access sensitive data or perform unauthorized actions on external websites.
Capability assessment
Purpose & Capability
The scraping, crawling, JavaScript rendering, screenshots, custom JavaScript, and local output files fit the stated Crawl4ai purpose, but they can collect sensitive page content if used on private or authenticated targets.
Instruction Scope
The instructions include some responsible scraping advice, but they also recommend residential proxies and user-agent variation for Cloudflare bot detection and show broad scraping examples without clear authorization or scope controls.
Install Mechanism
No automatic installer or startup hook is present; the scripts are user-run helpers that rely on the external crawl4ai Python package, with dependency setup not fully declared in the artifact.
Credentials
Outbound network access, browser automation, JavaScript execution, screenshots, file:// HTML rendering, and saved scraped output are proportionate for scraping, but the artifact under-discloses risks for internal URLs, authenticated pages, screenshots, and untrusted local HTML.
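The internal-URL and `file://` risks noted above can be partly reduced with a pre-flight check on each target. The sketch below is an assumption-laden illustration, not part of the skill: `looks_internal` is a hypothetical helper that flags non-http(s) schemes, `localhost` names, and IP literals in private, loopback, link-local, or reserved ranges. It does not resolve DNS names, so a public hostname that points at an internal address would still pass.

```python
import ipaddress
from urllib.parse import urlsplit


def looks_internal(url):
    """Return True if the URL is non-http(s) or targets an obviously internal host."""
    parts = urlsplit(url)
    if parts.scheme not in ("http", "https"):
        return True  # covers file://, ftp://, and scheme-less input
    host = parts.hostname or ""
    if not host or host == "localhost" or host.endswith(".localhost"):
        return True
    try:
        addr = ipaddress.ip_address(host)
    except ValueError:
        return False  # a DNS name; this sketch does not resolve it
    return (
        addr.is_private
        or addr.is_loopback
        or addr.is_link_local
        or addr.is_reserved
    )
```

A caller would typically skip or log any URL for which `looks_internal` returns True rather than pass it on to the crawler.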
Persistence & Privilege
There is no hidden background persistence or destructive behavior, but the troubleshooting guide tells users to run gh auth login, creating an unrelated GitHub authenticated session for a scraping skill.
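How to use
  1. Make sure OpenClaw is installed (locally or via Docker).
  2. Enter the install command in the chat box: /install crawl4ai
  3. Once installation finishes, invoke the Skill by name or trigger it with /crawl4ai.
  4. Provide the required inputs described in the Skill's parameter documentation to get structured output.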
Version history
v1.0.0
Initial release of crawl4ai, an AI-powered web scraping framework for extracting structured data from websites.
- Enables intelligent extraction and cleaning of data from complex or dynamic web pages.
- Supports scraping with JavaScript rendering, main content extraction, and custom data fields (like products or articles).
- Offers a simple Python async interface with robust error handling and output as markdown, clean HTML, structured JSON, and screenshots.
- Includes guidance for common scraping scenarios, custom JavaScript injection, session management, and batch/bulk scraping.
- Provides best practices for responsible web scraping and includes sample scripts and documentation for quick onboarding.
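The async interface and error handling mentioned in the release notes might be used roughly as follows. This is a sketch under assumptions, not the skill's own script: it assumes the external `crawl4ai` package exposes `AsyncWebCrawler` as an async context manager with an `arun(url=...)` method whose result has `success` and `markdown` attributes, which may differ between package versions. `fetch_markdown`, `fetch_markdown_sync`, and the `crawler_factory` parameter are hypothetical names introduced here so the crawler can be swapped out.

```python
import asyncio


async def fetch_markdown(urls, crawler_factory=None):
    """Crawl each URL in turn and return {url: markdown text, or None on failure}."""
    if crawler_factory is None:
        # Imported lazily so this module loads even without crawl4ai installed.
        from crawl4ai import AsyncWebCrawler
        crawler_factory = AsyncWebCrawler

    results = {}
    async with crawler_factory() as crawler:
        for url in urls:
            try:
                result = await crawler.arun(url=url)
            except Exception:
                # Treat any crawl error as "no content" for this URL.
                results[url] = None
                continue
            results[url] = str(result.markdown) if result.success else None
    return results


def fetch_markdown_sync(urls, crawler_factory=None):
    """Blocking convenience wrapper around fetch_markdown."""
    return asyncio.run(fetch_markdown(urls, crawler_factory))
```

Calling `fetch_markdown_sync(["https://example.com"])` would return a one-entry dictionary. Crawling sequentially keeps the load on the target site low; a concurrent version is possible but makes rate limiting the caller's responsibility.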
Metadata
Slug crawl4ai
Version 1.0.0
License
Total installs 32
Current installs 32
Number of versions 1
FAQ

What is Crawl4ai?

AI-powered web scraping framework for extracting structured data from websites. Use when Codex needs to crawl, scrape, or extract data from web pages using AI-powered parsing, handle dynamic content, or work with complex HTML structures. It is an AI Agent Skill plugin for Claude Code / OpenClaw, with 3262 total downloads to date.

How do I install Crawl4ai?

Run the command "/install crawl4ai" in the OpenClaw or Claude Code chat box for a one-click install; no extra configuration is required.

Is Crawl4ai free?

Yes. Crawl4ai is completely free (free and open source) and can be freely downloaded, installed, and used.

Which platforms does Crawl4ai support?

Crawl4ai is cross-platform and runs in any environment where OpenClaw / Claude Code is deployed.

Who developed Crawl4ai?

Developed and maintained by codylrn804 (@codylrn804); the current version is v1.0.0.
