qthans

Anycrawl

Author: QThans · GitHub · v1.0.1 · MIT-0
cross-platform ✓ Security check passed
Total downloads: 259
Favorites: 1
Current installs: 0
Versions: 2
Install in OpenClaw
/install anycrawl-cli
Description
Web scraping, search, crawling, and site mapping via the AnyCrawl CLI. Use when the user wants to search the web, scrape a page, find URLs on a site, or bulk...
Usage instructions (SKILL.md)

AnyCrawl CLI

Web scraping, search, and crawling CLI. Returns clean markdown optimized for LLM context windows. Default engine: playwright.

Run anycrawl --help or anycrawl <command> --help for full option details.

Prerequisites

Must be installed and authenticated. Run anycrawl login or set ANYCRAWL_API_KEY.

If not ready, see rules/install.md. For output handling guidelines, see rules/security.md.
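
The prerequisites above can be checked before running any command. The following Python sketch assumes the CLI is exposed on PATH as `anycrawl` and treats a non-empty `ANYCRAWL_API_KEY` as sufficient. It cannot see credentials stored by `anycrawl login`, so a missing variable is only a hint. The function name `check_prerequisites` is hypothetical and not part of the skill.

```python
import os
import shutil


def check_prerequisites(env=None, which=shutil.which):
    """Return a list of detected problems; an empty list means none were found.

    Assumptions: the executable is named `anycrawl`, and ANYCRAWL_API_KEY is the
    only credential this check can see (a prior `anycrawl login` is not detected).
    """
    env = os.environ if env is None else env
    problems = []
    if which("anycrawl") is None:
        problems.append("anycrawl not found on PATH; see rules/install.md")
    if not env.get("ANYCRAWL_API_KEY", "").strip():
        problems.append("ANYCRAWL_API_KEY is not set; run `anycrawl login` or export the key")
    return problems


if __name__ == "__main__":
    for problem in check_prerequisites():
        print(problem)
```

Passing `env` and `which` as parameters keeps the check easy to exercise without touching the real environment.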

Commands

  • Search - No specific URL yet. Find pages, answer questions. Use --scrape to get full page content with results.
  • Scrape - Have a URL. Extract its content directly.
  • Map - Need to locate a specific page on a site. Discover URLs, then scrape the ones you need.
  • Crawl - Need bulk content from a site or section. Use crawl directly — no need for map first.
| Need | Command | When |
| --- | --- | --- |
| Find pages on a topic | search | No specific URL yet |
| Get a page's content | scrape | Have a URL |
| Find URLs within a site | map | Need to locate a specific subpage |
| Bulk extract a site section | crawl | Need many pages (e.g., all /docs/) |

For detailed command reference, run anycrawl <command> --help (e.g., anycrawl search --help, anycrawl scrape --help).
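
The selection rules above can be written as a small decision function. This is only an illustrative Python sketch: `choose_command` and its parameters are hypothetical names, and the ordering of the checks is one reasonable reading of the guidance, not something the skill prescribes.

```python
def choose_command(has_url: bool,
                   need_many_pages: bool = False,
                   need_specific_subpage: bool = False) -> str:
    """Pick one of the four AnyCrawl commands from the caller's situation."""
    if not has_url:
        # No specific URL yet: find pages first.
        return "search"
    if need_many_pages:
        # Bulk content from a site or section: crawl directly, no map first.
        return "crawl"
    if need_specific_subpage:
        # Locate a page on a known site, then scrape the hits you need.
        return "map"
    # A known URL whose content is wanted directly.
    return "scrape"
```

For example, wanting everything under a site's /docs/ section corresponds to `choose_command(has_url=True, need_many_pages=True)`.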

Avoid redundant fetches: search --scrape already fetches full page content. Don't re-scrape those URLs. Check .anycrawl/ for existing data before fetching again.
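
One way to check .anycrawl/ before fetching again is sketched below in Python. The file naming scheme (a SHA-256 prefix of the URL plus `.md`) is purely an assumption for illustration, since the skill does not define how output files are named, and `cached_path` and `needs_fetch` are hypothetical helpers.

```python
import hashlib
from pathlib import Path


def cached_path(url: str, root: str = ".anycrawl") -> Path:
    """Map a URL to a file under root (hypothetical scheme: sha256 prefix + .md)."""
    digest = hashlib.sha256(url.encode("utf-8")).hexdigest()[:16]
    return Path(root) / f"{digest}.md"


def needs_fetch(url: str, root: str = ".anycrawl") -> bool:
    """Return True when no non-empty saved file exists for this URL."""
    path = cached_path(url, root)
    return not (path.is_file() and path.stat().st_size > 0)
```

If the output path passed to -o is derived from `cached_path`, a later `needs_fetch` call for the same URL returns False and the fetch can be skipped.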

Output & Organization

Write results to .anycrawl/ with -o. Add .anycrawl/ to .gitignore. Always quote URLs in shell commands. Never read entire output files at once — use grep, head, or incremental reads.
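
The quoting and incremental-read advice above can be illustrated with a short Python sketch. It assumes the URL is passed as a positional argument to `anycrawl scrape`, which the skill text does not confirm (only the `-o` flag comes from it), so check `anycrawl scrape --help` before relying on that shape. `build_scrape_command` and `read_head` are hypothetical helper names.

```python
import shlex
from itertools import islice


def build_scrape_command(url: str, out_path: str) -> str:
    """Return a shell command string with the URL and output path safely quoted.

    Assumption: the URL is positional; only `-o` is taken from the skill text.
    """
    return " ".join(["anycrawl", "scrape", shlex.quote(url), "-o", shlex.quote(out_path)])


def read_head(path: str, max_lines: int = 40) -> list:
    """Read at most max_lines lines without loading the whole file."""
    with open(path, encoding="utf-8", errors="replace") as fh:
        return [line.rstrip("\n") for line in islice(fh, max_lines)]
```

Quoting matters because characters such as `?` and `&` in a URL are interpreted by the shell when left bare. `read_head` plays the same role as `head` on the command line: it stops after a fixed number of lines, which keeps large scraped pages out of the context window.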

Documentation

Security recommendations
This skill is an instruction-only wrapper for the AnyCrawl CLI and appears internally consistent with that purpose. Before installing or using it:
  1. Verify the upstream npm package (anycrawl-cli) and its source/release provenance; prefer using npx if you don't want a global install.
  2. Be aware that the CLI fetches arbitrary third-party web content and will store an API key under your user config directory. Use a scoped, rotatable key and inspect where it is stored.
  3. The SKILL.md references ANYCRAWL_API_KEY but the skill metadata doesn't declare it; ask the author to declare required env vars explicitly.
  4. Follow the included security advice (write output to .anycrawl/, add it to .gitignore, and read files incrementally) to reduce prompt-injection and data exposure risk.
Functional analysis
Type: OpenClaw Skill
Name: anycrawl-cli
Version: 1.0.1
The anycrawl-cli skill bundle is a well-structured integration for a web scraping service. It follows security best practices by explicitly instructing the agent to quote URLs to prevent command injection and recommending file-based isolation and incremental reading to mitigate indirect prompt injection from untrusted web content (rules/security.md). The installation steps (rules/install.md) and tool permissions (SKILL.md) are standard for CLI utilities and are appropriately scoped to the 'anycrawl' command.
Capability assessment
Purpose & Capability
The name/description match the instructions: the skill expects the AnyCrawl CLI and explains search/scrape/map/crawl workflows. Nothing requested or documented is unrelated to web crawling/scraping.
Instruction Scope
SKILL.md only instructs use of the anycrawl CLI and contains safe-sounding handling guidance (write outputs to .anycrawl/, incremental reads, URL quoting, avoid returning fetched pages directly). It does not instruct reading unrelated system files or exfiltrating data.
Install Mechanism
There is no formal install spec (instruction-only), and install guidance points to the npm package anycrawl-cli and npx usage — a standard approach. Users should still verify the npm package and upstream docs before installing.
Credentials
SKILL.md and rules/install.md reference an ANYCRAWL_API_KEY and describe persistent credentials in ~/.config or %APPDATA%, but the skill metadata lists no required environment variables or primary credential. The skill should declare ANYCRAWL_API_KEY (or be explicit it is optional).
Persistence & Privilege
The skill is not forced-always, is user-invocable, and does not request persistent platform privileges or modify other skills' configs. Autonomous invocation is allowed by default but not unusual here.
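How to use
  1. Make sure OpenClaw is installed (locally or via Docker)
  2. Enter the install command in the chat box: /install anycrawl-cli
  3. Once installed, call the skill by name or trigger it with /anycrawl-cli
  4. Provide the inputs described in the skill's parameter documentation to get structured output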
Version history
v1.0.1
- Clarified and simplified workflow section, now titled "Commands".
- Improved explanations for command scenarios and when to use each command.
- Updated the command reference table for clarity.
- Added tips to avoid redundant fetches and directions to check existing data.
- Retained full prerequisite, output, and documentation sections.
v1.0.0
anycrawl-cli 1.0.0 initial release
- Introduces a CLI for web scraping, search, crawling, and site mapping using AnyCrawl.
- Supports commands for searching, scraping, mapping site URLs, and bulk crawling site sections.
- Outputs clean, LLM-optimized markdown for easy downstream processing.
- Requires installation and authentication via login or API key.
- Organizes results in the .anycrawl/ directory and provides guidance to avoid redundant fetches.
Metadata
Slug anycrawl-cli
Version 1.0.1
License MIT-0
Total installs 0
Current installs 0
Version count 2
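FAQ

What is Anycrawl?

Web scraping, search, crawling, and site mapping via the AnyCrawl CLI. Use when the user wants to search the web, scrape a page, find URLs on a site, or bulk... It is an AI Agent Skill plugin for Claude Code / OpenClaw, with 259 total downloads to date.

How do I install Anycrawl?

Run the command /install anycrawl-cli in the OpenClaw or Claude Code chat box to install the skill in one step. The skill itself needs no extra configuration, but the AnyCrawl CLI must be installed and authenticated (run anycrawl login or set ANYCRAWL_API_KEY).

Is Anycrawl free?

Yes. The Anycrawl skill is free and released under the MIT-0 license, so it can be downloaded, installed, and used freely.

Which platforms does Anycrawl support?

Anycrawl is cross-platform and can be used in any environment where OpenClaw / Claude Code is deployed.

Who develops Anycrawl?

It is developed and maintained by QThans (@qthans). The current version is v1.0.1.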
