← 返回 Skills 市场
mariokarras

Firecrawl Cli

作者 Mario Karras · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ⚠ suspicious
199
总下载
0
收藏
0
当前安装
1
版本数
在 OpenClaw 中安装
/install abm-firecrawl-cli
功能描述
When the user wants to scrape, crawl, or extract content from websites. Also use when the user mentions 'scrape site,' 'crawl website,' 'extract content,' 'w...
使用说明 (SKILL.md)

Firecrawl CLI

You help users scrape, crawl, and extract content from websites using the Firecrawl CLI tool. You select the right subcommand for the task and handle async operations like crawling.

Before Starting

Check for product marketing context first: If .agents/product-marketing-context.md exists (or .claude/product-marketing-context.md in older setups), read it before asking questions. Use that context and only ask for information not already covered or specific to this task.

Then determine:

  1. Target URL(s) — What site or page to extract content from
  2. What content to extract — Full page, specific data, all pages on a site
  3. Single page vs full site — One URL (scrape) or many pages (crawl)
  4. Search first? — Does the user need to find pages before extracting content

Command Reference

Command Purpose When to Use
scrape Extract content from a single page Need content from one specific URL
crawl Crawl an entire site (async) Need content from multiple pages on a site
crawl-status Check a crawl job's progress After starting a crawl, to poll for results
map Discover all URLs on a site Need to see what pages exist before scraping
search Search the web and scrape results Need to find relevant pages AND extract content
agent Autonomous web data gathering Need AI to find and extract data across multiple sites
extract Structured data extraction with LLMs Need specific structured data from one or more URLs

Workflow by Use Case

Scraping a Single Page

firecrawl.js scrape --url "https://example.com/page"

When: You need content from one specific page. Returns markdown by default.

Options:

  • --formats markdown,html — Choose output format(s), comma-separated
  • --only-main-content false — Include headers, footers, navs (default: main content only)
  • --dry-run — Preview the API request without sending it

Example — extract a blog post as markdown:

firecrawl.js scrape --url "https://example.com/blog/post-title"

Example — get full HTML including navigation:

firecrawl.js scrape --url "https://example.com" --formats html --only-main-content false

Crawling an Entire Site

firecrawl.js crawl --url "https://example.com" --limit 50

When: You need content from multiple pages across a site. This is an async operation — the CLI starts a crawl job and returns a job ID.

Options:

  • --limit N — Maximum number of pages to crawl
  • --max-depth N — Maximum link depth from the starting URL
  • --poll — Automatically poll for completion instead of returning the job ID
  • --dry-run — Preview the API request without sending it

Workflow without --poll:

  1. Start the crawl: firecrawl.js crawl --url "https://example.com" --limit 20
  2. Note the returned job ID
  3. Check status: firecrawl.js crawl-status --id "job-id-here"
  4. Repeat step 3 until complete

Workflow with --poll (recommended):

firecrawl.js crawl --url "https://example.com" --limit 20 --poll

The CLI handles polling automatically and returns all results when done.

Checking Crawl Status

firecrawl.js crawl-status --id "job-id-here"

When: You started a crawl without --poll and need to check if it finished.

Getting a Site Map

firecrawl.js map --url "https://example.com"

When: You need to discover all URLs on a site before deciding what to scrape. Good first step before targeted scraping.

Options:

  • --search "term" — Filter URLs containing a search term
  • --limit N — Maximum number of URLs to return
  • --dry-run — Preview the API request without sending it

Example — find all blog pages:

firecrawl.js map --url "https://example.com" --search "blog"

Searching and Scraping

firecrawl.js search --query "site:example.com pricing plans"

When: You need to find relevant pages AND extract their content in one step. Combines web search with content extraction.

Options:

  • --limit N — Number of results to return
  • --country "us" — Country code for localized results
  • --dry-run — Preview the API request without sending it

Example — find competitor pricing pages:

firecrawl.js search --query "competitor.com pricing" --limit 5

Autonomous Agent Extraction

firecrawl.js agent --prompt "Find pricing information for Acme Corp" --poll

When: You need the AI to autonomously search, navigate, and gather data. No URLs required -- just describe what you need.

Options:

  • --prompt "..." -- Describe what data to find (required, max 10000 chars)
  • --url "..." -- Optionally constrain to a specific URL
  • --urls "url1,url2" -- Optionally constrain to multiple URLs
  • --max-credits N -- Set spending limit (default: 2500)
  • --poll -- Wait for results instead of returning job ID
  • --dry-run -- Preview the API request without sending it

Example -- find competitor pricing:

firecrawl.js agent --prompt "Find pricing tiers and plans for competitor.com" --url "https://competitor.com" --poll

Extracting Structured Data

firecrawl.js extract --urls "https://example.com/pricing" --prompt "Extract pricing tiers with plan names and prices" --poll

When: You need specific structured data extracted from URLs using LLMs. Supports wildcards for domain-wide extraction.

Options:

  • --urls "url1,url2" -- URLs to extract from, comma-separated (required). Supports /* wildcards
  • --prompt "..." -- Describe what data to extract
  • --schema '{"type":"object",...}' -- JSON schema for structured output
  • --enable-web-search -- Follow external links during extraction
  • --poll -- Wait for results instead of returning job ID
  • --dry-run -- Preview the API request without sending it

Example -- extract product data from multiple pages:

firecrawl.js extract --urls "https://example.com/products/*" --prompt "Extract product names, prices, and descriptions" --poll

Choosing the Right Command

  • Need one page?scrape
  • Need the whole site?crawl with --limit and --poll
  • Need to discover URLs first?map, then scrape specific pages
  • Need search + content?search
  • Started a crawl and need results?crawl-status
  • Need AI to find data autonomously?agent with --poll
  • Need structured data from known URLs?extract with --prompt and --poll

Common multi-step workflows:

  1. Targeted extraction: map → review URLs → scrape specific pages
  2. Full site dump: crawl --poll --limit 100
  3. Research a topic: search → review results → scrape for deeper content

Output Format

Present extracted content with clear structure:

  • Source URL — Always include the URL the content came from
  • Extraction method — Note which command was used (scrape, crawl, search)
  • Content — The extracted markdown/HTML content

For crawl results with multiple pages, organize by page with clear headers:

### Page 1: [title] (https://example.com/page-1)
[content]

### Page 2: [title] (https://example.com/page-2)
[content]

Environment Setup

The Firecrawl CLI requires a FIRECRAWL_API_KEY environment variable. If the key is not set, the CLI will return an error. The user needs to:

  1. Get an API key from firecrawl.dev
  2. Set it in their environment: export FIRECRAWL_API_KEY="fc-..."

Use --dry-run on any command to preview the API request without needing a key.


Related Skills

  • exa-company-research — For web search and company research (search-focused, not scraping)
  • seo-audit — Uses Firecrawl for schema detection and technical SEO analysis
  • competitive-intelligence — Combines Exa search + Firecrawl scraping for competitive analysis (Phase 4)
安全使用建议
Before installing or using this skill, ask the publisher for: (1) an authoritative install method or package (where does firecrawl.js come from?), (2) the exact network endpoints the CLI talks to and where scraped data is stored or transmitted, (3) required credentials or config (e.g., FIRECRAWL_API_KEY or similar) and why those are needed, and (4) privacy/legal constraints for crawling target sites. Consider running the CLI only in a sandboxed environment and avoid granting it access to sensitive workspace files until you confirm it only needs a specific, limited context file. If you rely on the skill to read .agents/product-marketing-context.md (or similar), inspect that file for secrets first. If the author cannot explain the missing binary/install and missing auth requirements, treat the package as risky.
功能分析
Type: OpenClaw Skill Name: abm-firecrawl-cli Version: 1.0.0 The skill bundle provides instructions for an AI agent to interface with the Firecrawl CLI tool (firecrawl.js) for web scraping and data extraction. The SKILL.md file outlines standard usage for scraping, crawling, and structured data extraction, requiring a legitimate API key from firecrawl.dev. No indicators of malicious intent, data exfiltration of sensitive local files, or unauthorized command execution were found; the behavior is entirely consistent with the stated purpose of web research and content extraction.
能力评估
Purpose & Capability
The SKILL.md explicitly instructs the agent to run a Firecrawl CLI (firecrawl.js) to perform scrapes, crawls, agent-driven extraction, and async jobs. However, the skill metadata lists no required binaries, no install spec, and no required environment variables/credentials. A CLI that starts remote crawl jobs typically requires a binary and an API key or config; that expected linkage is missing and therefore incoherent.
Instruction Scope
Runtime instructions direct the agent to read workspace files if present (e.g., .agents/product-marketing-context.md or .claude/product-marketing-context.md) before asking questions. That lets the skill read arbitrary repository/agent-local files, which can include sensitive information. The SKILL.md also references starting async jobs, polling, and previewing API requests (dry-run) but does not state what remote endpoints are used, where scraped data is stored, or whether data is transmitted off-host.
Install Mechanism
This is an instruction-only skill with no install spec and no code files, which minimizes direct disk writes from the skill bundle itself. However, the instructions expect firecrawl.js to exist in the environment; the absence of an install mechanism or declared required binary is an inconsistency (it is unclear how the CLI will be provided). That ambiguity could lead to ad-hoc fetching/execution behavior by the agent or user.
Credentials
No environment variables or primary credential are declared, yet the CLI semantics (async jobs, --max-credits, polling, agent autonomous gathering) strongly imply interaction with a remote service that would normally require authentication and possibly billing credentials. The skill also instructs reading local context files, increasing access to potential secrets. The lack of declared credentials is disproportionate to the implied capabilities.
Persistence & Privilege
The skill is not always-enabled (always: false) and does not request elevated platform privileges in the metadata. SKILL.md does not instruct modifying other skills or global agent settings. Autonomous invocation is allowed by default, which is normal for skills; this is not sufficient alone to escalate concern.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install abm-firecrawl-cli
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /abm-firecrawl-cli 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.0
Initial release of firecrawl-cli skill. - Supports all core Firecrawl CLI commands: scrape, crawl, crawl-status, map, search, agent, and extract. - Clearly explains command selection, options, and step-by-step workflows for scraping, crawling, sitemap discovery, structured data extraction, and autonomous web data gathering. - Provides usage examples and output formatting guidance. - Includes context-checking instructions for product marketing scenarios.
元数据
Slug abm-firecrawl-cli
版本 1.0.0
许可证 MIT-0
累计安装 1
当前安装数 0
历史版本数 1
常见问题

Firecrawl Cli 是什么?

When the user wants to scrape, crawl, or extract content from websites. Also use when the user mentions 'scrape site,' 'crawl website,' 'extract content,' 'w... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 199 次。

如何安装 Firecrawl Cli?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install abm-firecrawl-cli」即可一键安装,无需额外配置。

Firecrawl Cli 是免费的吗?

是的,Firecrawl Cli 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。

Firecrawl Cli 支持哪些平台?

Firecrawl Cli 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 Firecrawl Cli?

由 Mario Karras(@mariokarras)开发并维护,当前版本 v1.0.0。

💬 留言讨论