Firecrawl Local
/install firecrawl-local
Firecrawl Local Skill
Self-hosted Firecrawl integration using the v1 REST API. It tests connectivity first, then executes scrape, crawl, or map requests, and polls asynchronous crawls automatically.
Setup (one-time)
mkdir -p ~/.openclaw/skills/firecrawl-local
cp scripts/run.sh ~/.openclaw/skills/firecrawl-local/run.sh
chmod +x ~/.openclaw/skills/firecrawl-local/run.sh
Run these commands from this skill folder, where the script lives at scripts/run.sh.
Prerequisites: curl and jq installed, and Firecrawl running at localhost:3002 (or at the URL set in FIRECRAWL_LOCAL_URL).
Optional env vars:
export FIRECRAWL_LOCAL_URL="http://localhost:3002" # default
export FIRECRAWL_API_KEY="fc-your-key" # only needed if auth enabled
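As a rough sketch of how a wrapper script might resolve these two variables, the base URL can fall back to the documented default and an Authorization header can be produced only when a key is set. The function names are hypothetical and not taken from run.sh, and the Bearer scheme is an assumption about how the instance expects the key.

```shell
# Hypothetical helpers; names are illustrative, not taken from run.sh.

# Resolve the base URL, falling back to the documented default.
firecrawl_base_url() {
  printf '%s\n' "${FIRECRAWL_LOCAL_URL:-http://localhost:3002}"
}

# Print an Authorization header line only when a key is configured.
# Assumption: the instance expects "Authorization: Bearer <key>".
firecrawl_auth_header() {
  if [ -n "${FIRECRAWL_API_KEY:-}" ]; then
    printf 'Authorization: Bearer %s\n' "$FIRECRAWL_API_KEY"
  fi
}
```

With neither variable exported, firecrawl_base_url prints http://localhost:3002 and firecrawl_auth_header prints nothing, so no auth header would be attached.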
Commands
Default — scrape a single page (URL only, no subcommand needed)
firecrawl-local https://docs.example.com/api
Scrape — explicit, with format options
firecrawl-local scrape https://docs.example.com/api
firecrawl-local scrape https://docs.example.com/api --formats markdown,html
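For orientation, a scrape call like the ones above plausibly reduces to one JSON POST. The sketch below builds the request body with jq; build_scrape_payload is a hypothetical name, and the /v1/scrape path, the url and formats fields, and the markdown default are assumptions based on the Firecrawl v1 REST API rather than on the contents of run.sh.

```shell
# Build the JSON body for a scrape request.
# $1: page URL, $2: comma-separated formats (assumed default: markdown)
build_scrape_payload() {
  local url="$1" formats="${2:-markdown}"
  jq -cn --arg url "$url" --arg formats "$formats" \
    '{url: $url, formats: ($formats | split(","))}'
}

# Possible use against a local instance (assumed endpoint path):
#   curl -sS -X POST "${FIRECRAWL_LOCAL_URL:-http://localhost:3002}/v1/scrape" \
#     -H 'Content-Type: application/json' \
#     -d "$(build_scrape_payload 'https://docs.example.com/api' 'markdown,html')"
```

Passing values through --arg means quotes or other special characters in the URL are escaped by jq instead of being spliced into the JSON by the shell.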
Map — discover all URLs on a site
firecrawl-local map https://docs.example.com
firecrawl-local map https://docs.example.com --limit 200
Crawl — bulk extract multiple pages (async, auto-polled)
firecrawl-local crawl https://docs.example.com
firecrawl-local crawl https://docs.example.com --limit 30 --max-depth 2
firecrawl-local crawl https://docs.example.com --include /docs --exclude /blog
Agent Instructions
When to use each command
| Goal | Command |
|---|---|
| Get content from one URL (quickest) | firecrawl-local <url> |
| Discover what pages exist | map |
| Get content from one URL with format control | scrape |
| Ingest an entire docs site | crawl |
| RAG pipeline ingestion | map → targeted scrape or crawl |
Optimal workflows
Documentation RAG pipeline:
1. map https://docs.example.com → get full URL list
2. scrape <specific key pages> → targeted extraction
3. Pass markdown to embedding pipeline
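One way to connect steps 1 and 2 is to filter the map output down to the pages worth scraping. The sketch below assumes that firecrawl-local prints the raw JSON response on stdout and that links is an array of URL strings; filter_map_links is a hypothetical helper.

```shell
# Read a map response on stdin and print only the links under a URL prefix.
filter_map_links() {
  local prefix="$1"
  jq -r --arg p "$prefix" '.links[] | select(startswith($p))'
}

# Possible pipeline (assumes firecrawl-local is on PATH and emits JSON):
#   firecrawl-local map https://docs.example.com \
#     | filter_map_links 'https://docs.example.com/api' \
#     | while read -r page; do firecrawl-local scrape "$page"; done
```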
Full site ingestion:
1. crawl https://docs.example.com --limit 50 --max-depth 3
2. Results are auto-polled and returned as a JSON array of {url, markdown} objects
Parameters
| Flag | Applies to | Description |
|---|---|---|
| --limit N | map, crawl | Max pages (default: 50 for crawl, 500 for map) |
| --max-depth N | crawl | How deep to follow links (default: 2) |
| --include /path | crawl | Only crawl URLs matching this path prefix |
| --exclude /path | crawl | Skip URLs matching this path prefix |
| --formats list | scrape | Comma-separated: markdown, html, rawHtml, links |
Reading the output
- scrape: Returns {success, data: {markdown, html, metadata}}
- map: Returns {success, links: [...]}
- crawl: Returns {success, data: [{url, markdown, metadata}, ...]} after polling completes
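Given the response shapes listed above, jq can pull out the fields an agent usually needs. The helpers below are hypothetical and rely only on those shapes; they read a response on stdin.

```shell
# Print the markdown body of a scrape response.
scrape_markdown() {
  jq -r '.data.markdown // empty'
}

# Print one "url<TAB>markdown length" line per page of a crawl response.
# Pages without a markdown field are reported with length 0.
crawl_summary() {
  jq -r '.data[] | [.url, (.markdown // "" | length)] | @tsv'
}

# Example: firecrawl-local scrape https://docs.example.com/api | scrape_markdown
```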
Failure signals and fixes
| Error | Cause | Fix |
|---|---|---|
| Local Firecrawl unavailable | Service not running | Start Firecrawl, check port 3002 |
| success: false | Bad URL or blocked | Check URL is reachable, try --formats html |
| Empty markdown field | JS-rendered page | Firecrawl handles most JS; check if site blocks bots |
| Crawl times out | Site is large | Reduce --limit or --max-depth |
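Two of these signals, success: false and an empty markdown field, can be detected mechanically before the output is passed on. The following is a minimal sketch with a hypothetical helper name; it assumes the scrape response shape described in this document.

```shell
# Classify a scrape response read on stdin.
# Prints ok, failed, or empty-markdown; returns non-zero unless ok.
classify_scrape_response() {
  local body
  body="$(cat)"
  # Anything other than success == true (including invalid JSON) counts as failed.
  if ! printf '%s' "$body" | jq -e '.success == true' >/dev/null 2>&1; then
    echo "failed"
    return 1
  fi
  if printf '%s' "$body" | jq -e '(.data.markdown // "") == ""' >/dev/null; then
    echo "empty-markdown"
    return 2
  fi
  echo "ok"
}
```

A caller could retry with --formats html on failed, or flag the page for manual review on empty-markdown.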
Script reference
See scripts/run.sh for the full implementation. Key design decisions:
- Health check uses the /health endpoint with a 3s timeout
- Auth header is only sent when FIRECRAWL_API_KEY is set
- Crawl polling retries every 5s, up to 60 attempts (5 minutes)
- All parameters are passed via jq to prevent shell injection in JSON
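The health check and polling decisions above can be sketched as follows. This is not the code from run.sh: the function names are hypothetical, the status fetcher is passed in as a command name so the loop can be exercised without a server, and the completed and failed status values are assumptions based on the Firecrawl v1 crawl status response.

```shell
# Return 0 when the instance answers on /health within 3 seconds.
firecrawl_healthy() {
  curl -fsS --max-time 3 -o /dev/null \
    "${FIRECRAWL_LOCAL_URL:-http://localhost:3002}/health"
}

# Poll a crawl job until it finishes.
# $1: command that prints the job status JSON (for example a curl wrapper)
# $2: max attempts (default 60), $3: seconds between attempts (default 5)
# Returns 0 and prints the final JSON on completion, 1 on failure, 2 on timeout.
poll_crawl() {
  local fetch="$1" max_attempts="${2:-60}" interval="${3:-5}"
  local attempt=1 body status
  while [ "$attempt" -le "$max_attempts" ]; do
    body="$("$fetch")" || return 1
    status="$(printf '%s' "$body" | jq -r '.status // empty')"
    case "$status" in
      completed) printf '%s\n' "$body"; return 0 ;;
      failed) echo "crawl failed" >&2; return 1 ;;
    esac
    sleep "$interval"
    attempt=$((attempt + 1))
  done
  echo "crawl did not finish after $max_attempts attempts" >&2
  return 2
}
```

With the defaults, the loop gives up after 60 attempts spaced 5 seconds apart, which matches the 5 minute ceiling stated above.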
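1. Make sure OpenClaw is installed (local or Docker deployment)
2. Enter the install command in the chat box: /install firecrawl-local
3. Once installed, invoke the Skill by name or trigger it with /firecrawl-local
4. Provide the required inputs described in the Skill's parameters to get structured output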
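What is Firecrawl Local?
Use this skill whenever you need to scrape web pages, crawl websites, or map site structure using a self-hosted Firecrawl instance. Triggers on requests to e... It is an AI Agent Skill plugin for Claude Code / OpenClaw, with 157 downloads to date.
How do I install Firecrawl Local?
Run the command "/install firecrawl-local" in the OpenClaw or Claude Code chat box to install it in one step; no additional configuration is required.
Is Firecrawl Local free?
Yes. Firecrawl Local is completely free and released under the MIT-0 license, so it can be downloaded, installed, and used freely.
Which platforms does Firecrawl Local support?
Firecrawl Local is cross-platform and can be used in any environment where OpenClaw / Claude Code is deployed.
Who develops Firecrawl Local?
It is developed and maintained by SaddamTechie (@saddamtechie). The current version is v1.0.0.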