← 返回 Skills 市场

Firecrawl Scrape

Name: Firecrawl Scrape
Author: eohmig

作者 eohmig · GitHub ↗ · v1.0.0 · MIT-0

cross-platform ✓ 安全检测通过

总下载

当前安装

版本数

在 OpenClaw 中安装

/install firecrawl-scrape

功能描述

Extract clean markdown from any URL, including JavaScript-rendered SPAs. Use this skill whenever the user provides a URL and wants its content, says "scrape"...

使用说明 (SKILL.md)

firecrawl scrape

Scrape one or more URLs. Returns clean, LLM-optimized markdown. Multiple URLs are scraped concurrently.

When to use

You have a specific URL and want its content
The page is static or JS-rendered (SPA)
Step 2 in the workflow escalation pattern: search → scrape → map → crawl → interact

Quick start

# Basic markdown extraction
firecrawl scrape "\x3Curl>" -o .firecrawl/page.md

# Main content only, no nav/footer
firecrawl scrape "\x3Curl>" --only-main-content -o .firecrawl/page.md

# Wait for JS to render, then scrape
firecrawl scrape "\x3Curl>" --wait-for 3000 -o .firecrawl/page.md

# Multiple URLs (each saved to .firecrawl/)
firecrawl scrape https://example.com https://example.com/blog https://example.com/docs

# Get markdown and links together
firecrawl scrape "\x3Curl>" --format markdown,links -o .firecrawl/page.json

# Ask a question about the page
firecrawl scrape "https://example.com/pricing" --query "What is the enterprise plan price?"

Options

Option	Description
`-f, --format \x3Cformats>`	Output formats: markdown, html, rawHtml, links, screenshot, json
`-Q, --query \x3Cprompt>`	Ask a question about the page content (5 credits)
`-H`	Include HTTP headers in output
`--only-main-content`	Strip nav, footer, sidebar — main content only
`--wait-for \x3Cms>`	Wait for JS rendering before scraping
`--include-tags \x3Ctags>`	Only include these HTML tags
`--exclude-tags \x3Ctags>`	Exclude these HTML tags
`-o, --output \x3Cpath>`	Output file path

Tips

Prefer plain scrape over --query. Scrape to a file, then use grep, head, or read the markdown directly — you can search and reason over the full content yourself. Use --query only when you want a single targeted answer without saving the page (costs 5 extra credits).
Try scrape before interact. Scrape handles static pages and JS-rendered SPAs. Only escalate to interact when you need interaction (clicks, form fills, pagination).
Multiple URLs are scraped concurrently — check firecrawl --status for your concurrency limit.
Single format outputs raw content. Multiple formats (e.g., --format markdown,links) output JSON.
Always quote URLs — shell interprets ? and & as special characters.
Naming convention: .firecrawl/{site}-{path}.md

Initial release of firecrawl-scrape skill. - Extracts clean, LLM-optimized markdown from any URL, including JavaScript-rendered SPAs. - Supports multiple concurrent URL scraping. - Handles static and dynamic web pages; use instead of WebFetch for scraping. - Offers flexible output options (markdown, HTML, raw HTML, links, screenshot, JSON). - Includes advanced features such as main content extraction, custom wait times, tag filters, and inline querying. - Designed for easy integration into content workflows and automation scripts.

元数据

Slug firecrawl-scrape

版本 1.0.0

许可证 MIT-0

累计安装 0

当前安装数 0

历史版本数 1

常见问题