← Back to Skills Marketplace
eohmig

Firecrawl Scrape

by eohmig · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ✓ Security Clean
44
Downloads
0
Stars
0
Active Installs
1
Versions
Install in OpenClaw
/install firecrawl-scrape
Description
Extract clean markdown from any URL, including JavaScript-rendered SPAs. Use this skill whenever the user provides a URL and wants its content, says "scrape"...
README (SKILL.md)

firecrawl scrape

Scrape one or more URLs. Returns clean, LLM-optimized markdown. Multiple URLs are scraped concurrently.

When to use

  • You have a specific URL and want its content
  • The page is static or JS-rendered (SPA)
  • Step 2 in the workflow escalation pattern: search → scrape → map → crawl → interact

Quick start

# Basic markdown extraction
firecrawl scrape "\x3Curl>" -o .firecrawl/page.md

# Main content only, no nav/footer
firecrawl scrape "\x3Curl>" --only-main-content -o .firecrawl/page.md

# Wait for JS to render, then scrape
firecrawl scrape "\x3Curl>" --wait-for 3000 -o .firecrawl/page.md

# Multiple URLs (each saved to .firecrawl/)
firecrawl scrape https://example.com https://example.com/blog https://example.com/docs

# Get markdown and links together
firecrawl scrape "\x3Curl>" --format markdown,links -o .firecrawl/page.json

# Ask a question about the page
firecrawl scrape "https://example.com/pricing" --query "What is the enterprise plan price?"

Options

Option Description
-f, --format \x3Cformats> Output formats: markdown, html, rawHtml, links, screenshot, json
-Q, --query \x3Cprompt> Ask a question about the page content (5 credits)
-H Include HTTP headers in output
--only-main-content Strip nav, footer, sidebar — main content only
--wait-for \x3Cms> Wait for JS rendering before scraping
--include-tags \x3Ctags> Only include these HTML tags
--exclude-tags \x3Ctags> Exclude these HTML tags
-o, --output \x3Cpath> Output file path

Tips

  • Prefer plain scrape over --query. Scrape to a file, then use grep, head, or read the markdown directly — you can search and reason over the full content yourself. Use --query only when you want a single targeted answer without saving the page (costs 5 extra credits).
  • Try scrape before interact. Scrape handles static pages and JS-rendered SPAs. Only escalate to interact when you need interaction (clicks, form fills, pagination).
  • Multiple URLs are scraped concurrently — check firecrawl --status for your concurrency limit.
  • Single format outputs raw content. Multiple formats (e.g., --format markdown,links) output JSON.
  • Always quote URLs — shell interprets ? and & as special characters.
  • Naming convention: .firecrawl/{site}-{path}.md

See also

Usage Guidance
Install this if you want an agent to use Firecrawl for webpage extraction. Be careful with URLs that include secrets, private documents, internal hosts, or sensitive query parameters, because requested pages may be processed by Firecrawl and saved into local .firecrawl files.
Capability Assessment
Purpose & Capability
The stated purpose is to extract clean markdown from URLs, including JavaScript-rendered pages, and its allowed tools are limited to Firecrawl CLI invocations.
Instruction Scope
The trigger language is broad and tells agents to prefer this over WebFetch for webpage extraction, which could route ordinary URL-reading requests through Firecrawl more often than users expect.
Install Mechanism
The artifact contains only SKILL.md and declares no installer scripts or bundled executable code; it invokes an existing firecrawl command or npx firecrawl.
Credentials
Network access to Firecrawl and fetching user-supplied URLs are proportionate for a scraping skill, but users should avoid sensitive, private, or token-bearing URLs unless they intend third-party processing.
Persistence & Privilege
Examples write scraped content under .firecrawl, such as .firecrawl/page.md or .firecrawl/page.json; this is disclosed in the usage examples and fits the workflow.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install firecrawl-scrape
  3. After installation, invoke the skill by name or use /firecrawl-scrape
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
Initial release of firecrawl-scrape skill. - Extracts clean, LLM-optimized markdown from any URL, including JavaScript-rendered SPAs. - Supports multiple concurrent URL scraping. - Handles static and dynamic web pages; use instead of WebFetch for scraping. - Offers flexible output options (markdown, HTML, raw HTML, links, screenshot, JSON). - Includes advanced features such as main content extraction, custom wait times, tag filters, and inline querying. - Designed for easy integration into content workflows and automation scripts.
Metadata
Slug firecrawl-scrape
Version 1.0.0
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 1
Frequently Asked Questions

What is Firecrawl Scrape?

Extract clean markdown from any URL, including JavaScript-rendered SPAs. Use this skill whenever the user provides a URL and wants its content, says "scrape"... It is an AI Agent Skill for Claude Code / OpenClaw, with 44 downloads so far.

How do I install Firecrawl Scrape?

Run "/install firecrawl-scrape" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Firecrawl Scrape free?

Yes, Firecrawl Scrape is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Firecrawl Scrape support?

Firecrawl Scrape is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Firecrawl Scrape?

It is built and maintained by eohmig (@eohmig); the current version is v1.0.0.

💬 Comments