← 返回 Skills 市场
eohmig

Firecrawl Scrape

作者 eohmig · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ✓ 安全检测通过
44
总下载
0
收藏
0
当前安装
1
版本数
在 OpenClaw 中安装
/install firecrawl-scrape
功能描述
Extract clean markdown from any URL, including JavaScript-rendered SPAs. Use this skill whenever the user provides a URL and wants its content, says "scrape"...
使用说明 (SKILL.md)

firecrawl scrape

Scrape one or more URLs. Returns clean, LLM-optimized markdown. Multiple URLs are scraped concurrently.

When to use

  • You have a specific URL and want its content
  • The page is static or JS-rendered (SPA)
  • Step 2 in the workflow escalation pattern: search → scrape → map → crawl → interact

Quick start

# Basic markdown extraction
firecrawl scrape "\x3Curl>" -o .firecrawl/page.md

# Main content only, no nav/footer
firecrawl scrape "\x3Curl>" --only-main-content -o .firecrawl/page.md

# Wait for JS to render, then scrape
firecrawl scrape "\x3Curl>" --wait-for 3000 -o .firecrawl/page.md

# Multiple URLs (each saved to .firecrawl/)
firecrawl scrape https://example.com https://example.com/blog https://example.com/docs

# Get markdown and links together
firecrawl scrape "\x3Curl>" --format markdown,links -o .firecrawl/page.json

# Ask a question about the page
firecrawl scrape "https://example.com/pricing" --query "What is the enterprise plan price?"

Options

Option Description
-f, --format \x3Cformats> Output formats: markdown, html, rawHtml, links, screenshot, json
-Q, --query \x3Cprompt> Ask a question about the page content (5 credits)
-H Include HTTP headers in output
--only-main-content Strip nav, footer, sidebar — main content only
--wait-for \x3Cms> Wait for JS rendering before scraping
--include-tags \x3Ctags> Only include these HTML tags
--exclude-tags \x3Ctags> Exclude these HTML tags
-o, --output \x3Cpath> Output file path

Tips

  • Prefer plain scrape over --query. Scrape to a file, then use grep, head, or read the markdown directly — you can search and reason over the full content yourself. Use --query only when you want a single targeted answer without saving the page (costs 5 extra credits).
  • Try scrape before interact. Scrape handles static pages and JS-rendered SPAs. Only escalate to interact when you need interaction (clicks, form fills, pagination).
  • Multiple URLs are scraped concurrently — check firecrawl --status for your concurrency limit.
  • Single format outputs raw content. Multiple formats (e.g., --format markdown,links) output JSON.
  • Always quote URLs — shell interprets ? and & as special characters.
  • Naming convention: .firecrawl/{site}-{path}.md

See also

安全使用建议
Install this if you want an agent to use Firecrawl for webpage extraction. Be careful with URLs that include secrets, private documents, internal hosts, or sensitive query parameters, because requested pages may be processed by Firecrawl and saved into local .firecrawl files.
能力评估
Purpose & Capability
The stated purpose is to extract clean markdown from URLs, including JavaScript-rendered pages, and its allowed tools are limited to Firecrawl CLI invocations.
Instruction Scope
The trigger language is broad and tells agents to prefer this over WebFetch for webpage extraction, which could route ordinary URL-reading requests through Firecrawl more often than users expect.
Install Mechanism
The artifact contains only SKILL.md and declares no installer scripts or bundled executable code; it invokes an existing firecrawl command or npx firecrawl.
Credentials
Network access to Firecrawl and fetching user-supplied URLs are proportionate for a scraping skill, but users should avoid sensitive, private, or token-bearing URLs unless they intend third-party processing.
Persistence & Privilege
Examples write scraped content under .firecrawl, such as .firecrawl/page.md or .firecrawl/page.json; this is disclosed in the usage examples and fits the workflow.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install firecrawl-scrape
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /firecrawl-scrape 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.0
Initial release of firecrawl-scrape skill. - Extracts clean, LLM-optimized markdown from any URL, including JavaScript-rendered SPAs. - Supports multiple concurrent URL scraping. - Handles static and dynamic web pages; use instead of WebFetch for scraping. - Offers flexible output options (markdown, HTML, raw HTML, links, screenshot, JSON). - Includes advanced features such as main content extraction, custom wait times, tag filters, and inline querying. - Designed for easy integration into content workflows and automation scripts.
元数据
Slug firecrawl-scrape
版本 1.0.0
许可证 MIT-0
累计安装 0
当前安装数 0
历史版本数 1
常见问题

Firecrawl Scrape 是什么?

Extract clean markdown from any URL, including JavaScript-rendered SPAs. Use this skill whenever the user provides a URL and wants its content, says "scrape"... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 44 次。

如何安装 Firecrawl Scrape?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install firecrawl-scrape」即可一键安装,无需额外配置。

Firecrawl Scrape 是免费的吗?

是的,Firecrawl Scrape 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。

Firecrawl Scrape 支持哪些平台?

Firecrawl Scrape 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 Firecrawl Scrape?

由 eohmig(@eohmig)开发并维护,当前版本 v1.0.0。

💬 留言讨论