Total downloads: 44
Favorites: 0
Current installs: 0
Versions: 1
Install in OpenClaw
/install firecrawl-scrape
Description
Extract clean markdown from any URL, including JavaScript-rendered SPAs. Use this skill whenever the user provides a URL and wants its content, says "scrape"...
Usage (SKILL.md)
firecrawl scrape
Scrape one or more URLs. Returns clean, LLM-optimized markdown. Multiple URLs are scraped concurrently.
When to use
- You have a specific URL and want its content
- The page is static or JS-rendered (SPA)
- Step 2 in the workflow escalation pattern: search → scrape → map → crawl → interact
Quick start
# Basic markdown extraction
firecrawl scrape "<url>" -o .firecrawl/page.md
# Main content only, no nav/footer
firecrawl scrape "<url>" --only-main-content -o .firecrawl/page.md
# Wait for JS to render, then scrape
firecrawl scrape "<url>" --wait-for 3000 -o .firecrawl/page.md
# Multiple URLs (each saved to .firecrawl/)
firecrawl scrape https://example.com https://example.com/blog https://example.com/docs
# Get markdown and links together
firecrawl scrape "<url>" --format markdown,links -o .firecrawl/page.json
# Ask a question about the page
firecrawl scrape "https://example.com/pricing" --query "What is the enterprise plan price?"
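Once a page has been saved as in the quick start above, it can be searched locally without a further request. The following is a minimal sketch, not part of the skill itself: `search_page` is a hypothetical helper name, and it assumes the page was already written to a file with `-o`.

```shell
# Hypothetical helper (not part of firecrawl): search a page that was
# already saved, e.g. with:
#   firecrawl scrape "<url>" -o .firecrawl/page.md

# search_page FILE PATTERN
# Prints matching lines with line numbers, ignoring case.
# Returns 2 if the file is missing, 1 if nothing matches.
search_page() {
    file=$1
    pattern=$2
    if [ ! -f "$file" ]; then
        echo "no such file: $file" >&2
        return 2
    fi
    grep -n -i -- "$pattern" "$file"
}
```

For example, `search_page .firecrawl/page.md "enterprise"` lists every line that mentions the term, and `head -n 20 .firecrawl/page.md` gives a quick look at the top of the page.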
Options
| Option | Description |
|---|---|
| -f, --format <formats> | Output formats: markdown, html, rawHtml, links, screenshot, json |
| -Q, --query <prompt> | Ask a question about the page content (5 credits) |
| -H | Include HTTP headers in output |
| --only-main-content | Strip nav, footer, and sidebar; keep main content only |
| --wait-for <ms> | Wait for JS rendering before scraping |
| --include-tags <tags> | Only include these HTML tags |
| --exclude-tags <tags> | Exclude these HTML tags |
| -o, --output <path> | Output file path |
Tips
- Prefer plain scrape over --query. Scrape to a file, then use grep, head, or read the markdown directly; you can search and reason over the full content yourself. Use --query only when you want a single targeted answer without saving the page (costs 5 extra credits).
- Try scrape before interact. Scrape handles static pages and JS-rendered SPAs. Only escalate to interact when you need interaction (clicks, form fills, pagination).
- Multiple URLs are scraped concurrently; check firecrawl --status for your concurrency limit.
- Single format outputs raw content. Multiple formats (e.g., --format markdown,links) output JSON.
- Always quote URLs: the shell interprets ? and & as special characters.
- Naming convention: .firecrawl/{site}-{path}.md
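The naming convention and the quoting tip can be combined in a small helper. This is a sketch under stated assumptions: `firecrawl_out_path` is a hypothetical name, and the slug rules (drop scheme, query, and fragment; turn "/" into "-"; use "index" for an empty path) are one guess at how {site} and {path} might be filled in, not documented behaviour.

```shell
# firecrawl_out_path URL
# Prints .firecrawl/{site}-{path}.md for the given URL.
# Hypothetical helper; the slug rules are an assumption.
firecrawl_out_path() {
    url=$1
    rest=${url#*://}        # strip the scheme
    rest=${rest%%\#*}       # strip any fragment
    rest=${rest%%\?*}       # strip any query string
    site=${rest%%/*}        # host part
    case $rest in
        */*) path=${rest#*/} ;;
        *)   path= ;;
    esac
    path=${path%/}          # drop a trailing slash
    [ -n "$path" ] || path=index
    path=$(printf '%s' "$path" | tr '/' '-')
    printf '.firecrawl/%s-%s.md\n' "$site" "$path"
}

# Usage (the URL stays quoted so ? and & reach firecrawl intact):
#   url="https://example.com/docs/api?x=1&y=2"
#   firecrawl scrape "$url" -o "$(firecrawl_out_path "$url")"
```

Keeping the URL in a quoted variable means the shell never sees `?` or `&` as glob or background operators.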
See also
- firecrawl-search: find pages when you don't have a URL
- firecrawl-interact: when scrape can't get the content, use interact to click, fill forms, etc.
- firecrawl-download: bulk download an entire site to local files
Security recommendations
Install this if you want an agent to use Firecrawl for webpage extraction. Be careful with URLs that include secrets, private documents, internal hosts, or sensitive query parameters, because requested pages may be processed by Firecrawl and saved into local .firecrawl files.
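One way to act on this advice is to screen a URL before handing it to the scraper. The sketch below is illustrative only: `url_looks_sensitive` is a hypothetical helper, and both the host patterns and the parameter names are assumptions that are far from exhaustive.

```shell
# url_looks_sensitive URL
# Returns 0 (true) if the URL matches a few risky patterns, 1 otherwise.
# Hypothetical helper; the pattern lists are illustrative assumptions.
url_looks_sensitive() {
    # Lowercase first so the patterns match regardless of case.
    lc=$(printf '%s' "$1" | tr '[:upper:]' '[:lower:]')

    # Loopback, common private ranges, or an .internal host.
    case $lc in
        *://localhost*|*://127.*|*://10.*|*://192.168.*|*.internal|*.internal/*)
            return 0 ;;
    esac

    # Query parameters that often carry secrets.
    for name in token api_key apikey password secret signature; do
        case $lc in
            *"?$name="*|*"&$name="*) return 0 ;;
        esac
    done

    return 1
}
```

A caller might wrap the scrape in a check such as `if url_looks_sensitive "$url"; then echo "refusing: $url" >&2; fi` and only proceed otherwise.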
Capability assessment
Purpose & Capability
The stated purpose is to extract clean markdown from URLs, including JavaScript-rendered pages, and its allowed tools are limited to Firecrawl CLI invocations.
Instruction Scope
The trigger language is broad and tells agents to prefer this over WebFetch for webpage extraction, which could route ordinary URL-reading requests through Firecrawl more often than users expect.
Install Mechanism
The artifact contains only SKILL.md and declares no installer scripts or bundled executable code; it invokes an existing firecrawl command or npx firecrawl.
Credentials
Network access to Firecrawl and fetching user-supplied URLs are proportionate for a scraping skill, but users should avoid sensitive, private, or token-bearing URLs unless they intend third-party processing.
Persistence & Privilege
Examples write scraped content under .firecrawl, such as .firecrawl/page.md or .firecrawl/page.json; this is disclosed in the usage examples and fits the workflow.
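How to use
- Make sure OpenClaw is installed (locally or via Docker)
- Enter the install command in the chat box: /install firecrawl-scrape
- Once installed, call the Skill by name or trigger it with /firecrawl-scrape
- Provide the required inputs described in the Skill's parameter documentation to get structured output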
Version history
v1.0.0
Initial release of firecrawl-scrape skill.
- Extracts clean, LLM-optimized markdown from any URL, including JavaScript-rendered SPAs.
- Supports multiple concurrent URL scraping.
- Handles static and dynamic web pages; use instead of WebFetch for scraping.
- Offers flexible output options (markdown, HTML, raw HTML, links, screenshot, JSON).
- Includes advanced features such as main content extraction, custom wait times, tag filters, and inline querying.
- Designed for easy integration into content workflows and automation scripts.
Metadata
FAQ
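What is Firecrawl Scrape?
Extract clean markdown from any URL, including JavaScript-rendered SPAs. Use this skill whenever the user provides a URL and wants its content, says "scrape"... It is an AI Agent Skill plugin for Claude Code / OpenClaw, with 44 total downloads to date.
How do I install Firecrawl Scrape?
Run the command "/install firecrawl-scrape" in the OpenClaw or Claude Code chat box for a one-step install; no additional configuration is required.
Is Firecrawl Scrape free?
Yes. Firecrawl Scrape is completely free and released under the MIT-0 license; you can download, install, and use it freely.
Which platforms does Firecrawl Scrape support?
Firecrawl Scrape is cross-platform and can be used in any environment where OpenClaw / Claude Code is deployed.
Who developed Firecrawl Scrape?
It is developed and maintained by eohmig (@eohmig); the current version is v1.0.0.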