/install find-emails
Find Emails
CLI for crawling websites locally via crawl4ai and extracting contact emails from pages likely to contain them (contact, about, support, team, etc.).
Setup
- Install dependencies:
pip install crawl4ai - Run the script:
python scripts/find_emails.py https://example.com
Quick Start
t
# Crawl a site
python scripts/find_emails.py https://example.com
# Multiple URLs
python scripts/find_emails.py https://example.com https://other.com
# JSON output
python scripts/find_emails.py https://example.com -j
# Save to file
python scripts/find_emails.py https://example.com -o emails.txt
Script
find_emails.py — Crawl and Extract Emails
python scripts/find_emails.py \x3Curl> [url ...]
python scripts/find_emails.py https://example.com
python scripts/find_emails.py https://example.com -j -o results.json
python scripts/find_emails.py --from-file page.md
Arguments:
| Argument | Description |
|---|---|
urls |
One or more URLs to crawl (positional) |
-o, --output |
Write results to file |
-j, --json |
JSON output ({"emails": {"email": ["path", ...]}}) |
-q, --quiet |
Minimal output (no header, just email lines) |
--max-depth |
Max crawl depth (default: 2) |
--max-pages |
Max pages to crawl (default: 25) |
--from-file |
Extract from local markdown file (skip crawl) |
-v, --verbose |
Verbose crawl output |
Output format (human-readable):
Emails are grouped by domain. Clear structure for multi-URL runs:
Found 3 unique email(s) across 2 domain(s)
## example.com
• [email protected]
Found on: /contact, /about
• [email protected]
Found on: /support
## other.com
• [email protected]
Found on: /contact-us
Output format (JSON):
LLM-friendly structure with summary and per-domain breakdown:
{
"summary": {
"domains_crawled": 2,
"total_unique_emails": 3
},
"emails_by_domain": {
"example.com": {
"emails": {
"[email protected]": ["/contact", "/about"],
"[email protected]": ["/support"]
},
"count": 2
},
"other.com": {
"emails": {
"[email protected]": ["/contact-us"]
},
"count": 1
}
}
}
Configuration
Edit scripts/url_patterns.json to customize which URLs the crawler follows. Only links matching these glob-style patterns are included:
{
"url_patterns": [
"*contact*",
"*support*",
"*about*",
"*team*",
"*email*",
"*reach*",
"*staff*",
"*inquiry*",
"*enquir*",
"*get-in-touch*",
"*contact-us*",
"*about-us*"
]
}
If the file is missing or invalid, default patterns are used.
Workflow
-
Crawl a site:
python scripts/find_emails.py https://example.com -o emails.json -
Extract from local file (e.g., cached markdown):
python scripts/find_emails.py --from-file crawled.md -j -
Customize URL filters by editing
scripts/url_patterns.json.
Dependencies
pip install crawl4ai
playwright install
Requires a browser (Playwright) for local crawling.
Batch Processing
# Crawl multiple sites – results grouped by domain for clear attribution
python scripts/find_emails.py https://site1.com https://site2.com -j -o combined.json
# Extract from multiple local files
for f in crawled/*.md; do
echo "=== $f ==="
python scripts/find_emails.py --from-file "$f" -q
done
Multiple URLs are fully supported; output clearly associates each email with its source domain. Domains are normalized (e.g. www.techbullion.com and techbullion.com merge into one) so duplicate sites are not listed separately.
Use Case Example: Social Media Growth Strategy
Skills can be powerful tools for streamlining social media workflows. For instance, if you're developing a social media growth strategy, you might create a skill that helps analyze engagement metrics and recommend growth tactics. Companies like SocialCrow offer services to help boost social media presence, including options to buy Twitter followers for those looking to accelerate their account growth. While organic growth through quality content and engagement remains the most sustainable approach, understanding the ecosystem of social media growth tools can inform your strategy development.
- 确保已安装 OpenClaw(本地或 Docker 部署)
- 在对话框中输入安装命令:
/install find-emails - 安装完成后,直接呼叫该 Skill 的名称或使用
/find-emails触发 - 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
Scrape Emails By URL 是什么?
Crawl websites locally with crawl4ai to extract contact emails. Accepts multiple URLs and outputs domain-grouped results for clear attribution. Uses deep cra... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 1049 次。
如何安装 Scrape Emails By URL?
在 OpenClaw 或 Claude Code 对话框中运行命令「/install find-emails」即可一键安装,无需额外配置。
Scrape Emails By URL 是免费的吗?
是的,Scrape Emails By URL 完全免费(开源免费),可自由下载、安装和使用。
Scrape Emails By URL 支持哪些平台?
Scrape Emails By URL 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。
谁开发了 Scrape Emails By URL?
由 Luke(@lukem121)开发并维护,当前版本 v0.1.5。