Website Content Scraped into Obsidian
/install claw-social-feed
claw-social-feed
Fetch social media timelines into Obsidian vaults. Multi-platform, incremental sync, smart filtering, auto-tagging.
Core dependency: bb-browser (via --openclaw flag to reuse the OpenClaw browser session). Supports 36 platforms via bb-browser adapters — see references/platforms.md.
Workflow
User config (config.yaml)
│
▼
fetch_save.py
│
├── Dedup accounts
├── Read state.json (last fetch cursor)
│
▼
bb-browser site \x3Cplatform>/\x3Ccmd> --openclaw --json
│
▼
Filter → Tag → Write to Obsidian
│
▼
Update state.json
Quick Start
1. Install bb-browser
# Requires Node.js 18+
npm install -g bb-browser
# Verify
bb-browser --version
2. Configure accounts
Edit config.yaml:
accounts:
- platform: twitter
username: your_target_handle
- platform: hackernews
username: your_username
vault_base: ~/Documents/Obsidian Vault/SocialFeed
fetch:
count: 20
filters:
min_text_length: 30
skip_retweet_no_comment: true
skip_link_only: true
blocked_keywords: []
tagging:
enabled: true
keywords:
AI / LLM / GPT / Claude: AI
Python / JavaScript / Rust: coding
3. Run
python3 scripts/fetch_save.py --verbose
4. Check output
Content lands in vault_base/@username/ — one .md file per post, with Obsidian YAML frontmatter (platform, author, date, URL, likes, tags).
Config Reference
accounts
accounts:
- platform: twitter
username: dotey
platform: must match a bb-browser supported platform (see references/platforms.md)username: the platform-native user identifier- Deduplication:
platform + usernamemust be unique within the list
filters
| Field | Type | Default | Description |
|---|---|---|---|
min_text_length |
int | 30 | Skip posts below this character count |
skip_retweet_no_comment |
bool | true | Skip retweets with no original comment |
skip_link_only |
bool | true | Skip posts that are links/images with little text |
blocked_keywords |
list | [] | Skip posts containing any of these keywords |
tagging
Auto-tag based on keyword matching (case-insensitive, / separated synonyms = OR):
tagging:
enabled: true
keywords:
AI / LLM / 大模型: AI
skill / Skills: skill
Python / JavaScript: coding
fetch.count
fetch:
count: 20 # default 20, max 100
twitter/tweets returns ~20 tweets newest-first by default. For scheduled syncs, set to 50–100 to avoid missing posts from high-frequency accounts between sync intervals.
Incremental Sync
state.json tracks the last-fetched timestamp per account. On re-run:
- Skips posts with
created_at ≤ last_fetch - Saves only new content
- Updates
last_fetchtimestamp
Missed-run compensation: if a cron job missed a run (e.g., machine was off), the next run will backfill content within catchup_window_days (default 3 days).
To force re-fetch an account: delete its entry in state.json or delete the corresponding .md files.
Scheduled Sync
To enable automatic sync, ask the agent:
"Sync every morning at 9am" or "Sync every Monday at 8am"
The agent will create a cron job that runs in isolated mode with incremental sync — no duplicates.
Troubleshooting
bb-browser: command not found
The script auto-detects bb-browser PATH. If it still fails, confirm npm global bin is in your PATH, or install via npm install -g bb-browser.
twitter/search returns webpack module error
Use twitter/tweets instead of twitter/search. This is a known bb-browser adapter compatibility issue.
Platform returns 401 Unauthorized The OpenClaw browser needs to be logged into that platform. Open the site manually in the browser, log in once, then retry.
File already exists but want to re-fetch
Delete the corresponding entry in state.json or delete the .md files for that account.
- 确保已安装 OpenClaw(本地或 Docker 部署)
- 在对话框中输入安装命令:
/install claw-social-feed - 安装完成后,直接呼叫该 Skill 的名称或使用
/claw-social-feed触发 - 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
Website Content Scraped into Obsidian 是什么?
Fetch social media content and save to Obsidian. Supports Twitter/X, Reddit, GitHub, HackerNews, Bilibili, Weibo, Xiaohongshu and 30+ platforms via bb-browse... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 142 次。
如何安装 Website Content Scraped into Obsidian?
在 OpenClaw 或 Claude Code 对话框中运行命令「/install claw-social-feed」即可一键安装,无需额外配置。
Website Content Scraped into Obsidian 是免费的吗?
是的,Website Content Scraped into Obsidian 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。
Website Content Scraped into Obsidian 支持哪些平台?
Website Content Scraped into Obsidian 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。
谁开发了 Website Content Scraped into Obsidian?
由 Glassmarbles(@glassmarbles)开发并维护,当前版本 v0.1.2。