HN Podcast Archive
Set up or maintain a repeatable pipeline that:
- reads an RSS feed,
- detects new episodes,
- downloads audio,
- transcribes with local Whisper,
- writes a markdown archive per episode,
- updates index/state files.
Workflow
- Read references/layout.md to understand the expected archive layout and outputs.
- Use scripts/hn_podcast_archive.py as the primary implementation.
- Run python3 scripts/hn_podcast_archive.py --help to inspect options.
- For first-time setup, ensure required binaries and Python modules exist.
- For automation, schedule the script on a recurring cadence with a stable output directory.
Required runtime dependencies
The script expects:
- ffmpeg in PATH
- whisper in PATH
- Python 3.10+
- Python package feedparser
If any dependency is missing, surface a clear setup note instead of pretending the pipeline is ready to execute.
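One way to surface that setup note is a small preflight check. The following is a minimal sketch using only the standard library; the function name `missing_dependencies` and the message wording are illustrative assumptions, not part of the shipped script.

```python
import importlib.util
import shutil
import sys


def missing_dependencies(binaries=("ffmpeg", "whisper"),
                         modules=("feedparser",),
                         min_python=(3, 10)):
    """Return a human-readable note for every unmet requirement.

    Hypothetical helper: the real script may perform this check differently.
    """
    problems = []
    if sys.version_info < min_python:
        wanted = ".".join(map(str, min_python))
        problems.append(f"Python {wanted}+ is required")
    for name in binaries:
        # shutil.which returns None when the executable is not on PATH.
        if shutil.which(name) is None:
            problems.append(f"'{name}' was not found in PATH")
    for name in modules:
        # find_spec returns None when a top-level module cannot be located.
        if importlib.util.find_spec(name) is None:
            problems.append(f"Python package '{name}' is not installed")
    return problems


# Print one setup note per missing requirement; prints nothing when all is well.
for note in missing_dependencies():
    print(f"setup needed: {note}")
```

An empty list means the pipeline can proceed; anything else should be reported to the user before any download or transcription is attempted.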
Recommended command
python3 skills/hn-podcast-archive/scripts/hn_podcast_archive.py \
--feed-url "https://example.com/podcast.rss" \
--output-dir ./data/hn-podcast-archive \
--whisper-model turbo
Output expectations
For each ingested episode, create:
- downloaded audio under audio/
- transcript under transcripts/
- markdown archive under episodes/
Keep these shared files current:
- index.md
- state.json
- run-log.jsonl
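The per-episode layout can be illustrated by mapping one episode onto the three folders. This is only a sketch: the date-plus-slug file naming, the `.txt` transcript extension, and the helper names `slugify` and `episode_paths` are assumptions, and references/layout.md remains the authority on the actual naming scheme.

```python
import re
from pathlib import Path


def slugify(title, max_len=60):
    """Lowercase the title and collapse runs of non-alphanumerics into '-'."""
    slug = re.sub(r"[^a-z0-9]+", "-", title.lower()).strip("-")
    return slug[:max_len].rstrip("-") or "episode"


def episode_paths(output_dir, published_date, title, audio_ext=".mp3"):
    """Map one episode onto the audio/, transcripts/ and episodes/ folders.

    The '<date>-<slug>' stem is an assumed convention for illustration.
    """
    root = Path(output_dir)
    stem = f"{published_date}-{slugify(title)}"
    return {
        "audio": root / "audio" / f"{stem}{audio_ext}",
        "transcript": root / "transcripts" / f"{stem}.txt",
        "episode": root / "episodes" / f"{stem}.md",
    }


paths = episode_paths("./data/hn-podcast-archive", "2024-05-01", "Ask HN: What's New?")
for kind, path in paths.items():
    print(kind, path)
```

Deriving all three paths from one stem keeps the audio, transcript, and markdown archive for an episode easy to correlate.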
Automation guidance
For automation, prefer a cron/standing-order style trigger that runs every few hours. The script is idempotent at the episode level by tracking processed GUIDs/URLs in state.json.
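Episode-level idempotence of this kind might look like the sketch below. The shape of state.json (a `processed` mapping keyed by GUID or audio URL), the entry dictionaries, and all function names are assumptions made for illustration; the real script's state format may differ.

```python
import json
from pathlib import Path


def load_state(path):
    """Read state.json, or start with an empty state on the first run."""
    path = Path(path)
    if not path.exists():
        return {"processed": {}}
    return json.loads(path.read_text(encoding="utf-8"))


def episode_key(entry):
    """Prefer the feed GUID; fall back to the audio URL when it is missing."""
    return entry.get("guid") or entry["audio_url"]


def select_new(entries, state, force=False):
    """Return entries not yet recorded as processed (all of them if forced)."""
    seen = state["processed"]
    return [e for e in entries if force or episode_key(e) not in seen]


def mark_processed(state, entry, path):
    """Record the episode, then write state.json via a temp file and rename."""
    state["processed"][episode_key(entry)] = {"title": entry.get("title")}
    path = Path(path)
    tmp = path.with_name(path.name + ".tmp")
    tmp.write_text(json.dumps(state, indent=2), encoding="utf-8")
    tmp.replace(path)  # rename so readers never see a half-written file
```

Because a scheduled run only acts on what `select_new` returns, re-running the job every few hours does no extra work when the feed has not changed.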
Safe operating rules
- Never overwrite unrelated archive content.
- Skip already-processed episodes unless explicitly forced.
- Preserve source metadata (title, published date, audio URL, guid).
- If transcription fails after download, keep the audio and record the failure in the log/state.
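The last rule can be sketched as a wrapper around the transcription step. The log record fields (`ts`, `guid`, `audio`, `status`, `error`) and the function names are assumptions; the transcription command is passed in by the caller so that no particular Whisper invocation is implied here.

```python
import json
import subprocess
from datetime import datetime, timezone
from pathlib import Path


def append_log(log_path, record):
    """Append one JSON object per line to run-log.jsonl."""
    record = {"ts": datetime.now(timezone.utc).isoformat(), **record}
    with Path(log_path).open("a", encoding="utf-8") as fh:
        fh.write(json.dumps(record) + "\n")


def transcribe_or_record(audio_path, guid, log_path, command):
    """Run the transcription command; on failure keep the audio and log it.

    `command` is the full argument list chosen by the caller (hypothetical).
    """
    try:
        subprocess.run(command, check=True, capture_output=True, text=True)
    except (subprocess.CalledProcessError, FileNotFoundError) as exc:
        # The audio file is deliberately left in place so a later run can retry.
        append_log(log_path, {"guid": guid, "audio": str(audio_path),
                              "status": "transcription_failed",
                              "error": str(exc)})
        return False
    append_log(log_path, {"guid": guid, "audio": str(audio_path), "status": "ok"})
    return True
```

Returning a boolean lets the caller decide whether to mark the episode as processed or leave it eligible for a retry on the next run.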
Customization points
Useful flags:
- --limit N to ingest only recent items during testing
- --force to reprocess already-seen items
- --dry-run to inspect actions without writing outputs
- --whisper-model to trade speed vs accuracy
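For orientation, a command-line interface with these flags could be declared roughly as follows. This is a sketch with argparse, not the script's actual parser: the defaults, help strings, and which options are required are assumptions, so the output of --help remains the definitive reference.

```python
import argparse


def build_parser():
    """Build a parser mirroring the documented flags (defaults are assumed)."""
    parser = argparse.ArgumentParser(prog="hn_podcast_archive.py")
    parser.add_argument("--feed-url", required=True, help="RSS feed to read")
    parser.add_argument("--output-dir", required=True, help="archive root directory")
    parser.add_argument("--whisper-model", default="turbo",
                        help="Whisper model name; smaller models are faster but less accurate")
    parser.add_argument("--limit", type=int, default=None, metavar="N",
                        help="ingest only the N most recent items")
    parser.add_argument("--force", action="store_true",
                        help="reprocess items already recorded in state.json")
    parser.add_argument("--dry-run", action="store_true",
                        help="report planned actions without writing outputs")
    return parser


args = build_parser().parse_args([
    "--feed-url", "https://example.com/podcast.rss",
    "--output-dir", "./data/hn-podcast-archive",
    "--limit", "3",
    "--dry-run",
])
print(args.limit, args.dry_run, args.force, args.whisper_model)
```

Combining --limit with --dry-run is a low-risk way to check a new feed before the first full ingest.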
Packaging/publishing
Package the skill from its folder. Publish with ClawHub only after local validation passes and authentication is available.
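- Make sure OpenClaw is installed (locally or via Docker).
- Enter the install command in the chat box: /install hn-podcast-archive
- Once installation finishes, invoke the Skill by name or trigger it with /hn-podcast-archive.
- Provide the inputs described in the Skill's parameter notes to get structured output.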
What is HN Podcast Archive?
Automate podcast archiving by detecting new HN episodes from RSS, downloading audio, transcribing locally with Whisper, and generating markdown archives with... It is an AI Agent Skill plugin for Claude Code / OpenClaw, with 103 downloads to date.
How do I install HN Podcast Archive?
Run the command "/install hn-podcast-archive" in the OpenClaw or Claude Code chat box to install it in one step, with no additional configuration.
Is HN Podcast Archive free?
Yes. HN Podcast Archive is completely free and released under the MIT-0 license, so it can be downloaded, installed, and used freely.
Which platforms does HN Podcast Archive support?
HN Podcast Archive is cross-platform and can be used in any environment where OpenClaw / Claude Code is deployed.
Who develops HN Podcast Archive?
It is developed and maintained by terrycarter1985 (@terrycarter1985). The current version is v1.0.0.