HN Podcast Transcriber
/install hn-podcast-transcriber
HN Podcast Transcriber
Fetch new episodes from the Hacker News Morning Brief podcast RSS feed, transcribe with Whisper, and archive as searchable markdown.
Prerequisites
- whisper CLI installed (
pip install openai-whisper) - ffmpeg on PATH (required by whisper; download from https://ffmpeg.org)
- python3 with standard library (no extra deps for the fetch script)
- Disk space for audio files (~5-10 MB per episode)
Quick Start
Run the main script to fetch and transcribe all new episodes:
bash scripts/fetch_and_transcribe.sh --archive ~/hn-podcast-archive
First run processes all episodes. Subsequent runs only process new ones (tracked via state.json).
Options
| Flag | Default | Description |
|---|---|---|
--feed URL |
HN Morning Brief RSS | Podcast RSS feed URL |
--archive DIR |
./hn-podcast-archive |
Archive root directory |
--model MODEL |
turbo |
Whisper model (tiny/base/small/medium/large/turbo) |
--limit N |
0 (all) | Max new episodes to process per run |
Custom Feeds
Point at any podcast RSS feed:
bash scripts/fetch_and_transcribe.sh --feed "https://example.com/podcast/feed.xml" --archive ./my-podcast-archive
Scheduling
Set up an OpenClaw cron job for daily checks:
- Create an isolated cron job that runs the script
- Or add a heartbeat check in HEARTBEAT.md
Archive Structure
See references/archive-layout.md for directory layout and state.json schema.
Workflow Summary
- Download RSS feed → parse
\x3Citem>entries - Skip already-processed episodes (state.json lookup)
- Download audio (mp3/m4a) to episode directory
- Run
whisperto produce.txttranscript - Generate cleaned
transcript.mdwith title + date header - Update state.json with processed episode ID
Notes
- Whisper models cache to
~/.cache/whisperafter first download - Use
--model tinyfor speed,--model largefor best accuracy - Average episode (~6 min) takes ~1-2 min with turbo model on CPU
- For GPU acceleration, install ffmpeg with CUDA support
- 确保已安装 OpenClaw(本地或 Docker 部署)
- 在对话框中输入安装命令:
/install hn-podcast-transcriber - 安装完成后,直接呼叫该 Skill 的名称或使用
/hn-podcast-transcriber触发 - 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
HN Podcast Transcriber 是什么?
Automatically fetch, transcribe, and archive Hacker News podcast episodes (Hacker News Morning Brief). Use when the user wants to set up a podcast transcript... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 44 次。
如何安装 HN Podcast Transcriber?
在 OpenClaw 或 Claude Code 对话框中运行命令「/install hn-podcast-transcriber」即可一键安装,无需额外配置。
HN Podcast Transcriber 是免费的吗?
是的,HN Podcast Transcriber 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。
HN Podcast Transcriber 支持哪些平台?
HN Podcast Transcriber 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。
谁开发了 HN Podcast Transcriber?
由 terrycarter1985(@terrycarter1985)开发并维护,当前版本 v1.0.0。