HN Podcast Archive
/install hn-podcast-archive
HN Podcast Archive
Set up or maintain a repeatable pipeline that:
- reads an RSS feed,
- detects new episodes,
- downloads audio,
- transcribes with local Whisper,
- writes a markdown archive per episode,
- updates index/state files.
Workflow
- Read
references/layout.mdto understand the expected archive layout and outputs. - Use
scripts/hn_podcast_archive.pyas the primary implementation. - Run
python3 scripts/hn_podcast_archive.py --helpto inspect options. - For first-time setup, ensure required binaries and Python modules exist.
- For automation, schedule the script on a recurring cadence with a stable output directory.
Required runtime dependencies
The script expects:
ffmpegin PATHwhisperin PATH- Python 3.10+
- Python package
feedparser
If any dependency is missing, surface a clear setup note instead of pretending the pipeline is ready to execute.
Recommended command
python3 skills/hn-podcast-archive/scripts/hn_podcast_archive.py \
--feed-url "https://example.com/podcast.rss" \
--output-dir ./data/hn-podcast-archive \
--whisper-model turbo
Output expectations
For each ingested episode, create:
- downloaded audio under
audio/ - transcript under
transcripts/ - markdown archive under
episodes/
Keep these shared files current:
index.mdstate.jsonrun-log.jsonl
Automation guidance
For automation, prefer a cron/standing-order style trigger that runs every few hours. The script is idempotent at the episode level by tracking processed GUIDs/URLs in state.json.
Safe operating rules
- Never overwrite unrelated archive content.
- Skip already-processed episodes unless explicitly forced.
- Preserve source metadata (title, published date, audio URL, guid).
- If transcription fails after download, keep the audio and record the failure in the log/state.
Customization points
Useful flags:
--limit Nto ingest only recent items during testing--forceto reprocess already-seen items--dry-runto inspect actions without writing outputs--whisper-modelto trade speed vs accuracy
Packaging/publishing
Package the skill from its folder. Publish with ClawHub only after local validation passes and authentication is available.
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat:
/install hn-podcast-archive - After installation, invoke the skill by name or use
/hn-podcast-archive - Provide required inputs per the skill's parameter spec and get structured output
What is HN Podcast Archive?
Automate podcast archiving by detecting new HN episodes from RSS, downloading audio, transcribing locally with Whisper, and generating markdown archives with... It is an AI Agent Skill for Claude Code / OpenClaw, with 103 downloads so far.
How do I install HN Podcast Archive?
Run "/install hn-podcast-archive" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.
Is HN Podcast Archive free?
Yes, HN Podcast Archive is completely free, licensed under MIT-0. You can download, install and use it at no cost.
Which platforms does HN Podcast Archive support?
HN Podcast Archive is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).
Who created HN Podcast Archive?
It is built and maintained by terrycarter1985 (@terrycarter1985); the current version is v1.0.0.