
HN Podcast Transcriber

by terrycarter1985 · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ✓ Security Clean
Downloads: 44 · Stars: 0 · Active Installs: 0 · Versions: 1
Install in OpenClaw
/install hn-podcast-transcriber
Description
Automatically fetch, transcribe, and archive Hacker News podcast episodes (Hacker News Morning Brief). Use when the user wants to set up a podcast transcript...
README (SKILL.md)

HN Podcast Transcriber

Fetch new episodes from the Hacker News Morning Brief podcast RSS feed, transcribe with Whisper, and archive as searchable markdown.

Prerequisites

  • whisper CLI installed (pip install openai-whisper)
  • ffmpeg on PATH (required by whisper; download from https://ffmpeg.org)
  • python3 with standard library (no extra deps for the fetch script)
  • Disk space for audio files (~5-10 MB per episode)
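
The prerequisites above can be checked before a first run. The following is a minimal sketch, not part of the skill itself: the helper name `missing_tools` is hypothetical, and it only confirms that each executable is discoverable on PATH, not that it works.

```python
import shutil

# Executables named in the Prerequisites list.
REQUIRED_TOOLS = ("whisper", "ffmpeg", "python3")


def missing_tools(tools=REQUIRED_TOOLS, which=shutil.which):
    """Return the names in `tools` that cannot be found on PATH.

    `which` is injectable so the check can be exercised without
    touching the real environment.
    """
    return [name for name in tools if which(name) is None]


if __name__ == "__main__":
    missing = missing_tools()
    if missing:
        print("Missing from PATH: " + ", ".join(missing))
    else:
        print("All prerequisites found.")
```

If anything is reported missing, install it from a trusted source before running the fetch script.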

Quick Start

Run the main script to fetch and transcribe all new episodes:

bash scripts/fetch_and_transcribe.sh --archive ~/hn-podcast-archive

First run processes all episodes. Subsequent runs only process new ones (tracked via state.json).
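
To illustrate how a state file can make later runs incremental, here is a small sketch. It assumes a hypothetical schema, a single `processed` list of episode IDs; the real state.json schema is documented in references/archive-layout.md and may differ. The function names are also illustrative.

```python
import json
from pathlib import Path


def load_processed(state_path):
    """Return the set of processed episode IDs, or an empty set on first run."""
    path = Path(state_path)
    if not path.exists():
        return set()
    data = json.loads(path.read_text(encoding="utf-8"))
    # "processed" is an assumed key name, not the skill's confirmed schema.
    return set(data.get("processed", []))


def save_processed(state_path, processed):
    """Write the processed IDs back to disk in a stable, sorted order."""
    path = Path(state_path)
    path.parent.mkdir(parents=True, exist_ok=True)
    payload = {"processed": sorted(processed)}
    path.write_text(json.dumps(payload, indent=2), encoding="utf-8")


def select_new(episode_ids, processed, limit=0):
    """Keep unprocessed IDs in feed order; a limit of 0 means no cap."""
    new = [eid for eid in episode_ids if eid not in processed]
    return new if limit <= 0 else new[:limit]
```

With an empty state every episode counts as new, which is why the first run processes the whole feed unless a limit is given.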

Options

Flag            Default                 Description
--feed URL      HN Morning Brief RSS    Podcast RSS feed URL
--archive DIR   ./hn-podcast-archive    Archive root directory
--model MODEL   turbo                   Whisper model (tiny/base/small/medium/large/turbo)
--limit N       0 (all)                 Max new episodes to process per run

Custom Feeds

Point at any podcast RSS feed:

bash scripts/fetch_and_transcribe.sh --feed "https://example.com/podcast/feed.xml" --archive ./my-podcast-archive

Scheduling

Schedule daily checks in OpenClaw in one of two ways:

  • Create an isolated cron job that runs the script
  • Add a heartbeat check in HEARTBEAT.md

Archive Structure

See references/archive-layout.md for directory layout and state.json schema.

Workflow Summary

  1. Download RSS feed → parse <item> entries
  2. Skip already-processed episodes (state.json lookup)
  3. Download audio (mp3/m4a) to episode directory
  4. Run whisper to produce .txt transcript
  5. Generate cleaned transcript.md with title + date header
  6. Update state.json with processed episode ID
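
The parsing, transcription, and formatting steps above can be sketched with the Python standard library. This is an illustration under stated assumptions, not the code in scripts/fetch_and_transcribe.sh: the function names, the use of `<guid>` as the episode ID, and the exact markdown header layout are all hypothetical. The whisper flags shown (`--model`, `--output_format`, `--output_dir`) are existing options of the openai-whisper CLI.

```python
import xml.etree.ElementTree as ET
from email.utils import parsedate_to_datetime


def parse_items(rss_xml):
    """Return one dict per <item> that carries an audio <enclosure>."""
    root = ET.fromstring(rss_xml)
    episodes = []
    for item in root.iter("item"):
        enclosure = item.find("enclosure")
        if enclosure is None or not enclosure.get("url"):
            continue  # no downloadable audio, nothing to transcribe
        audio_url = enclosure.get("url")
        episodes.append({
            # Assumption: fall back to the audio URL when <guid> is absent.
            "id": (item.findtext("guid") or audio_url).strip(),
            "title": (item.findtext("title") or "Untitled").strip(),
            "published": (item.findtext("pubDate") or "").strip(),
            "audio_url": audio_url,
        })
    return episodes


def whisper_command(audio_path, output_dir, model="turbo"):
    """Build the argument list for a plain-text whisper transcription."""
    return [
        "whisper", str(audio_path),
        "--model", model,
        "--output_format", "txt",
        "--output_dir", str(output_dir),
    ]


def render_markdown(title, published, transcript_text):
    """Prefix the transcript with a title and date header (assumed layout)."""
    try:
        date = parsedate_to_datetime(published).date().isoformat()
    except (TypeError, ValueError):
        # Unparseable or missing pubDate: keep the raw value if there is one.
        date = published or "unknown"
    return f"# {title}\n\nDate: {date}\n\n{transcript_text.strip()}\n"
```

The list returned by `whisper_command` can be passed to `subprocess.run(..., check=True)`. Passing arguments as a list rather than a shell string avoids quoting problems with episode titles or paths that contain spaces.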

Notes

  • Whisper models cache to ~/.cache/whisper after first download
  • Use --model tiny for speed, --model large for best accuracy
  • Average episode (~6 min) takes ~1-2 min with turbo model on CPU
  • For GPU acceleration, install a CUDA-enabled PyTorch build; Whisper runs inference through PyTorch, while ffmpeg only decodes the audio
Usage Guidance
This skill is reasonable to install if you want a local podcast transcription archive. Before running it, install Whisper and ffmpeg from trusted sources, use a dedicated archive folder, try `--limit` first, and only enable cron/heartbeat scheduling if you want it to keep running automatically.
Capability Analysis
Type: OpenClaw Skill
Name: hn-podcast-transcriber
Version: 1.0.0

The skill bundle is a legitimate utility for automating the transcription of the Hacker News Morning Brief podcast. The core logic in `scripts/fetch_and_transcribe.sh` uses standard Python libraries to parse RSS feeds, download audio files, and invoke the Whisper CLI for transcription. The code follows best practices for subprocess management and file handling, and there are no indicators of malicious intent, data exfiltration, or prompt injection.
Capability Assessment
Purpose & Capability
The stated purpose and script behavior are coherent: fetch RSS items, download audio, run Whisper, and write transcripts. It also supports any podcast RSS feed, so users should choose trusted feeds and consider using --limit for large archives.
Instruction Scope
Execution is user-directed and disclosed, but the default first run processes all unprocessed episodes because the default limit is 0/all.
Install Mechanism
There is no install spec and the registry declares no required binaries, while SKILL.md requires local tools such as whisper, ffmpeg, and python3. This is purpose-aligned but should be made explicit before use.
Credentials
The skill downloads audio, writes an archive directory, caches Whisper models, and runs local transcription commands. These are expected for the stated purpose but can consume disk, network, CPU, or GPU resources.
Persistence & Privilege
No hidden persistence is shown, but the documentation suggests optional cron or heartbeat scheduling for daily runs; users should configure that deliberately and monitor it.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install hn-podcast-transcriber
  3. After installation, invoke the skill by name or use /hn-podcast-transcriber
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
Initial release: auto-fetch, transcribe, and archive Hacker News Morning Brief podcast episodes
Metadata
Slug hn-podcast-transcriber
Version 1.0.0
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 1
Frequently Asked Questions

What is HN Podcast Transcriber?

Automatically fetch, transcribe, and archive Hacker News podcast episodes (Hacker News Morning Brief). Use when the user wants to set up a podcast transcript... It is an AI Agent Skill for Claude Code / OpenClaw, with 44 downloads so far.

How do I install HN Podcast Transcriber?

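Run "/install hn-podcast-transcriber" in the OpenClaw or Claude Code chat to install the skill in one step. The whisper CLI, ffmpeg, and python3 listed under Prerequisites are not installed by this command and must already be available.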

Is HN Podcast Transcriber free?

Yes, HN Podcast Transcriber is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does HN Podcast Transcriber support?

HN Podcast Transcriber is cross-platform and runs anywhere OpenClaw / Claude Code is available.

Who created HN Podcast Transcriber?

It is built and maintained by terrycarter1985 (@terrycarter1985); the current version is v1.0.0.
