← 返回 Skills 市场
terrycarter1985

HN Podcast Archive

作者 terrycarter1985 · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ✓ 安全检测通过
103
总下载
0
收藏
0
当前安装
1
版本数
在 OpenClaw 中安装
/install hn-podcast-archive
功能描述
Automate podcast archiving by detecting new HN episodes from RSS, downloading audio, transcribing locally with Whisper, and generating markdown archives with...
使用说明 (SKILL.md)

HN Podcast Archive

Set up or maintain a repeatable pipeline that:

  1. reads an RSS feed,
  2. detects new episodes,
  3. downloads audio,
  4. transcribes with local Whisper,
  5. writes a markdown archive per episode,
  6. updates index/state files.

Workflow

  1. Read references/layout.md to understand the expected archive layout and outputs.
  2. Use scripts/hn_podcast_archive.py as the primary implementation.
  3. Run python3 scripts/hn_podcast_archive.py --help to inspect options.
  4. For first-time setup, ensure required binaries and Python modules exist.
  5. For automation, schedule the script on a recurring cadence with a stable output directory.

Required runtime dependencies

The script expects:

  • ffmpeg in PATH
  • whisper in PATH
  • Python 3.10+
  • Python package feedparser

If any dependency is missing, surface a clear setup note instead of pretending the pipeline is ready to execute.

Recommended command

python3 skills/hn-podcast-archive/scripts/hn_podcast_archive.py \
  --feed-url "https://example.com/podcast.rss" \
  --output-dir ./data/hn-podcast-archive \
  --whisper-model turbo

Output expectations

For each ingested episode, create:

  • downloaded audio under audio/
  • transcript under transcripts/
  • markdown archive under episodes/

Keep these shared files current:

  • index.md
  • state.json
  • run-log.jsonl

Automation guidance

For automation, prefer a cron/standing-order style trigger that runs every few hours. The script is idempotent at the episode level by tracking processed GUIDs/URLs in state.json.

Safe operating rules

  • Never overwrite unrelated archive content.
  • Skip already-processed episodes unless explicitly forced.
  • Preserve source metadata (title, published date, audio URL, guid).
  • If transcription fails after download, keep the audio and record the failure in the log/state.

Customization points

Useful flags:

  • --limit N to ingest only recent items during testing
  • --force to reprocess already-seen items
  • --dry-run to inspect actions without writing outputs
  • --whisper-model to trade speed vs accuracy

Packaging/publishing

Package the skill from its folder. Publish with ClawHub only after local validation passes and authentication is available.

安全使用建议
This skill appears coherent for archiving/transcribing podcasts, but take these practical precautions before installing or scheduling it: 1) Verify and install 'whisper' and 'ffmpeg' from trusted sources — the script will run the 'whisper' binary found on PATH, so a malicious binary with that name would be executed. 2) Run the script manually with --dry-run and a test feed and output directory to confirm behavior before scheduling. 3) Use a dedicated output directory (not a system or home root) and consider an isolated environment (virtualenv, container) for Python deps. 4) Inspect and trust the RSS feed sources you give the script; it will download and store audio from those URLs. 5) Pin feedparser and any other runtime components as you deploy. If you want higher assurance, request an install spec or signed release for the whisper/ffmpeg binaries you plan to use.
功能分析
Type: OpenClaw Skill Name: hn-podcast-archive Version: 1.0.0 The skill provides a legitimate utility for archiving and transcribing podcast episodes from RSS feeds using local Whisper transcription. The Python script (hn_podcast_archive.py) implements safe file handling, input sanitization via slugification, and secure subprocess execution for external binaries. No evidence of data exfiltration, malicious persistence, or prompt injection was found; the instructions in SKILL.md and documentation in references/ are consistent with the stated purpose of building an automated media pipeline.
能力评估
Purpose & Capability
Name/description (archive HN podcast episodes, download audio, transcribe with Whisper, write markdown) match the included files and declared runtime expectations. The script requires feedparser, ffmpeg, and a 'whisper' CLI which are appropriate for this task.
Instruction Scope
SKILL.md and references document only RSS fetching, downloading audio, local transcription, writing files (audio/, transcripts/, episodes/, state.json, run-log.jsonl, index.md), and scheduling. The script reads/writes only under the specified output directory and does not access unrelated system paths, environment variables, or external endpoints other than fetching RSS and episode audio.
Install Mechanism
There is no install spec (instruction-only), which is low-risk. The code expects external binaries ('ffmpeg' and 'whisper') and the Python feedparser package; these are reasonable but require the operator to install and vet. The 'whisper' CLI is invoked via subprocess — ensure the binary on PATH is the intended transcription tool (the script will execute whatever 'whisper' refers to).
Credentials
The skill requests no environment variables, credentials, or config paths. The script operates on a provided output directory and does network fetches for the RSS and audio files only, which is proportionate to the stated purpose.
Persistence & Privilege
Flags show no forced permanence (always:false) and no modifications to other skills or system-wide settings. The skill writes only to its own output directory and state/log files as described.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install hn-podcast-archive
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /hn-podcast-archive 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.0
Initial release: RSS-driven download, Whisper transcription, markdown archive, state/index management
元数据
Slug hn-podcast-archive
版本 1.0.0
许可证 MIT-0
累计安装 0
当前安装数 0
历史版本数 1
常见问题

HN Podcast Archive 是什么?

Automate podcast archiving by detecting new HN episodes from RSS, downloading audio, transcribing locally with Whisper, and generating markdown archives with... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 103 次。

如何安装 HN Podcast Archive?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install hn-podcast-archive」即可一键安装,无需额外配置。

HN Podcast Archive 是免费的吗?

是的,HN Podcast Archive 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。

HN Podcast Archive 支持哪些平台?

HN Podcast Archive 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 HN Podcast Archive?

由 terrycarter1985(@terrycarter1985)开发并维护,当前版本 v1.0.0。

💬 留言讨论