Audio SRT Workflow
/install audio-srt-workflow
Use this skill for end-to-end subtitle work.
This package is self-contained for runtime entrypoints:
- scripts/align_to_srt.py
- scripts/gui_app.py
- scripts/srt_stats.py
- scripts/make_preview_mp4.py
- scripts/requirements.txt
Scope
- Mode A: audio + reference text -> aligned SRT
- Mode B: audio only -> auto subtitle SRT
- Timing QA with srt_stats.py
- Burned preview generation with make_preview_mp4.py
Inputs To Collect First
- Audio path (wav, mp3, m4a, ...)
- Whether a reference transcript is available
- Output SRT path (or output directory)
- Language hint (zh, en, ...)
- Preferred run style: CLI, GUI, or Python API
Decision Rule
- If transcript exists, run Mode A (align_to_srt.py --text ...).
- If transcript does not exist, run Mode B via GUI or Python API (run_auto_subtitle_pipeline).
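The decision rule above can be sketched in a few lines of Python. This is only an illustration: choose_mode is a hypothetical helper, not part of the packaged scripts, and it assumes that "transcript exists" means a readable file on disk.

```python
from pathlib import Path
from typing import Optional


def choose_mode(transcript_path: Optional[str]) -> str:
    """Return "A" when a transcript file exists on disk, otherwise "B".

    Hypothetical helper for illustration; the skill itself does not ship it.
    """
    if transcript_path and Path(transcript_path).is_file():
        return "A"  # audio + reference text -> aligned SRT
    return "B"  # audio only -> auto subtitle SRT
```

A missing path, an empty string, or a path that does not point to a file all fall through to Mode B.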
Workflow
- Validate environment and paths.
- Choose Mode A or Mode B by transcript availability.
- Run subtitle generation from packaged scripts.
- Run timing diagnostics (srt_stats.py).
- If needed, render a preview mp4 with burned subtitles.
Resolve Skill Script Path
Set a local variable to your installed skill directory.
Codex default path:
SKILL_DIR="${CODEX_HOME:-$HOME/.codex}/skills/audio-srt-workflow"
OpenClaw/ClawHub install path example:
SKILL_DIR="<your-workdir>/skills/audio-srt-workflow"
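When driving the scripts from Python rather than a shell, the Codex default path can be resolved the same way. The sketch below mirrors the ${CODEX_HOME:-$HOME/.codex} expansion; resolve_skill_dir is a hypothetical name, and it assumes Path.home() is an acceptable stand-in for $HOME.

```python
import os
from pathlib import Path


def resolve_skill_dir(env=None) -> Path:
    """Resolve the Codex default install path for this skill.

    Hypothetical helper; pass a dict as `env` to override os.environ.
    """
    env = os.environ if env is None else env
    # ${VAR:-default} falls back when the variable is unset OR empty,
    # so `or` (not a plain .get default) is the matching Python idiom.
    codex_home = env.get("CODEX_HOME") or str(Path.home() / ".codex")
    return Path(codex_home) / "skills" / "audio-srt-workflow"
```

For an OpenClaw/ClawHub install, the directory would need to be supplied explicitly instead, since the work directory is not derivable from the environment.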
Environment Checks
Run these checks before execution:
python3 --version
ffmpeg -version
python3 -c "import faster_whisper; print('ok')"
If faster-whisper import fails:
# Review dependencies before installing:
cat "$SKILL_DIR/scripts/requirements.txt"
pip install -r "$SKILL_DIR/scripts/requirements.txt"
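The same checks can be run from Python before launching a batch. This is a minimal sketch under stated assumptions: check_environment is a hypothetical helper, and it only verifies that ffmpeg is on PATH and that a faster_whisper module can be located, not that either one actually works.

```python
import importlib.util
import shutil
import sys


def check_environment() -> dict:
    """Report whether the prerequisites listed above appear to be present.

    Hypothetical helper: it checks presence only and runs nothing.
    """
    return {
        # The interpreter running this code is Python 3.
        "python3": sys.version_info.major == 3,
        # An ffmpeg executable is discoverable on PATH.
        "ffmpeg": shutil.which("ffmpeg") is not None,
        # The faster_whisper package can be located without importing it.
        "faster_whisper": importlib.util.find_spec("faster_whisper") is not None,
    }
```

Any False value points back to the corresponding shell check above for the full error message.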
Mode A Command Template (Audio + Transcript)
python3 "$SKILL_DIR/scripts/align_to_srt.py" \
--audio "<input_audio>" \
--text "<transcript_txt>" \
--output "<output_srt>" \
--model small \
--language zh
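For batch runs it can be convenient to build the Mode A command as an argument list and hand it to subprocess. The sketch below uses only the flags shown in the template above; build_mode_a_command is a hypothetical helper, and using sys.executable in place of python3 is an assumption made for portability.

```python
import sys
from pathlib import Path


def build_mode_a_command(skill_dir, audio, text, output,
                         model="small", language="zh"):
    """Build the argv list for align_to_srt.py (hypothetical helper)."""
    script = Path(skill_dir) / "scripts" / "align_to_srt.py"
    return [
        sys.executable, str(script),
        "--audio", str(audio),
        "--text", str(text),
        "--output", str(output),
        "--model", model,
        "--language", language,
    ]
```

The result can be passed to subprocess.run(cmd, check=True); a list avoids shell quoting problems with paths that contain spaces.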
Mode B Command Template (Audio Only)
GUI:
python3 "$SKILL_DIR/scripts/gui_app.py"
Or use the Python API in scripts:
- Build config with build_alignment_config(...)
- Run run_auto_subtitle_pipeline(...)
See command details in references/command-templates.md.
QA And Preview
Timing stats:
python3 "$SKILL_DIR/scripts/srt_stats.py" --srt "<output_srt>"
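To give a sense of what timing diagnostics on an SRT file look like, here is an independent sketch that computes cue durations and counts overlapping cues. It is not the implementation of srt_stats.py, whose actual output is not documented here; srt_timing_stats and the reported fields are assumptions for illustration.

```python
import re
from statistics import mean

# Matches an SRT timing line such as "00:00:01,500 --> 00:00:04,000".
TIMING = re.compile(
    r"(\d+):(\d{2}):(\d{2})[,.](\d{3})\s*-->\s*(\d+):(\d{2}):(\d{2})[,.](\d{3})"
)


def _seconds(h, m, s, ms):
    """Convert timestamp fields (as strings) to seconds."""
    return int(h) * 3600 + int(m) * 60 + int(s) + int(ms) / 1000


def srt_timing_stats(srt_text: str) -> dict:
    """Summarise cue durations and overlaps (illustrative, not srt_stats.py)."""
    cues = []
    for match in TIMING.finditer(srt_text):
        fields = match.groups()
        cues.append((_seconds(*fields[:4]), _seconds(*fields[4:])))
    durations = [end - start for start, end in cues]
    # A cue overlaps when it starts before the previous cue has ended.
    overlaps = sum(
        1 for (_, prev_end), (start, _) in zip(cues, cues[1:]) if start < prev_end
    )
    return {
        "cues": len(cues),
        "min_duration": min(durations, default=0.0),
        "max_duration": max(durations, default=0.0),
        "mean_duration": mean(durations) if durations else 0.0,
        "overlaps": overlaps,
    }
```

Very short cues or a non-zero overlap count are typical signs that timings need another look.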
Preview video:
python3 "$SKILL_DIR/scripts/make_preview_mp4.py" \
--audio "<input_audio>" \
--srt "<output_srt>" \
--output "<preview_mp4>"
Output Conventions
- Default output uses the .srt extension.
- Prefer dated naming for batch runs (for example output_YYYYMMDD.srt).
- Keep intermediate checks in a separate folder from final delivery files.
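The dated naming convention above can be generated rather than typed by hand. A minimal sketch, assuming the output_YYYYMMDD.srt pattern from the example; dated_srt_path and its parameters are hypothetical.

```python
from datetime import date
from pathlib import Path
from typing import Optional


def dated_srt_path(output_dir, stem="output", day: Optional[date] = None) -> Path:
    """Return <output_dir>/<stem>_YYYYMMDD.srt (hypothetical helper)."""
    day = day or date.today()  # default to today's date for batch runs
    return Path(output_dir) / f"{stem}_{day.strftime('%Y%m%d')}.srt"
```

Passing a separate output_dir for intermediate checks and for final delivery keeps the two sets of files apart.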
Notes
- For Chinese output (zh), the pipeline strips commas/periods only.
- If timings look off, inspect waveform-snap-related arguments before changing model size.
- This skill requires explicit invocation (allow_implicit_invocation: false).
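As a rough picture of the Chinese punctuation note above, the sketch below removes commas and periods and leaves every other mark in place. The exact character set used by the pipeline is not stated; treating both the ASCII and the full-width forms (, . ， 。) as stripped is an assumption, and strip_commas_periods is a hypothetical name.

```python
# Assumed character set: ASCII and full-width commas and periods.
STRIPPED = str.maketrans("", "", ",.，。")


def strip_commas_periods(text: str) -> str:
    """Drop commas and periods only; other punctuation is kept (illustrative)."""
    return text.translate(STRIPPED)
```

Question marks, exclamation marks, and quotation marks pass through unchanged under this assumption.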
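- Make sure OpenClaw is installed (locally or via Docker).
- Enter the install command in the chat box: /install audio-srt-workflow
- Once installation completes, invoke the skill by name or trigger it with /audio-srt-workflow
- Provide the required inputs as described in the skill's parameters to get structured output.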
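What is Audio SRT Workflow?
Generate or align SRT subtitles from audio using this repository. Use when the user asks for subtitle generation, transcript-to-audio alignment, timing clean... It is an AI Agent Skill plugin for Claude Code / OpenClaw, with 113 downloads to date.
How do I install Audio SRT Workflow?
Run the command /install audio-srt-workflow in the OpenClaw or Claude Code chat box to install it in one step; no additional configuration is needed.
Is Audio SRT Workflow free?
Yes. Audio SRT Workflow is completely free and released under the MIT-0 license; it can be freely downloaded, installed, and used.
Which platforms does Audio SRT Workflow support?
Audio SRT Workflow is cross-platform and can be used in any environment where OpenClaw / Claude Code is deployed.
Who develops Audio SRT Workflow?
It is developed and maintained by Sariel2018 (@sariel2018); the current version is v0.1.2.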