Audio SRT Workflow
/install audio-srt-workflow
Audio SRT Workflow
Use this skill for end-to-end subtitle work.
This package is self-contained for runtime entrypoints:
scripts/align_to_srt.pyscripts/gui_app.pyscripts/srt_stats.pyscripts/make_preview_mp4.pyscripts/requirements.txt
Scope
- Mode A: audio + reference text -> aligned SRT
- Mode B: audio only -> auto subtitle SRT
- Timing QA with
srt_stats.py - Burned preview generation with
make_preview_mp4.py
Inputs To Collect First
- Audio path (
wav,mp3,m4a, ...) - Whether a reference transcript is available
- Output SRT path (or output directory)
- Language hint (
zh,en, ...) - Preferred run style: CLI, GUI, or Python API
Decision Rule
- If transcript exists, run Mode A (
align_to_srt.py --text ...). - If transcript does not exist, run Mode B via GUI or Python API (
run_auto_subtitle_pipeline).
Workflow
- Validate environment and paths.
- Choose Mode A or Mode B by transcript availability.
- Run subtitle generation from packaged scripts.
- Run timing diagnostics (
srt_stats.py). - If needed, render a preview mp4 with burned subtitles.
Resolve Skill Script Path
Set a local variable to your installed skill directory.
Codex default path:
SKILL_DIR="${CODEX_HOME:-$HOME/.codex}/skills/audio-srt-workflow"
OpenClaw/ClawHub install path example:
SKILL_DIR="\x3Cyour-workdir>/skills/audio-srt-workflow"
Environment Checks
Run these checks before execution:
python3 --version
ffmpeg -version
python3 -c "import faster_whisper; print('ok')"
If faster-whisper import fails:
# Review dependencies before installing:
cat "$SKILL_DIR/scripts/requirements.txt"
pip install -r "$SKILL_DIR/scripts/requirements.txt"
Mode A Command Template (Audio + Transcript)
python3 "$SKILL_DIR/scripts/align_to_srt.py" \
--audio "\x3Cinput_audio>" \
--text "\x3Ctranscript_txt>" \
--output "\x3Coutput_srt>" \
--model small \
--language zh
Mode B Command Template (Audio Only)
GUI:
python3 "$SKILL_DIR/scripts/gui_app.py"
Or use Python API in scripts:
- Build config with
build_alignment_config(...) - Run
run_auto_subtitle_pipeline(...)
See command details in references/command-templates.md.
QA And Preview
Timing stats:
python3 "$SKILL_DIR/scripts/srt_stats.py" --srt "\x3Coutput_srt>"
Preview video:
python3 "$SKILL_DIR/scripts/make_preview_mp4.py" \
--audio "\x3Cinput_audio>" \
--srt "\x3Coutput_srt>" \
--output "\x3Cpreview_mp4>"
Output Conventions
- Default output uses
.srtextension. - Prefer dated naming for batch runs (for example
output_YYYYMMDD.srt). - Keep intermediate checks in a separate folder from final delivery files.
Notes
- For Chinese output (
zh), the pipeline strips commas/periods only. - If timings look off, inspect waveform snap related arguments before changing model size.
- This skill requires explicit invocation (
allow_implicit_invocation: false).
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat:
/install audio-srt-workflow - After installation, invoke the skill by name or use
/audio-srt-workflow - Provide required inputs per the skill's parameter spec and get structured output
What is Audio SRT Workflow?
Generate or align SRT subtitles from audio using this repository. Use when the user asks for subtitle generation, transcript-to-audio alignment, timing clean... It is an AI Agent Skill for Claude Code / OpenClaw, with 113 downloads so far.
How do I install Audio SRT Workflow?
Run "/install audio-srt-workflow" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.
Is Audio SRT Workflow free?
Yes, Audio SRT Workflow is completely free, licensed under MIT-0. You can download, install and use it at no cost.
Which platforms does Audio SRT Workflow support?
Audio SRT Workflow is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).
Who created Audio SRT Workflow?
It is built and maintained by Sariel2018 (@sariel2018); the current version is v0.1.2.