Bilibili Audio Transcribe
Convert a Bilibili link into local transcript files. Use this skill only for bilibili.com and b23.tv URLs; do not use it for YouTube or generic web pages.
Quick start
- Ensure ffmpeg and ffprobe are on PATH.
- If Python dependencies are missing, run scripts/bootstrap_env.sh or install yt-dlp and faster-whisper manually.
- Run scripts/transcribe_bilibili.py with the target URL.
- Return the generated transcript artifacts, or summarize them if the user asked for analysis rather than raw text.
Example:
python scripts/transcribe_bilibili.py "https://b23.tv/SSx810h" \
--out-dir ./downloads/bilibili-audio \
--model-size base \
--beam-size 3
Workflow
1. Validate the request
Accept only bilibili.com or b23.tv URLs. If the URL points elsewhere, stop and say this skill is the wrong tool.
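The validation step could look roughly like the following. This is a minimal sketch, not the bundled script's own check: the function name is_bilibili_url and the decision to accept subdomains such as www.bilibili.com are assumptions made for illustration.

```python
from urllib.parse import urlparse

# Hosts this skill accepts; the suffix match below also admits subdomains.
ALLOWED_HOSTS = ("bilibili.com", "b23.tv")


def is_bilibili_url(url: str) -> bool:
    """Return True if url is an http(s) link on bilibili.com or b23.tv.

    Hypothetical helper, not part of scripts/transcribe_bilibili.py.
    """
    try:
        parsed = urlparse(url.strip())
        host = (parsed.hostname or "").lower()
    except ValueError:
        # urlparse rejects some malformed inputs, e.g. broken IPv6 literals.
        return False
    if parsed.scheme not in ("http", "https"):
        return False
    return any(host == h or host.endswith("." + h) for h in ALLOWED_HOSTS)
```

Matching on the parsed hostname rather than on a substring avoids accepting look-alike domains that merely contain "bilibili.com" somewhere in the URL.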
2. Prefer the bundled script
Use scripts/transcribe_bilibili.py instead of rewriting yt-dlp / whisper glue code in the session. The script:
- downloads the best available audio
- probes audio duration with ffprobe
- transcribes with faster-whisper
- writes .txt, .srt, and .segments.json
- prints coarse ETA / progress during ASR
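To make the .srt output concrete, here is a hedged sketch of how timed segments can be rendered as SRT cues. It assumes segments arrive as (start, end, text) triples with times in seconds; the bundled script may structure its data differently, and both function names are hypothetical.

```python
def srt_timestamp(seconds: float) -> str:
    """Format a time in seconds as an SRT timestamp, HH:MM:SS,mmm."""
    total_ms = max(0, round(seconds * 1000))
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def segments_to_srt(segments) -> str:
    """Render (start, end, text) triples as numbered SRT cues.

    Illustrative only; the triple layout is an assumption.
    """
    cues = []
    for index, (start, end, text) in enumerate(segments, start=1):
        cues.append(
            f"{index}\n{srt_timestamp(start)} --> {srt_timestamp(end)}\n{text.strip()}\n"
        )
    # A blank line separates consecutive cues.
    return "\n".join(cues)
```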
3. Use sensible defaults
Default to:
- --model-size base
- --beam-size 3
- --language zh
- VAD enabled
These defaults are optimized for Chinese Bilibili speech. If the video is clearly non-Chinese or mixed-language, rerun with --language auto or a specific language code.
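One way to keep these defaults in a single place is a small helper that assembles the command line. This is a sketch under stated assumptions: build_command is a hypothetical name, and it uses only the flags that appear in this document (--out-dir, --model-size, --beam-size, --language, --no-vad).

```python
import sys


def build_command(url, out_dir="./downloads/bilibili-audio", model_size="base",
                  beam_size=3, language="zh", vad=True):
    """Assemble the argv list for scripts/transcribe_bilibili.py.

    Hypothetical wrapper; the defaults mirror the ones listed above.
    """
    cmd = [
        sys.executable, "scripts/transcribe_bilibili.py", url,
        "--out-dir", out_dir,
        "--model-size", model_size,
        "--beam-size", str(beam_size),
        "--language", language,
    ]
    if not vad:
        # VAD is on by default, so a flag is added only to turn it off.
        cmd.append("--no-vad")
    return cmd
```

The resulting list can be passed to subprocess.run(cmd, check=True). A rerun for mixed-language audio would then be build_command(url, language="auto").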
4. Keep outputs predictable
By default the script writes into downloads/bilibili-audio/ relative to the current working directory. Keep all three artifacts unless the user explicitly asks for fewer outputs:
- transcript text: .txt
- subtitles: .srt
- segment metadata: .segments.json
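After a run, a quick check that all three artifacts exist can catch partial failures. The sketch below assumes the three files share a common filename stem inside the output directory; the script's actual naming scheme is not stated here, so treat stem and the helper name missing_artifacts as hypothetical.

```python
from pathlib import Path

# The three output suffixes named in this section.
EXPECTED_SUFFIXES = (".txt", ".srt", ".segments.json")


def missing_artifacts(out_dir, stem):
    """Return the expected output paths for stem that are absent from out_dir."""
    base = Path(out_dir)
    candidates = [base / f"{stem}{suffix}" for suffix in EXPECTED_SUFFIXES]
    return [path for path in candidates if not path.is_file()]
```

An empty return value means all three files are present.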
5. Handle failures directly
If a run fails:
- missing ffmpeg / ffprobe → install the system dependency first
- missing Python packages → run scripts/bootstrap_env.sh
- extractor or redirect issues → retry with the resolved canonical Bilibili URL
- poor transcript quality → rerun with --language auto, a larger model, or --no-vad when speech is clipped
Read references/troubleshooting.md when dependency setup or extraction fails.
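The first two failure cases can be detected before running anything. The following is a minimal preflight sketch, assuming the Python packages are importable as yt_dlp and faster_whisper; the helper name missing_dependencies is hypothetical and not part of the bundled scripts.

```python
import importlib.util
import shutil


def missing_dependencies(tools=("ffmpeg", "ffprobe"),
                         modules=("yt_dlp", "faster_whisper")):
    """Return (missing_tools, missing_modules) for the current environment."""
    # shutil.which returns None when an executable is not found on PATH.
    missing_tools = [t for t in tools if shutil.which(t) is None]
    # find_spec returns None when a top-level module cannot be located.
    missing_modules = [m for m in modules if importlib.util.find_spec(m) is None]
    return missing_tools, missing_modules
```

If missing_tools is non-empty, install the system dependency first; if missing_modules is non-empty, run scripts/bootstrap_env.sh.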
Resources
scripts/
- transcribe_bilibili.py: main downloader + ASR pipeline
- bootstrap_env.sh: create a virtualenv and install Python dependencies
- requirements.txt: Python package list for the bootstrap script
references/
- troubleshooting.md: dependency and extraction failure playbook
- Make sure OpenClaw is installed (local or Docker).
- Run the install command in chat: /install bilibili-audio-transcribe
- After installation, invoke the skill by name or use /bilibili-audio-transcribe
- Provide the required inputs per the skill's parameter spec and get structured output.
What is Bilibili Audio Transcribe?
Download audio from Bilibili or b23.tv links and transcribe it into txt, srt, and segment JSON with yt-dlp, ffmpeg, and faster-whisper. Use when a user asks... It is an AI Agent Skill for Claude Code / OpenClaw, with 137 downloads so far.
How do I install Bilibili Audio Transcribe?
Run "/install bilibili-audio-transcribe" in the OpenClaw or Claude Code chat to install the skill in one step. The skill itself still needs ffmpeg, ffprobe, and the Python dependencies listed under Quick start.
Is Bilibili Audio Transcribe free?
Yes, Bilibili Audio Transcribe is completely free, licensed under MIT-0. You can download, install and use it at no cost.
Which platforms does Bilibili Audio Transcribe support?
Bilibili Audio Transcribe is cross-platform and runs anywhere OpenClaw / Claude Code is available.
Who created Bilibili Audio Transcribe?
It is built and maintained by yizh4ng (@yizh4ng); the current version is v0.1.0.