← Back to Skills Marketplace
allanmeng

Whisper GPU Audio Transcriber

by Allan.M · GitHub ↗ · v1.0.3 · MIT-0
cross-platform ✓ Security Clean
154
Downloads
0
Stars
0
Active Installs
4
Versions
Install in OpenClaw
/install whisper-gpu-transcriber-skill
Description
Convert audio to SRT subtitles using OpenAI Whisper with automatic GPU acceleration for Intel XPU / NVIDIA CUDA / AMD ROCm / Apple Metal. Ideal for content c...
Usage Guidance
This skill appears to do exactly what it claims: run a local Whisper model to convert audio to SRT. Before installing or running it, consider: (1) first run will download large model files (~1–2 GB) to ~/.cache/whisper (network and disk usage); (2) you must install a PyTorch build that matches your hardware (the SKILL.md points to the official PyTorch index); (3) run the script in a controlled environment (virtualenv/container) if you are cautious about running third‑party Python code; (4) the code does not request credentials or exfiltrate data, but the model download comes from the internet — if you prefer, manually obtain model files from trusted mirrors and place them in ~/.cache/whisper. Overall the skill is internally consistent and coherent with its stated purpose.
Capability Analysis
Type: OpenClaw Skill Name: whisper-gpu-transcriber-skill Version: 1.0.3 The skill is a legitimate wrapper for the OpenAI Whisper library, designed to transcribe audio files to SRT subtitles with GPU acceleration. The core logic in `scripts/transcribe.py` focuses on hardware detection (CUDA, XPU, MPS, ROCm) and standard file I/O, with no evidence of data exfiltration, malicious execution, or prompt injection.
Capability Assessment
Purpose & Capability
Name/description (Whisper GPU transcription) match the requested artifacts: a Python script that loads a local Whisper model and transcribes audio. Declared binaries (python) and suggested pip packages are appropriate for the stated purpose.
Instruction Scope
SKILL.md instructs the agent to run scripts/transcribe.py with a given audio path. The script only reads the provided audio file, loads a local model, transcribes to SRT, and writes the SRT next to the audio — it does not read unrelated files, export environment variables, or call remote endpoints in code.
Install Mechanism
This is instruction-only in the registry (no formal install spec), but SKILL.md instructs pip installs (openai-whisper and a PyTorch wheel/index). Those are expected. Note: the first run will download model files (~1.5GB) from the Whisper model host — this network activity is expected for local model use and may be large; mirrors or manual downloads are suggested in the docs for regions with restricted access.
Credentials
The skill does not request environment variables, credentials, or config paths. The script only accesses the specified audio file and writes an SRT file; no sensitive credentials are required.
Persistence & Privilege
always is false and the skill does not request permanent agent presence or modify other skills/system-wide settings. It runs a local script on demand.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install whisper-gpu-transcriber-skill
  3. After installation, invoke the skill by name or use /whisper-gpu-transcriber-skill
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.3
Version 1.0.3 - No file changes detected; this is a version update with no modifications to the code or documentation.
v1.0.2
- Updated documentation (SKILL.md) for improved clarity and organization. - Split instructions and information into separate English and Chinese sections. - Refined use case descriptions, usage instructions, and requirements. - No core functionality or feature changes; update is documentation only.
v1.0.1
- Changed install method for openai-whisper from uv to pip in the skill metadata. - Updated requirements to state openai-whisper is now automatically installed by ClawHub using pip. - No changes to functionality or other documentation detected.
v1.0.0
Initial release of whisper-gpu-transcriber-skill: - Convert audio files to SRT subtitles using OpenAI Whisper with automatic GPU acceleration (supports Intel, NVIDIA, AMD, Apple M Series). - Fully offline, free, and works as an alternative to paid subtitle tools. - Simple usage: just specify your audio file to get SRT subtitles. - Advanced features: select Whisper models and target subtitle language. - Auto-detects available GPU and uses optimal acceleration. - Supports multiple Whisper models; default model is turbo. - Includes clear setup instructions and requirements for all supported platforms.
Metadata
Slug whisper-gpu-transcriber-skill
Version 1.0.3
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 4
Frequently Asked Questions

What is Whisper GPU Audio Transcriber?

Convert audio to SRT subtitles using OpenAI Whisper with automatic GPU acceleration for Intel XPU / NVIDIA CUDA / AMD ROCm / Apple Metal. Ideal for content c... It is an AI Agent Skill for Claude Code / OpenClaw, with 154 downloads so far.

How do I install Whisper GPU Audio Transcriber?

Run "/install whisper-gpu-transcriber-skill" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Whisper GPU Audio Transcriber free?

Yes, Whisper GPU Audio Transcriber is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Whisper GPU Audio Transcriber support?

Whisper GPU Audio Transcriber is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Whisper GPU Audio Transcriber?

It is built and maintained by Allan.M (@allanmeng); the current version is v1.0.3.

💬 Comments