← Back to Skills Marketplace

Whisper GPU Audio Transcriber

Name: Whisper GPU Audio Transcriber
Author: allanmeng

by Allan.M · GitHub ↗ · v1.0.3 · MIT-0

cross-platform ✓ Security Clean

154

Downloads

Stars

Active Installs

Versions

Install in OpenClaw

/install whisper-gpu-transcriber-skill

Description

Convert audio to SRT subtitles using OpenAI Whisper with automatic GPU acceleration for Intel XPU / NVIDIA CUDA / AMD ROCm / Apple Metal. Ideal for content c...

Usage Guidance

This skill appears to do exactly what it claims: run a local Whisper model to convert audio to SRT. Before installing or running it, consider: (1) first run will download large model files (~1–2 GB) to ~/.cache/whisper (network and disk usage); (2) you must install a PyTorch build that matches your hardware (the SKILL.md points to the official PyTorch index); (3) run the script in a controlled environment (virtualenv/container) if you are cautious about running third‑party Python code; (4) the code does not request credentials or exfiltrate data, but the model download comes from the internet — if you prefer, manually obtain model files from trusted mirrors and place them in ~/.cache/whisper. Overall the skill is internally consistent and coherent with its stated purpose.

Capability Analysis

Type: OpenClaw Skill Name: whisper-gpu-transcriber-skill Version: 1.0.3 The skill is a legitimate wrapper for the OpenAI Whisper library, designed to transcribe audio files to SRT subtitles with GPU acceleration. The core logic in `scripts/transcribe.py` focuses on hardware detection (CUDA, XPU, MPS, ROCm) and standard file I/O, with no evidence of data exfiltration, malicious execution, or prompt injection.

Capability Assessment

✓ Purpose & Capability

Name/description (Whisper GPU transcription) match the requested artifacts: a Python script that loads a local Whisper model and transcribes audio. Declared binaries (python) and suggested pip packages are appropriate for the stated purpose.

✓ Instruction Scope

SKILL.md instructs the agent to run scripts/transcribe.py with a given audio path. The script only reads the provided audio file, loads a local model, transcribes to SRT, and writes the SRT next to the audio — it does not read unrelated files, export environment variables, or call remote endpoints in code.

ℹ Install Mechanism

This is instruction-only in the registry (no formal install spec), but SKILL.md instructs pip installs (openai-whisper and a PyTorch wheel/index). Those are expected. Note: the first run will download model files (~1.5GB) from the Whisper model host — this network activity is expected for local model use and may be large; mirrors or manual downloads are suggested in the docs for regions with restricted access.

✓ Credentials

The skill does not request environment variables, credentials, or config paths. The script only accesses the specified audio file and writes an SRT file; no sensitive credentials are required.

✓ Persistence & Privilege

always is false and the skill does not request permanent agent presence or modify other skills/system-wide settings. It runs a local script on demand.

How to Use

Make sure OpenClaw is installed (local or Docker)
Run the install command in chat: /install whisper-gpu-transcriber-skill
After installation, invoke the skill by name or use /whisper-gpu-transcriber-skill
Provide required inputs per the skill's parameter spec and get structured output

Version History

v1.0.3

Version 1.0.3 - No file changes detected; this is a version update with no modifications to the code or documentation.

v1.0.2

- Updated documentation (SKILL.md) for improved clarity and organization. - Split instructions and information into separate English and Chinese sections. - Refined use case descriptions, usage instructions, and requirements. - No core functionality or feature changes; update is documentation only.

v1.0.1

- Changed install method for openai-whisper from uv to pip in the skill metadata. - Updated requirements to state openai-whisper is now automatically installed by ClawHub using pip. - No changes to functionality or other documentation detected.

v1.0.0

Initial release of whisper-gpu-transcriber-skill: - Convert audio files to SRT subtitles using OpenAI Whisper with automatic GPU acceleration (supports Intel, NVIDIA, AMD, Apple M Series). - Fully offline, free, and works as an alternative to paid subtitle tools. - Simple usage: just specify your audio file to get SRT subtitles. - Advanced features: select Whisper models and target subtitle language. - Auto-detects available GPU and uses optimal acceleration. - Supports multiple Whisper models; default model is turbo. - Includes clear setup instructions and requirements for all supported platforms.

Metadata

Slug whisper-gpu-transcriber-skill

Version 1.0.3

License MIT-0

All-time Installs 0

Active Installs 0

Total Versions 4

Frequently Asked Questions

What is Whisper GPU Audio Transcriber?

Convert audio to SRT subtitles using OpenAI Whisper with automatic GPU acceleration for Intel XPU / NVIDIA CUDA / AMD ROCm / Apple Metal. Ideal for content c... It is an AI Agent Skill for Claude Code / OpenClaw, with 154 downloads so far.

How do I install Whisper GPU Audio Transcriber?

Run "/install whisper-gpu-transcriber-skill" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Whisper GPU Audio Transcriber free?

Yes, Whisper GPU Audio Transcriber is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Whisper GPU Audio Transcriber support?

Whisper GPU Audio Transcriber is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Whisper GPU Audio Transcriber?

It is built and maintained by Allan.M (@allanmeng); the current version is v1.0.3.

More Skills