
Whisper Transcriber

by vvusu · GitHub ↗ · v1.0.0
cross-platform ✓ Security Clean
Downloads: 363 · Stars: 1 · Active Installs: 2 · Versions: 1
Install in OpenClaw
/install whisper-transcriber
Description
Offline speech-to-text (ASR) using whisper.cpp (whisper-cli) + ffmpeg. Supports batch transcription, timestamps, SRT/TXT/JSON outputs, and model download. Cr...
Usage Guidance
This skill appears to do what it says, but review the installer and scripts before running them:
  1. The installer calls your system package manager (which may prompt for sudo) and downloads large model files from https://huggingface.co/ggerganov/whisper.cpp. Make sure you have enough disk space and that you trust that source.
  2. config.json ships with empty modelsSha256 entries, so by default the integrity check has no expected hashes to compare against. The scripts verify a download only if those fields are filled in and the host provides shasum or sha256sum; the small node -e helper used for verification also requires node.
  3. Prefer running install.sh and transcribe.sh manually after inspecting them. Do not run them as root unless you understand the package installs, and consider an isolated environment (VM or container) for extra safety.
  4. If you enable autonomous agent invocation, remember that the skill downloads models and runs local binaries on any audio it processes. Make sure your agent's permissions and inputs are appropriate.
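The optional integrity check described above can be sketched as follows. This is a minimal illustration, not the skill's own code: the config.json layout (a modelsSha256 object mapping model names to hex digests) and the function names are assumptions made for the example.

```python
import hashlib
import json
from pathlib import Path


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Hash the file in chunks so large models are never fully loaded into memory."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_model(config_path: Path, model_name: str, model_path: Path) -> str:
    """Return 'ok', 'mismatch', or 'skipped' when no expected hash is configured."""
    config = json.loads(config_path.read_text(encoding="utf-8"))
    # Assumed layout: {"modelsSha256": {"<model name>": "<hex digest or empty string>"}}
    expected = config.get("modelsSha256", {}).get(model_name, "").strip().lower()
    if not expected:
        # Mirrors the default state described above: empty entries mean no check.
        return "skipped"
    return "ok" if sha256_of(model_path) == expected else "mismatch"
```

A 'skipped' result is worth surfacing to the user rather than treating as success, since it means the download was never compared against a known hash.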
Capability Analysis
Type: OpenClaw Skill
Name: whisper-transcriber
Version: 1.0.0
The skill is a legitimate offline speech-to-text tool utilizing whisper.cpp and ffmpeg. The installation and transcription scripts (install.sh, transcribe.sh) perform expected actions such as system package management and downloading models from Hugging Face (huggingface.co). No evidence of data exfiltration, persistence, or malicious prompt injection was found.
Capability Assessment
Purpose & Capability
The name and description call for ffmpeg and whisper-cli to perform audio conversion and offline ASR; the scripts, download behavior, and config match that purpose. The required binaries and model downloads are appropriate for an offline whisper.cpp-based transcriber.
Instruction Scope
SKILL.md and the scripts instruct the agent or user to run install.sh and transcribe.sh, which install packages, download models, create temp dirs, convert audio, and invoke whisper-cli. The instructions are scoped to the transcription task, but the runtime scripts will: (1) call system package managers (may use sudo), (2) download model files ranging from hundreds of MB to several GB from Hugging Face, (3) optionally use node -e to read config.json for sha256 verification. None of these expand scope beyond transcription, but they do perform privileged installs and large external downloads, so review them before running.
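The convert-then-transcribe flow described above can be sketched by building the two command lines without running them. This is an illustration under assumptions, not a copy of transcribe.sh: the function name, the 16 kHz mono WAV conversion settings, and the chosen output flags are assumptions, and the real script may pass different options.

```python
from pathlib import Path


def build_commands(audio: Path, model: Path, work_dir: Path) -> list[list[str]]:
    """Return the ffmpeg and whisper-cli argument lists for one input file."""
    wav = work_dir / (audio.stem + ".wav")
    out_base = audio.with_suffix("")  # whisper-cli appends .txt / .srt itself
    # Assumed conversion: 16 kHz, mono, 16-bit PCM WAV.
    ffmpeg_cmd = [
        "ffmpeg", "-y", "-i", str(audio),
        "-ar", "16000", "-ac", "1", "-c:a", "pcm_s16le", str(wav),
    ]
    # -otxt and -osrt request TXT and SRT outputs; -of sets the output base name.
    whisper_cmd = [
        "whisper-cli", "-m", str(model), "-f", str(wav),
        "-otxt", "-osrt", "-of", str(out_base),
    ]
    return [ffmpeg_cmd, whisper_cmd]
```

To execute the commands, each list could be passed to subprocess.run(cmd, check=True) inside a tempfile.TemporaryDirectory() used as work_dir, so the intermediate WAV is cleaned up afterwards.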
Install Mechanism
There is no platform install spec in the registry; the repo provides a cross-platform install.sh that uses package managers (brew/apt/dnf/yum/pacman/zypper) and curl/wget to download models from huggingface.co (a well-known host). No URL shorteners or personal IPs were observed. This is a common, expected install mechanism for model-based tools.
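The install mechanism above amounts to picking the first available package manager and deriving a model download URL. The sketch below is hypothetical and is not taken from install.sh: the probing order, the function names, and the resolve/main/ggml-<name>.bin URL pattern are assumptions.

```python
import shutil
from typing import Callable, Optional

# Package managers named in the listing, probed in this (assumed) order.
PACKAGE_MANAGERS = ("brew", "apt", "dnf", "yum", "pacman", "zypper")
MODEL_REPO = "https://huggingface.co/ggerganov/whisper.cpp"


def detect_package_manager(
    which: Callable[[str], Optional[str]] = shutil.which,
) -> Optional[str]:
    """Return the first package manager found on PATH, or None if there is none."""
    for name in PACKAGE_MANAGERS:
        if which(name):
            return name
    return None


def model_url(model_name: str) -> str:
    """Build a download URL, assuming ggml-<name>.bin files under resolve/main."""
    return f"{MODEL_REPO}/resolve/main/ggml-{model_name}.bin"
```

Passing the lookup function as a parameter keeps the detection logic testable on machines where none of these tools are installed.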
Credentials
The skill declares no required credentials and only optional environment variables for model/temporary directories and defaults (WHISPER_DEFAULT_MODEL, WHISPER_MODEL_DIR, WHISPER_TEMP_DIR, etc.). Scripts do not attempt to read unrelated secrets or system credentials.
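As a rough illustration of how the optional environment variables might be resolved, the sketch below reads the three variables named above and falls back to defaults. The default values ("base", "assets/models", and the system temp directory) and the Settings class are assumptions for the example, not values confirmed by the skill.

```python
import os
import tempfile
from dataclasses import dataclass
from typing import Mapping


@dataclass(frozen=True)
class Settings:
    default_model: str
    model_dir: str
    temp_dir: str


def load_settings(env: Mapping[str, str] = os.environ) -> Settings:
    """Read the optional WHISPER_* variables, using assumed defaults when unset."""
    return Settings(
        default_model=env.get("WHISPER_DEFAULT_MODEL", "base"),    # assumed default
        model_dir=env.get("WHISPER_MODEL_DIR", "assets/models"),   # assumed default
        temp_dir=env.get("WHISPER_TEMP_DIR", tempfile.gettempdir()),
    )
```

Only the named variables are read, which matches the listing's observation that the scripts do not touch unrelated secrets.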
Persistence & Privilege
The skill is declared with always: false and is user-invocable. It does not request permanent platform-level presence or modify other skills' config. The install scripts create files under the skill's assets/models directory and may install system packages via package managers (normal for installers).
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install whisper-transcriber
  3. After installation, invoke the skill by name or use /whisper-transcriber
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
Initial public release v1.0.0: offline ASR via whisper.cpp (whisper-cli) + ffmpeg; cross-platform installer (macOS/Linux) with Windows WSL2 recommendation; auto-download models to assets/models; optional sha256 verification; safer temp dir (mktemp).
Metadata
Slug whisper-transcriber
Version 1.0.0
License
All-time Installs 2
Active Installs 2
Total Versions 1
Frequently Asked Questions

What is Whisper Transcriber?

Offline speech-to-text (ASR) using whisper.cpp (whisper-cli) + ffmpeg. Supports batch transcription, timestamps, SRT/TXT/JSON outputs, and model download. Cr... It is an AI Agent Skill for Claude Code / OpenClaw, with 363 downloads so far.

How do I install Whisper Transcriber?

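Run "/install whisper-transcriber" in the OpenClaw or Claude Code chat. The skill's install.sh still has to install ffmpeg and whisper-cli through your system package manager and download model files, so review it before it runs.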

Is Whisper Transcriber free?

Yes, Whisper Transcriber is completely free (open-source). You can download, install and use it at no cost.

Which platforms does Whisper Transcriber support?

Whisper Transcriber is listed as cross-platform. Its installer targets macOS and Linux, and WSL2 is recommended on Windows.

Who created Whisper Transcriber?

It is built and maintained by vvusu (@vvusu); the current version is v1.0.0.
