← Back to Skills Marketplace
josunlp

Whisper Transcribe

by Jonas Pfalzgraf · GitHub ↗ · v1.0.0
cross-platform ✓ Security Clean
1395
Downloads
2
Stars
10
Active Installs
1
Versions
Install in OpenClaw
/install whisper-transcribe
Description
Transcribe audio files to text using OpenAI Whisper. Supports speech-to-text with auto language detection, multiple output formats (txt, srt, vtt, json), batch processing, and model selection (tiny to large). Use when transcribing audio recordings, podcasts, voice messages, lectures, meetings, or any audio/video file to text. Handles mp3, wav, m4a, ogg, flac, webm, opus, aac formats.
Usage Guidance
This skill appears to do what it claims: wrap the local 'whisper' CLI to transcribe audio. Before installing/using it: (1) ensure you trust the 'openai-whisper' pip package source and install it in a virtualenv to limit install-time risks, (2) install ffmpeg separately (the script assumes it exists), (3) be aware models are downloaded at first run and may be large and require network access and disk space, and (4) avoid running the tool on sensitive audio in untrusted environments. If you want extra assurance, inspect the pip package source before installing or run the transcription inside a container/VM.
Capability Analysis
Type: OpenClaw Skill Name: whisper-transcribe Version: 1.0.0 The skill bundle is benign. The `SKILL.md` provides clear, non-malicious instructions for transcribing audio using the `whisper` CLI. The `scripts/transcribe.sh` script is a well-structured shell wrapper that safely executes the `whisper` command by constructing arguments in an array (`"${args[@]}"`), mitigating shell injection risks. It performs necessary file system operations (creating output directories and writing transcription files) directly aligned with its stated purpose and does not exhibit any signs of data exfiltration, persistence, unauthorized network activity, or prompt injection attempts against the agent.
Capability Assessment
Purpose & Capability
Name/description (Whisper transcription) matches the included script and SKILL.md. Required tools (whisper CLI, ffmpeg) are appropriate for the stated functionality; no unrelated binaries or credentials are requested.
Instruction Scope
SKILL.md and the script only instruct running the local wrapper against user-provided audio files and writing transcripts to the same or a specified output directory. The script does not read extraneous system files, environment variables, or attempt network exfiltration itself.
Install Mechanism
This is an instruction-only skill (no install spec). The SKILL.md recommends installing the openai-whisper package via pip; that will download packages and the whisper runtime may download model files (~MBs to GBs) at first run. This network activity and package installation is expected for this purpose but is the primary point where external code/data is fetched—run in a virtualenv or isolated environment if you want to limit risk.
Credentials
No environment variables, credentials, or config paths are requested. The script does not access secrets or unrelated system configurations.
Persistence & Privilege
always:false and no install-time persistence. The skill does not modify other skills or system-wide settings and requests no elevated privileges.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install whisper-transcribe
  3. After installation, invoke the skill by name or use /whisper-transcribe
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
Initial release: batch processing, auto language detection, multiple formats, model selection
Metadata
Slug whisper-transcribe
Version 1.0.0
License
All-time Installs 12
Active Installs 10
Total Versions 1
Frequently Asked Questions

What is Whisper Transcribe?

Transcribe audio files to text using OpenAI Whisper. Supports speech-to-text with auto language detection, multiple output formats (txt, srt, vtt, json), batch processing, and model selection (tiny to large). Use when transcribing audio recordings, podcasts, voice messages, lectures, meetings, or any audio/video file to text. Handles mp3, wav, m4a, ogg, flac, webm, opus, aac formats. It is an AI Agent Skill for Claude Code / OpenClaw, with 1395 downloads so far.

How do I install Whisper Transcribe?

Run "/install whisper-transcribe" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Whisper Transcribe free?

Yes, Whisper Transcribe is completely free (open-source). You can download, install and use it at no cost.

Which platforms does Whisper Transcribe support?

Whisper Transcribe is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Whisper Transcribe?

It is built and maintained by Jonas Pfalzgraf (@josunlp); the current version is v1.0.0.

💬 Comments