← Back to Skills Marketplace
nickylin

Whisper STT

by nic · GitHub ↗ · v1.0.0
cross-platform ✓ Security Clean
1230
Downloads
0
Stars
13
Active Installs
1
Versions
Install in OpenClaw
/install whisper-stt
Description
Free local speech-to-text transcription using OpenAI Whisper. Transcribe audio files (mp3, wav, m4a, ogg, etc.) to text without API costs. Use when: (1) User...
Usage Guidance
This skill appears safe and does what it says: local transcription using the open-source Whisper library. Before installing, be aware it will require pip-installing third-party packages (torch and openai-whisper) and that the first run will typically download large model files (hundreds of MBs to multiple GBs) to your machine — check disk space and network usage. Install torch from the official source appropriate for your GPU/OS to avoid incompatibilities. If you have privacy concerns, note transcription runs locally (no credentials are requested), but downloaded model files are cached on disk; inspect or run in a sandbox if you want stricter isolation. Finally, some model names listed (e.g., large-v3-turbo) may not be available in every whisper distribution — test with smaller models first.
Capability Analysis
Type: OpenClaw Skill Name: whisper-stt Version: 1.0.0 The OpenClaw skill bundle provides local speech-to-text transcription using OpenAI Whisper. All files, including the `SKILL.md` instructions, `scripts/demo.sh`, and `scripts/transcribe.py`, align with the stated purpose. The `transcribe.py` script utilizes the `whisper` library, which legitimately downloads models from known sources as part of its core functionality. There is no evidence of prompt injection attempts, unauthorized data exfiltration, malicious execution, persistence mechanisms, or other harmful behaviors.
Capability Assessment
Purpose & Capability
Name/description (local Whisper STT) match the files and runtime instructions. The included scripts implement local transcription using the openai-whisper package; no unrelated credentials, binaries, or services are requested.
Instruction Scope
SKILL.md and scripts only instruct installing Python packages (openai-whisper, torch), optionally ffmpeg, and running the local transcribe script. The scripts read the provided audio file and print formatted output; they do not access other system paths, environment variables, or remote endpoints directly from the skill code.
Install Mechanism
There is no install spec (instruction-only plus two script files) which is low risk. Runtime use of whisper.load_model will, however, typically download model weights from upstream hosts (large files). That network activity and large disk/cache use is expected behavior for Whisper but worth noting.
Credentials
The skill declares no required environment variables, credentials, or config paths. The code does not read secrets or unrelated env vars. The dependency list (openai-whisper, torch, optional ffmpeg) is proportionate to local transcription.
Persistence & Privilege
always is false and the skill does not request persistent elevated privileges or modify other skills. It runs only when invoked and does not attempt to change global agent settings.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install whisper-stt
  3. After installation, invoke the skill by name or use /whisper-stt
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
- Initial release of the whisper-stt skill for free, local speech-to-text transcription using OpenAI Whisper. - Supports a range of audio/video input formats (mp3, wav, m4a, ogg, etc.) without API costs. - Multiple output formats available: json, txt, srt, and vtt (for subtitles). - Configurable model sizes for performance vs. accuracy tradeoffs. - Option to specify target language and leverage GPU acceleration if available. - Comprehensive usage instructions and troubleshooting included.
Metadata
Slug whisper-stt
Version 1.0.0
License
All-time Installs 15
Active Installs 13
Total Versions 1
Frequently Asked Questions

What is Whisper STT?

Free local speech-to-text transcription using OpenAI Whisper. Transcribe audio files (mp3, wav, m4a, ogg, etc.) to text without API costs. Use when: (1) User... It is an AI Agent Skill for Claude Code / OpenClaw, with 1230 downloads so far.

How do I install Whisper STT?

Run "/install whisper-stt" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Whisper STT free?

Yes, Whisper STT is completely free (open-source). You can download, install and use it at no cost.

Which platforms does Whisper STT support?

Whisper STT is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Whisper STT?

It is built and maintained by nic (@nickylin); the current version is v1.0.0.

💬 Comments