
Whisper STT

Name: Whisper STT
Author: nickylin

by nic · GitHub ↗ · v1.0.0

cross-platform ✓ Security Clean

Downloads: 1230

Install in OpenClaw

/install whisper-stt

Description

Free local speech-to-text transcription using OpenAI Whisper. Transcribe audio files (mp3, wav, m4a, ogg, etc.) to text without API costs. Use when: (1) User...

Usage Guidance

This skill appears safe and does what it says: local transcription using the open-source Whisper library. Before installing, be aware that it requires pip-installing third-party packages (torch and openai-whisper) and that the first run will typically download large model files (hundreds of MB to several GB) to your machine, so check disk space and network usage. Install torch from the official source appropriate for your GPU/OS to avoid incompatibilities. Transcription runs locally and no credentials are requested, but downloaded model files are cached on disk; if you want stricter isolation, inspect the scripts or run the skill in a sandbox. Finally, some model names listed (e.g., large-v3-turbo) may not be available in every whisper distribution, so test with smaller models first.
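Since a listed model name may be missing from a given whisper install, one way to guard against that is to check the installed package's model list before loading. The following is a minimal sketch, not the skill's own code: the helper names and the preference order are hypothetical, and it assumes the openai-whisper package, whose `whisper.available_models()` and `whisper.load_model()` functions are used here.

```python
from typing import Iterable, Optional, Sequence

# Hypothetical preference order for illustration: try the newest name first,
# then fall back to smaller models.
PREFERRED_MODELS = ("large-v3-turbo", "small", "base", "tiny")


def pick_model(available: Iterable[str],
               preferred: Sequence[str] = PREFERRED_MODELS) -> Optional[str]:
    """Return the first preferred model name found in `available`, else None."""
    available_set = set(available)
    for name in preferred:
        if name in available_set:
            return name
    return None


def load_first_available(preferred: Sequence[str] = PREFERRED_MODELS):
    """Load the first preferred model that the installed whisper package lists."""
    import whisper  # lazy import so pick_model works without whisper installed

    name = pick_model(whisper.available_models(), preferred)
    if name is None:
        raise RuntimeError(f"None of {list(preferred)} is available in this install")
    return whisper.load_model(name)  # the first call downloads the weights
```

Keeping `pick_model` free of any whisper import means the selection logic can be tried out before the large dependencies are installed.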

Capability Analysis

Type: OpenClaw Skill
Name: whisper-stt
Version: 1.0.0

The OpenClaw skill bundle provides local speech-to-text transcription using OpenAI Whisper. All files, including the `SKILL.md` instructions, `scripts/demo.sh`, and `scripts/transcribe.py`, align with the stated purpose. The `transcribe.py` script utilizes the `whisper` library, which legitimately downloads models from known sources as part of its core functionality. There is no evidence of prompt injection attempts, unauthorized data exfiltration, malicious execution, persistence mechanisms, or other harmful behaviors.

Capability Assessment

✓ Purpose & Capability

Name/description (local Whisper STT) match the files and runtime instructions. The included scripts implement local transcription using the openai-whisper package; no unrelated credentials, binaries, or services are requested.

✓ Instruction Scope

SKILL.md and scripts only instruct installing Python packages (openai-whisper, torch), optionally ffmpeg, and running the local transcribe script. The scripts read the provided audio file and print formatted output; they do not access other system paths, environment variables, or remote endpoints directly from the skill code.
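Before running the transcribe script, it can help to confirm that the dependencies named above are actually present. The sketch below is an illustration under assumptions, not part of the skill: the function name is hypothetical, and it relies on the openai-whisper package being importable as `whisper` and on ffmpeg being discoverable on the PATH.

```python
import importlib.util
import shutil


def check_dependencies(modules=("whisper", "torch"), binaries=("ffmpeg",)):
    """Report which top-level Python modules and command-line tools are missing."""
    missing_modules = [m for m in modules if importlib.util.find_spec(m) is None]
    missing_binaries = [b for b in binaries if shutil.which(b) is None]
    return {"modules": missing_modules, "binaries": missing_binaries}


if __name__ == "__main__":
    report = check_dependencies()
    print("Missing Python modules:", report["modules"] or "none")
    print("Missing binaries:", report["binaries"] or "none")
```

The check only looks modules up without importing them, so it stays fast even when torch is installed.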

ℹ Install Mechanism

There is no install spec (the skill is instruction-only plus two script files), which is low risk. At runtime, however, whisper.load_model will typically download model weights (large files) from upstream hosts. That network activity and the large disk/cache use are expected behavior for Whisper but worth noting.
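Because the first model load downloads and caches large files, a quick free-space check can be run beforehand. This is a rough sketch under assumptions: the helper names are hypothetical, the sizes in `APPROX_MODEL_MB` are approximate figures chosen for illustration, and the cache path mirrors what openai-whisper is understood to use by default (`$XDG_CACHE_HOME/whisper`, falling back to `~/.cache/whisper`).

```python
import os
import shutil
from pathlib import Path

# Approximate download sizes in MB; illustrative assumptions, not exact values.
APPROX_MODEL_MB = {"tiny": 75, "base": 145, "small": 470, "medium": 1500, "large": 3000}


def default_cache_dir() -> Path:
    """Return the assumed default whisper cache directory."""
    base = os.getenv("XDG_CACHE_HOME", str(Path.home() / ".cache"))
    return Path(base) / "whisper"


def has_room_for(model: str, directory: Path, margin: float = 1.5) -> bool:
    """Return True if the filesystem holding `directory` can fit the model plus a margin."""
    probe = directory
    while not probe.exists():  # walk up to the nearest existing ancestor
        probe = probe.parent
    free_mb = shutil.disk_usage(probe).free / (1024 * 1024)
    return free_mb >= APPROX_MODEL_MB[model] * margin
```

To keep the weights somewhere other than the default cache, `whisper.load_model` accepts a `download_root` argument, for example `whisper.load_model("base", download_root="/data/whisper")`, where the path is only an example.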

✓ Credentials

The skill declares no required environment variables, credentials, or config paths. The code does not read secrets or unrelated env vars. The dependency list (openai-whisper, torch, optional ffmpeg) is proportionate to local transcription.

✓ Persistence & Privilege

The skill's `always` setting is false, and the skill does not request persistent elevated privileges or modify other skills. It runs only when invoked and does not attempt to change global agent settings.

How to Use

Make sure OpenClaw is installed (local or Docker)
Run the install command in chat: /install whisper-stt
After installation, invoke the skill by name or use /whisper-stt
Provide required inputs per the skill's parameter spec and get structured output

Version History

v1.0.0

- Initial release of the whisper-stt skill for free, local speech-to-text transcription using OpenAI Whisper.
- Supports a range of audio/video input formats (mp3, wav, m4a, ogg, etc.) without API costs.
- Multiple output formats available: json, txt, srt, and vtt (for subtitles).
- Configurable model sizes for performance vs. accuracy tradeoffs.
- Option to specify target language and leverage GPU acceleration if available.
- Comprehensive usage instructions and troubleshooting included.
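To illustrate what the srt output format involves, the sketch below turns timed segments into SubRip text. It is not the skill's `transcribe.py`, whose contents are not shown here; the function names are hypothetical, and the input is assumed to follow the shape of the `segments` list returned by openai-whisper's `transcribe()` (dicts with `start`, `end`, and `text`).

```python
from typing import Iterable, Mapping


def srt_timestamp(seconds: float) -> str:
    """Format a time in seconds as an SRT timestamp, HH:MM:SS,mmm."""
    total_ms = int(round(seconds * 1000))
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def segments_to_srt(segments: Iterable[Mapping]) -> str:
    """Render segments as numbered SRT cues separated by blank lines."""
    blocks = []
    for index, seg in enumerate(segments, start=1):
        blocks.append(
            f"{index}\n"
            f"{srt_timestamp(seg['start'])} --> {srt_timestamp(seg['end'])}\n"
            f"{seg['text'].strip()}\n"
        )
    return "\n".join(blocks)
```

With whisper installed, the input could come from something like `whisper.load_model("base").transcribe("audio.mp3")["segments"]`, where the file name is a placeholder. VTT differs mainly in its `WEBVTT` header and in using a period instead of a comma before the milliseconds.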

Metadata

Slug whisper-stt

Version 1.0.0

License —

All-time Installs 15

Active Installs 13

Total Versions 1

Frequently Asked Questions

What is Whisper STT?

Whisper STT is an AI Agent Skill for Claude Code / OpenClaw that provides free local speech-to-text transcription using OpenAI Whisper. It transcribes audio files (mp3, wav, m4a, ogg, etc.) to text without API costs, and it has 1230 downloads so far.

How do I install Whisper STT?

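Run "/install whisper-stt" in the OpenClaw or Claude Code chat to install the skill in one step. The Python dependencies (openai-whisper and torch, plus optionally ffmpeg) still need to be installed separately, and the first run downloads the model weights.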

Is Whisper STT free?

Yes. Whisper STT is free to download, install, and use. It relies on the open-source Whisper library and runs locally, so transcription incurs no API costs.

Which platforms does Whisper STT support?

Whisper STT is cross-platform and runs anywhere OpenClaw / Claude Code is available.

Who created Whisper STT?

It is built and maintained by nic (@nickylin); the current version is v1.0.0.
