← Back to Skills Marketplace
515
Downloads
1
Stars
2
Active Installs
2
Versions
Install in OpenClaw
/install stt-simple
Description
Simple local Speech-To-Text using Whisper. One-command install with auto model download. Supports 99+ languages.
Usage Guidance
This skill appears to be what it says: a local Whisper-based STT installer + small runtime. Before installing: 1) Inspect install.sh (already included) and be comfortable with it running system package installs (apt-get/yum/brew) and pip installs; those commands can require sudo. 2) Expect a network download of the Whisper model (~244MB for 'small') and increased disk usage in ~/.cache/whisper and the output directory. 3) Consider running the install inside a container, VM, or non-root user path (change VENV_DIR/OUTPUT_DIR) if you prefer isolation. 4) If you need to audit dependencies, verify the 'openai-whisper' PyPI package/source. 5) No credentials or unexpected external endpoints were found, but pip installs execute package code — only run if you trust the sources or run in an isolated environment.
Capability Analysis
Type: OpenClaw Skill
Name: stt-simple
Version: 1.0.1
The skill provides a legitimate local speech-to-text service using the OpenAI Whisper library. The installation script (install.sh) and Python processor (stt_simple.py) perform standard environment setup, dependency management (FFmpeg/Whisper), and audio transcription without any evidence of data exfiltration, malicious execution, or unauthorized access.
Capability Assessment
Purpose & Capability
Name/description match the files and instructions: install.sh creates a virtualenv, installs openai-whisper and ffmpeg, triggers model download, and stt_simple.py transcribes audio and writes outputs. There are no unrelated credentials, binaries, or services requested.
Instruction Scope
SKILL.md and install.sh confine actions to installing dependencies, downloading the Whisper model, creating an output directory, and verifying installation. Note: the install script may call apt-get/yum/brew to install ffmpeg (system package manager operations) and writes under /root/.openclaw and ~/.cache/whisper, so it may require elevated privileges or path edits on non-root systems.
Install Mechanism
No remote arbitrary archives or shortened URLs; dependencies are installed via pip (openai-whisper) and system package managers for ffmpeg. The model is downloaded at runtime via whisper.load_model (network download, ~244MB for 'small'). This is expected for a local Whisper setup.
Credentials
The skill requests no environment variables or secrets. It only creates a venv and writes model/cache and output files to local paths. No extraneous credential access is present.
Persistence & Privilege
The skill does not request 'always' or attempt to modify other skills. It creates persistent files (venv, model cache, output) under /root/.openclaw and ~/.cache/whisper; installing system packages (apt-get/yum/brew) may require sudo. Consider adjusting paths if you cannot or do not want files under /root.
How to Use
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat:
/install stt-simple - After installation, invoke the skill by name or use
/stt-simple - Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.1
- Added full English translations alongside Chinese throughout SKILL.md.
- All usage instructions, model table, options, and troubleshooting now include both Chinese and English explanations.
- Improved accessibility for non-Chinese speakers.
- No code or file changes; documentation update only.
v1.0.0
stt-simple 1.0.0 – initial release
- Easy one-command local Speech-To-Text setup using Whisper, with automatic model download.
- Supports transcription in 99+ languages.
- Provides both command line and Python interface examples.
- Includes install script that handles environment setup, dependencies, and model download.
- Supports multiple output formats: .txt, .json, .srt, and .vtt.
- Documentation covers quickstart, model selection, language codes, and troubleshooting steps.
Metadata
Frequently Asked Questions
What is Simple stt(sound-to-text) locally?
Simple local Speech-To-Text using Whisper. One-command install with auto model download. Supports 99+ languages. It is an AI Agent Skill for Claude Code / OpenClaw, with 515 downloads so far.
How do I install Simple stt(sound-to-text) locally?
Run "/install stt-simple" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.
Is Simple stt(sound-to-text) locally free?
Yes, Simple stt(sound-to-text) locally is completely free, licensed under MIT-0. You can download, install and use it at no cost.
Which platforms does Simple stt(sound-to-text) locally support?
Simple stt(sound-to-text) locally is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).
Who created Simple stt(sound-to-text) locally?
It is built and maintained by Kuikui (@lkisme); the current version is v1.0.1.
More Skills