
Simple stt(sound-to-text) locally

Name: Simple stt(sound-to-text) locally
Author: lkisme

by Kuikui · GitHub ↗ · v1.0.1 · MIT-0

cross-platform ✓ Security Clean

515 Downloads

Install in OpenClaw

/install stt-simple

Description

Simple local Speech-To-Text using Whisper. One-command install with auto model download. Supports 99+ languages.

Usage Guidance

This skill appears to be what it says: a local Whisper-based STT installer + small runtime. Before installing:

1) Inspect install.sh (already included) and be comfortable with it running system package installs (apt-get/yum/brew) and pip installs; those commands can require sudo.
2) Expect a network download of the Whisper model (~244MB for 'small') and increased disk usage in ~/.cache/whisper and the output directory.
3) Consider running the install inside a container, VM, or non-root user path (change VENV_DIR/OUTPUT_DIR) if you prefer isolation.
4) If you need to audit dependencies, verify the 'openai-whisper' PyPI package/source.
5) No credentials or unexpected external endpoints were found, but pip installs execute package code. Only run the installer if you trust the sources, or run it in an isolated environment.

Capability Analysis

Type: OpenClaw Skill
Name: stt-simple
Version: 1.0.1

The skill provides a legitimate local speech-to-text service using the OpenAI Whisper library. The installation script (install.sh) and Python processor (stt_simple.py) perform standard environment setup, dependency management (FFmpeg/Whisper), and audio transcription without any evidence of data exfiltration, malicious execution, or unauthorized access.

Capability Assessment

✓ Purpose & Capability

Name/description match the files and instructions: install.sh creates a virtualenv, installs openai-whisper and ffmpeg, triggers model download, and stt_simple.py transcribes audio and writes outputs. There are no unrelated credentials, binaries, or services requested.
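As a rough illustration of the transcribe-and-write flow described above, here is a minimal sketch. It is not the contents of stt_simple.py, which are not shown on this page. The function names write_outputs and transcribe_file are hypothetical; whisper.load_model and model.transcribe are the calls exposed by the openai-whisper package.

```python
import json
from pathlib import Path
from typing import List


def write_outputs(result: dict, stem: str, out_dir: Path) -> List[Path]:
    """Write a Whisper-style result dict to <stem>.txt and <stem>.json."""
    out_dir.mkdir(parents=True, exist_ok=True)
    txt_path = out_dir / f"{stem}.txt"
    json_path = out_dir / f"{stem}.json"
    # Plain transcript, trimmed of the leading space Whisper often emits.
    txt_path.write_text(result["text"].strip() + "\n", encoding="utf-8")
    # Full result (text, segments, language) for downstream tooling.
    json_path.write_text(
        json.dumps(result, ensure_ascii=False, indent=2), encoding="utf-8"
    )
    return [txt_path, json_path]


def transcribe_file(audio_path: str, out_dir: str, model_name: str = "small") -> List[Path]:
    """Transcribe one audio file (hypothetical helper, not the skill's own code)."""
    import whisper  # lazy import; requires the openai-whisper package and ffmpeg

    model = whisper.load_model(model_name)  # downloads the model on first use
    result = model.transcribe(audio_path)
    return write_outputs(result, Path(audio_path).stem, Path(out_dir))
```

Calling transcribe_file("meeting.mp3", "./out") would, under these assumptions, leave meeting.txt and meeting.json in ./out.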

ℹ Instruction Scope

SKILL.md and install.sh confine actions to installing dependencies, downloading the Whisper model, creating an output directory, and verifying installation. Note: the install script may call apt-get/yum/brew to install ffmpeg (system package manager operations) and writes under /root/.openclaw and ~/.cache/whisper, so it may require elevated privileges or path edits on non-root systems.
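Before running the installer on a non-root system, it may help to check whether ffmpeg is already present and whether the target directory is writable. The sketch below is a hypothetical pre-install check and is not part of the skill; the preflight and nearest_existing names are illustrative only.

```python
import os
import shutil


def nearest_existing(path: str) -> str:
    """Walk up from path until an existing directory is found."""
    path = os.path.abspath(os.path.expanduser(path))
    while not os.path.exists(path):
        parent = os.path.dirname(path)
        if parent == path:  # reached the filesystem root
            break
        path = parent
    return path


def preflight(base_dir: str) -> dict:
    """Report whether ffmpeg is on PATH and whether base_dir could be created."""
    target = nearest_existing(base_dir)
    return {
        # If ffmpeg is already installed, the package-manager step may be unnecessary.
        "ffmpeg_on_path": shutil.which("ffmpeg") is not None,
        # Creating files below `target` needs write and search permission on it.
        "base_dir_writable": os.access(target, os.W_OK | os.X_OK),
        "checked_dir": target,
    }


if __name__ == "__main__":
    print(preflight("/root/.openclaw"))
```

If base_dir_writable comes back False for /root/.openclaw, that suggests either running with elevated privileges or editing the paths as noted above.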

✓ Install Mechanism

No remote arbitrary archives or shortened URLs; dependencies are installed via pip (openai-whisper) and system package managers for ffmpeg. The model is downloaded at runtime via whisper.load_model (network download, ~244MB for 'small'). This is expected for a local Whisper setup.
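To see whether the model download has already happened, one can look in the cache directory before calling whisper.load_model. The following is a sketch under two assumptions: that the cache lives under ~/.cache/whisper (or $XDG_CACHE_HOME/whisper), and that the file is named <model>.pt, which matches openai-whisper's behavior at the time of writing but is not guaranteed. The helper names are hypothetical.

```python
import os
from pathlib import Path


def default_cache_dir() -> Path:
    """Cache location assumed to match openai-whisper's default."""
    base = os.getenv("XDG_CACHE_HOME", os.path.join(os.path.expanduser("~"), ".cache"))
    return Path(base) / "whisper"


def is_model_cached(name: str = "small", cache_dir=None) -> bool:
    """Return True if <name>.pt already exists in the cache (assumed file naming)."""
    directory = Path(cache_dir) if cache_dir else default_cache_dir()
    return (directory / f"{name}.pt").is_file()


def load(name: str = "small"):
    """Load a model; the first call triggers the network download."""
    import whisper  # lazy import; requires the openai-whisper package

    if not is_model_cached(name):
        print(f"Model '{name}' not cached yet; expect a network download.")
    return whisper.load_model(name)
```

whisper.load_model also accepts a download_root argument, which is one way to keep the model outside the home directory.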

✓ Credentials

The skill requests no environment variables or secrets. It only creates a venv and writes model/cache and output files to local paths. No extraneous credential access is present.

ℹ Persistence & Privilege

The skill does not request 'always' or attempt to modify other skills. It creates persistent files (venv, model cache, output) under /root/.openclaw and ~/.cache/whisper; installing system packages (apt-get/yum/brew) may require sudo. Consider adjusting paths if you cannot or do not want files under /root.

How to Use

Make sure OpenClaw is installed (local or Docker)
Run the install command in chat: /install stt-simple
After installation, invoke the skill by name or use /stt-simple
Provide required inputs per the skill's parameter spec and get structured output

Version History

v1.0.1

- Added full English translations alongside Chinese throughout SKILL.md.
- All usage instructions, model table, options, and troubleshooting now include both Chinese and English explanations.
- Improved accessibility for non-Chinese speakers.
- No code or file changes; documentation update only.

v1.0.0

stt-simple 1.0.0: initial release
- Easy one-command local Speech-To-Text setup using Whisper, with automatic model download.
- Supports transcription in 99+ languages.
- Provides both command line and Python interface examples.
- Includes install script that handles environment setup, dependencies, and model download.
- Supports multiple output formats: .txt, .json, .srt, and .vtt.
- Documentation covers quickstart, model selection, language codes, and troubleshooting steps.
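For readers curious how the .srt output relates to a transcription result, here is a minimal sketch that converts Whisper-style segments (dicts with start, end, and text keys) into SRT text. It illustrates the format only; the skill's own writer is not shown on this page, and the function names are hypothetical.

```python
def srt_timestamp(seconds: float) -> str:
    """Format seconds as an SRT timestamp, HH:MM:SS,mmm."""
    ms = round(seconds * 1000)
    hours, ms = divmod(ms, 3_600_000)
    minutes, ms = divmod(ms, 60_000)
    secs, ms = divmod(ms, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def segments_to_srt(segments) -> str:
    """Render segments as numbered SRT cues separated by blank lines."""
    cues = []
    for index, seg in enumerate(segments, start=1):
        cues.append(
            f"{index}\n"
            f"{srt_timestamp(seg['start'])} --> {srt_timestamp(seg['end'])}\n"
            f"{seg['text'].strip()}\n"
        )
    return "\n".join(cues)


if __name__ == "__main__":
    demo = [
        {"start": 0.0, "end": 2.5, "text": " Hello."},
        {"start": 2.5, "end": 4.0, "text": " World."},
    ]
    print(segments_to_srt(demo))
```

The .vtt format is similar but starts with a WEBVTT header line and uses a period instead of a comma before the milliseconds.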

Metadata

Slug stt-simple

Version 1.0.1

License MIT-0

All-time Installs 3

Active Installs 2

Total Versions 2

Frequently Asked Questions

What is Simple stt(sound-to-text) locally?

Simple local Speech-To-Text using Whisper. One-command install with auto model download. Supports 99+ languages. It is an AI Agent Skill for Claude Code / OpenClaw, with 515 downloads so far.

How do I install Simple stt(sound-to-text) locally?

Run "/install stt-simple" in the OpenClaw or Claude Code chat to install it in one step. The installer may call a system package manager (apt-get/yum/brew) to install ffmpeg, which can require sudo.

Is Simple stt(sound-to-text) locally free?

Yes, Simple stt(sound-to-text) locally is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Simple stt(sound-to-text) locally support?

Simple stt(sound-to-text) locally is cross-platform and runs anywhere OpenClaw / Claude Code is available.

Who created Simple stt(sound-to-text) locally?

It is built and maintained by Kuikui (@lkisme); the current version is v1.0.1.
