← Back to Skills Marketplace
lkisme

Simple sound-to-text skill locally

by Kuikui · GitHub ↗ · v1.0.2 · MIT-0
cross-platform ✓ Security Clean
248
Downloads
0
Stars
0
Active Installs
1
Versions
Install in OpenClaw
/install sst-simple
Description
Local speech-to-text using OpenAI Whisper. Use when the user needs to: (1) transcribe audio files to text, (2) convert voice messages to written content, (3)...
Usage Guidance
This skill appears to do what it claims: local Whisper-based transcription. Before installing, be aware that the included install.sh will: (1) create a virtual environment under /root/.openclaw/venv/stt-simple, (2) run pip install openai-whisper (downloads packages from PyPI), (3) attempt to install FFmpeg via apt/yum/brew (requires sudo/root), and (4) trigger model downloads to ~/.cache/whisper (large files, network bandwidth). If you plan to run this on a shared system, consider running the install and transcription inside a container or VM, and review the script contents (they are included) before execution. No credentials are requested and there are no hidden external endpoints in the code, but only run third-party install scripts that you trust or in an isolated environment.
Capability Analysis
Type: OpenClaw Skill Name: sst-simple Version: 1.0.2 The skill bundle provides a legitimate local speech-to-text implementation using OpenAI's Whisper library. The installation script (scripts/install.sh) and Python logic (scripts/stt_simple.py) are transparent, well-documented, and perform expected tasks such as setting up a virtual environment, installing dependencies like FFmpeg, and transcribing audio files with session-based isolation. No evidence of data exfiltration, malicious execution, or harmful prompt injection was found.
Capability Assessment
Purpose & Capability
Name/description (local Whisper STT) match the included scripts and instructions. The install script, Python transcription script, model choices, and output locations are all consistent with the stated purpose.
Instruction Scope
SKILL.md and the scripts only reference local audio files, creating a local venv, installing dependencies, and writing outputs under /root/.openclaw/workspace/stt_output (and models to ~/.cache/whisper). There are no instructions to read unrelated system files, pull secrets, or send transcription data to external endpoints.
Install Mechanism
Installation is via a provided shell script that creates a Python venv, runs pip install openai-whisper, and may call system package managers (apt-get/yum/brew) to install ffmpeg. This is a common approach but requires network access to package registries and privileges to install system packages. The install script itself is included (no hidden download URLs).
Credentials
The skill requests no environment variables or credentials. Its filesystem paths are under /root/.openclaw and standard caches; this matches local STT behavior and does not require unrelated secrets.
Persistence & Privilege
Skill is not always-enabled and does not modify other skills or global agent configuration. It creates its own venv and output directories under /root/.openclaw, which is normal for an installed skill.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install sst-simple
  3. After installation, invoke the skill by name or use /sst-simple
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.2
- Support multi-agent usage - Refine SKILL.md
Metadata
Slug sst-simple
Version 1.0.2
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 1
Frequently Asked Questions

What is Simple sound-to-text skill locally?

Local speech-to-text using OpenAI Whisper. Use when the user needs to: (1) transcribe audio files to text, (2) convert voice messages to written content, (3)... It is an AI Agent Skill for Claude Code / OpenClaw, with 248 downloads so far.

How do I install Simple sound-to-text skill locally?

Run "/install sst-simple" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Simple sound-to-text skill locally free?

Yes, Simple sound-to-text skill locally is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Simple sound-to-text skill locally support?

Simple sound-to-text skill locally is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Simple sound-to-text skill locally?

It is built and maintained by Kuikui (@lkisme); the current version is v1.0.2.

💬 Comments