
Audio Transcribe

Name: Audio Transcribe
Author: aktheknight

by Alex Knight · v1.0.0

cross-platform ✓ Security Clean

Downloads: 1785

Install in OpenClaw

/install audio-transcribe

Description

Auto-transcribe voice messages locally using faster-whisper with selectable Whisper models, no API key required.

README (SKILL.md)

Audio Transcription Skill

Auto-transcribe voice messages using faster-whisper (local, no API key needed).

Requirements

pip install faster-whisper

Models download automatically on first use.

Usage

Transcribe a file

python3 /root/clawd/skills/audio-transcribe/scripts/transcribe.py /path/to/audio.ogg

Change model (edit script)

Edit transcribe.py and change:

model = WhisperModel('small', device='cpu', compute_type='int8')  # Options: tiny, base, small, medium, large-v3
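
Instead of editing the script each time, the model could be chosen on the command line. The following is a minimal sketch, not the shipped transcribe.py (whose contents are not shown here): the `--model` flag and the helper names `parse_args`, `join_segments`, and `transcribe` are hypothetical, and the sketch assumes faster-whisper's `WhisperModel.transcribe` returns an iterable of segments with a `text` attribute.

```python
import argparse

# Model names taken from the options listed in the comment above.
MODEL_CHOICES = ("tiny", "base", "small", "medium", "large-v3")


def parse_args(argv=None):
    """Parse the audio path and an optional (hypothetical) --model flag."""
    parser = argparse.ArgumentParser(description="Transcribe an audio file locally.")
    parser.add_argument("audio_path", help="Path to the audio file, e.g. an .ogg voice message")
    parser.add_argument("--model", default="small", choices=MODEL_CHOICES)
    return parser.parse_args(argv)


def join_segments(segments):
    """Concatenate the text of each transcribed segment into one string."""
    return " ".join(seg.text.strip() for seg in segments).strip()


def transcribe(audio_path, model_name="small"):
    """Load the chosen model on CPU with int8 weights and transcribe one file."""
    # Imported lazily so argument parsing works even before faster-whisper is installed.
    from faster_whisper import WhisperModel

    model = WhisperModel(model_name, device="cpu", compute_type="int8")
    segments, _info = model.transcribe(audio_path)
    return join_segments(segments)


def main(argv=None):
    args = parse_args(argv)
    print(transcribe(args.audio_path, args.model))
```

Calling `main()` under the usual `if __name__ == "__main__":` guard would let the script run as, for example, `python3 transcribe.py /path/to/audio.ogg --model base`.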

Models

Model	Size	VRAM/RAM	Speed	Use Case
tiny	39 MB	~1 GB	⚡⚡⚡	Quick drafts
base	74 MB	~1 GB	⚡⚡	Basic accuracy
small	244 MB	~2 GB	⚡	Recommended
medium	769 MB	~5 GB	🐢	Better accuracy
large-v3	1.5 GB	~10 GB	🐢🐢	Best accuracy
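
As a rough illustration of how the table might be used, the sketch below picks the most accurate model whose listed memory figure fits a given budget. The function name `largest_model_for` is hypothetical, and the memory figures are the approximate values from the table above, not measured requirements.

```python
# Approximate memory needs in GB, copied from the VRAM/RAM column of the table.
MODEL_MEMORY_GB = {
    "tiny": 1,
    "base": 1,
    "small": 2,
    "medium": 5,
    "large-v3": 10,
}

# Models ordered from least to most accurate, as in the table.
MODEL_ORDER = ["tiny", "base", "small", "medium", "large-v3"]


def largest_model_for(available_gb):
    """Return the most accurate model whose listed memory need fits the budget."""
    fitting = [name for name in MODEL_ORDER if MODEL_MEMORY_GB[name] <= available_gb]
    if not fitting:
        raise ValueError(f"No model fits in {available_gb} GB")
    return fitting[-1]
```

For example, `largest_model_for(4)` selects `small`, which matches the table's recommended default for modest hardware.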

Integration

Clawdbot auto-transcribes incoming voice messages when this skill is enabled.

Files

scripts/transcribe.py — Main transcription script
SKILL.md — This file

Usage Guidance

This skill appears coherent for local transcription. Before installing:

(1) Be aware that pip install faster-whisper will install third-party code from PyPI and may build native dependencies; review that package if you require strict supply-chain controls.
(2) Model files are downloaded on first use (network bandwidth, disk space, and a cache location such as ~/.cache may be used).
(3) Transcription can require significant RAM/CPU/GPU depending on model size; ensure your system meets the listed VRAM/RAM requirements.
(4) The included script only reads the audio file you pass and prints the text (no hidden endpoints or credential access were found), but if you must guarantee full offline operation, pre-download models and verify faster-whisper's sources before running.
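
One way to pre-download a model for offline use is sketched below. It assumes faster-whisper's `download_model` helper and the `local_files_only` option of `WhisperModel`; the function names `prefetch` and `load_offline` and the target directory are hypothetical and are not part of the skill.

```python
from pathlib import Path


def prefetch(model_size, target_dir):
    """Download a model once (requires network) and return its local path."""
    # Lazy import: faster-whisper is only needed when a download is requested.
    from faster_whisper import download_model

    return download_model(model_size, output_dir=target_dir)


def load_offline(model_dir):
    """Load a previously downloaded model without touching the network."""
    if not Path(model_dir).is_dir():
        raise FileNotFoundError(f"Model directory not found: {model_dir}")

    from faster_whisper import WhisperModel

    # local_files_only=True asks the loader not to fetch anything remotely.
    return WhisperModel(
        model_dir,
        device="cpu",
        compute_type="int8",
        local_files_only=True,
    )
```

A typical flow would call `prefetch("small", "/opt/models/whisper-small")` on a connected machine, then `load_offline("/opt/models/whisper-small")` wherever network access is restricted; the path here is only an example.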

Capability Analysis

Type: OpenClaw Skill
Name: audio-transcribe
Version: 1.0.0

The OpenClaw skill 'audio-transcribe' is benign. The `SKILL.md` provides clear, non-malicious instructions for installing dependencies and running the transcription script, with no evidence of prompt injection attempts. The `scripts/transcribe.py` script uses the `faster-whisper` library to perform local audio transcription, taking an audio file path as input. It does not contain any code for data exfiltration, malicious execution, persistence, or obfuscation. The model download on first use is standard, expected behavior for local machine learning models.

Capability Assessment

✓ Purpose & Capability

Name/description (local transcription with faster-whisper) align with included script and SKILL.md. The script only loads a WhisperModel and transcribes a provided audio file.

✓ Instruction Scope

Runtime instructions and the script operate only on the provided audio path and model downloads; there are no references to unrelated files, environment variables, or external endpoints in the skill code itself.

ℹ Install Mechanism

No install spec in the registry; SKILL.md asks the user to pip install faster-whisper. This is expected but means a third-party package (and its dependencies) will be installed from PyPI and models will be downloaded from the network on first run.

✓ Credentials

The skill requests no environment variables, credentials, or config paths. The script only uses the command-line audio file argument.

✓ Persistence & Privilege

The `always` setting is false, and the skill does not modify agent configuration or request persistent privileges.

How to Use

Make sure OpenClaw is installed (local or Docker)
Run the install command in chat: /install audio-transcribe
After installation, invoke the skill by name or use /audio-transcribe
Provide required inputs per the skill's parameter spec and get structured output

Version History

v1.0.0

- Initial release of audio-transcribe skill: auto-transcribes voice messages locally using faster-whisper.
- No API key required; models download automatically on first use.
- Simple CLI for transcribing .ogg files.
- Model selection supported by editing the script, with multiple accuracy/speed options.
- Clawdbot integration: auto-transcribes incoming voice messages when enabled.

Metadata

Slug audio-transcribe

Version 1.0.0

License —

All-time Installs 16

Active Installs 14

Total Versions 1

Frequently Asked Questions

What is Audio Transcribe?

Audio Transcribe automatically transcribes voice messages locally using faster-whisper, with selectable Whisper models and no API key required. It is an AI agent skill for Claude Code / OpenClaw, with 1785 downloads so far.

How do I install Audio Transcribe?

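Run "/install audio-transcribe" in the OpenClaw or Claude Code chat to install the skill in one step. The skill also requires the faster-whisper package (pip install faster-whisper); models download automatically on first use.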

Is Audio Transcribe free?

Yes, Audio Transcribe is free to download, install, and use. Note that the listing does not specify a license.

Which platforms does Audio Transcribe support?

Audio Transcribe is cross-platform and runs anywhere OpenClaw / Claude Code is available.

Who created Audio Transcribe?

It is built and maintained by Alex Knight (@aktheknight); the current version is v1.0.0.
