← 返回 Skills 市场
aktheknight

Audio Transcribe

作者 Alex Knight · GitHub ↗ · v1.0.0
cross-platform ✓ 安全检测通过
1785
总下载
1
收藏
14
当前安装
1
版本数
在 OpenClaw 中安装
/install audio-transcribe
功能描述
Auto-transcribe voice messages locally using faster-whisper with selectable Whisper models, no API key required.
使用说明 (SKILL.md)

Audio Transcription Skill

Auto-transcribe voice messages using faster-whisper (local, no API key needed).

Requirements

pip install faster-whisper

Models download automatically on first use.

Usage

Transcribe a file

python3 /root/clawd/skills/audio-transcribe/scripts/transcribe.py /path/to/audio.ogg

Change model (edit script)

Edit transcribe.py and change:

model = WhisperModel('small', device='cpu', compute_type='int8')  # Options: tiny, base, small, medium, large-v3

Models

Model Size VRAM/RAM Speed Use Case
tiny 39 MB ~1 GB ⚡⚡⚡ Quick drafts
base 74 MB ~1 GB ⚡⚡ Basic accuracy
small 244 MB ~2 GB Recommended
medium 769 MB ~5 GB 🐢 Better accuracy
large-v3 1.5 GB ~10 GB 🐢🐢 Best accuracy

Integration

Clawdbot auto-transcribes incoming voice messages when this skill is enabled.

Files

  • scripts/transcribe.py — Main transcription script
  • SKILL.md — This file
安全使用建议
This skill appears coherent for local transcription. Before installing: (1) be aware pip install faster-whisper will install third-party code from PyPI and may build native dependencies—review that package if you require strict supply-chain controls; (2) model files are downloaded on first use (network bandwidth, disk space, and cache location like ~/.cache may be used); (3) transcription can require significant RAM/CPU/GPU depending on model size—ensure your system meets the listed VRAM/RAM requirements; (4) the included script only reads the audio file you pass and prints the text (no hidden endpoints or credential access were found), but if you must guarantee full offline operation, pre-download models and verify faster-whisper’s sources before running.
功能分析
Type: OpenClaw Skill Name: audio-transcribe Version: 1.0.0 The OpenClaw skill 'audio-transcribe' is benign. The `SKILL.md` provides clear, non-malicious instructions for installing dependencies and running the transcription script, with no evidence of prompt injection attempts. The `scripts/transcribe.py` script uses the `faster-whisper` library to perform local audio transcription, taking an audio file path as input. It does not contain any code for data exfiltration, malicious execution, persistence, or obfuscation. The model download on first use is a standard and expected behavior for local machine learning models.
能力评估
Purpose & Capability
Name/description (local transcription with faster-whisper) align with included script and SKILL.md. The script only loads a WhisperModel and transcribes a provided audio file.
Instruction Scope
Runtime instructions and the script operate only on the provided audio path and model downloads; there are no references to unrelated files, environment variables, or external endpoints in the skill code itself.
Install Mechanism
No install spec in the registry; SKILL.md asks the user to pip install faster-whisper. This is expected but means a third-party package (and its dependencies) will be installed from PyPI and models will be downloaded from the network on first run.
Credentials
The skill requests no environment variables, credentials, or config paths. The script only uses the command-line audio file argument.
Persistence & Privilege
always is false and the skill does not modify agent configuration or request persistent privileges.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install audio-transcribe
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /audio-transcribe 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.0
- Initial release of audio-transcribe skill: auto-transcribes voice messages locally using faster-whisper. - No API key required; models download automatically on first use. - Simple CLI for transcribing .ogg files. - Model selection supported by editing the script, with multiple accuracy/speed options. - Clawdbot integration: auto-transcribes incoming voice messages when enabled.
元数据
Slug audio-transcribe
版本 1.0.0
许可证
累计安装 16
当前安装数 14
历史版本数 1
常见问题

Audio Transcribe 是什么?

Auto-transcribe voice messages locally using faster-whisper with selectable Whisper models, no API key required. 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 1785 次。

如何安装 Audio Transcribe?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install audio-transcribe」即可一键安装,无需额外配置。

Audio Transcribe 是免费的吗?

是的,Audio Transcribe 完全免费(开源免费),可自由下载、安装和使用。

Audio Transcribe 支持哪些平台?

Audio Transcribe 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 Audio Transcribe?

由 Alex Knight(@aktheknight)开发并维护,当前版本 v1.0.0。

💬 留言讨论