← 返回 Skills 市场
bohnwuks

Speech to text

作者 Ian Santos · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ✓ 安全检测通过
466
总下载
0
收藏
5
当前安装
1
版本数
在 OpenClaw 中安装
/install stt
功能描述
Transcreve arquivos de áudio em português brasileiro para texto, suportando vários formatos e incluindo timestamps.
安全使用建议
This skill appears to do exactly what it says: local transcription using OpenAI Whisper. Before installing, consider: (1) Whisper will download model files on first run (can be large and requires network access); (2) you must install FFmpeg separately and ensure the 'inbound' folder path matches the script's computed path (SKILL.md's mkdir path may not match your environment); (3) the script will read and move any audio files placed in that folder and will write JSON transcriptions to an output directory—do not place sensitive audio there unless you are comfortable with it being processed and stored locally; (4) review requirements.txt and install dependencies in an isolated environment (virtualenv/container) to limit systemic impact. I reviewed the provided files and saw no code that exfiltrates data or requests unrelated credentials; if you want extra assurance, run the script in a sandboxed environment and inspect the full (non-truncated) stt_processor.py before production use.
功能分析
Type: OpenClaw Skill Name: stt Version: 1.0.0 The skill provides a legitimate speech-to-text service using the OpenAI Whisper library. The core logic in `stt_processor.py` implements standard file-processing patterns, including monitoring an inbound directory, transcribing audio files, and moving processed files to success/failure folders. No evidence of data exfiltration, malicious execution, or prompt injection was found; the code and instructions in `SKILL.md` are consistent with the stated purpose.
能力评估
Purpose & Capability
Name and description match the included code and SKILL.md: the package implements a Whisper-based transcriber, supports the listed audio formats and timestamps, and saves/moves files as expected. No unrelated credentials, binaries, or config paths are requested.
Instruction Scope
SKILL.md instructions are narrowly scoped to installing Python deps, FFmpeg, creating an inbound folder, and running the script. The script operates on a local media/inbound folder and writes transcriptions to an output directory. Two minor issues to be aware of: (1) SKILL.md asks to create ../../../media/inbound — the script computes a media path relative to the script location (workspace_dir = Path(__file__).parent.parent.parent then .parent / 'media'), so you should verify the exact folder path used in your deployment to avoid missed files; (2) Whisper will download model weights on first run (network and significant disk usage), which is expected but notable.
Install Mechanism
There is no installer in the registry spec; installation is via pip install -r requirements.txt and a separate FFmpeg install. Dependencies come from PyPI and standard package managers — no suspicious external URLs, archive downloads, or extract-on-disk steps are present in the manifest.
Credentials
The skill declares no environment variables or credentials and the code does not read secret env vars. It only reads/writes local filesystem paths (inbound, output, processed/failed). There are no requests for unrelated credentials.
Persistence & Privilege
The skill does not request always:true and is user-invocable only. It does not modify other skills or system-wide agent settings. Its runtime behavior (processing local files, saving results) is consistent with its purpose.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install stt
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /stt 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.0
Speech-to-Text (STT) Skill – Initial Release - Transcribes audio files to text using OpenAI Whisper, optimized for Brazilian Portuguese. - Supports audio formats: OGG, WAV, MP3, M4A, FLAC, AAC, OPUS. - Offers transcription with timestamps. - Provides tools for transcribing individual files, batch processing, and folder monitoring. - Includes setup instructions and usage examples.
元数据
Slug stt
版本 1.0.0
许可证 MIT-0
累计安装 5
当前安装数 5
历史版本数 1
常见问题

Speech to text 是什么?

Transcreve arquivos de áudio em português brasileiro para texto, suportando vários formatos e incluindo timestamps. 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 466 次。

如何安装 Speech to text?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install stt」即可一键安装,无需额外配置。

Speech to text 是免费的吗?

是的,Speech to text 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。

Speech to text 支持哪些平台?

Speech to text 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 Speech to text?

由 Ian Santos(@bohnwuks)开发并维护,当前版本 v1.0.0。

💬 留言讨论