← 返回 Skills 市场

Voice Recognition

Name: Voice Recognition
Author: gykdly

作者 gykdly · GitHub ↗ · v1.0.0

cross-platform ✓ 安全检测通过

1940

总下载

当前安装

版本数

在 OpenClaw 中安装

/install voice-recognition

功能描述

Local speech-to-text with OpenAI Whisper CLI. Supports Chinese, English, 100+ languages with translation and summarization.

安全使用建议

This skill appears to do what it says: a small Python wrapper that invokes the local OpenAI Whisper CLI and writes transcripts locally. Before installing/use: (1) install openai-whisper from a trusted source (Homebrew tap) so the 'whisper' binary on your PATH is legitimate; (2) be aware the first run will download model weights to ~/.cache/whisper (large download and disk usage); (3) update the SKILL.md usage examples to point to the script location on your system instead of the hard-coded /Users/liyi/... path, and only create the suggested alias if you trust the script location; (4) transcripts are written next to the input audio file — check permissions and disk location; (5) if you want to reduce risk, run the script in an isolated environment (container or VM) until you confirm behavior. No signs of credential exfiltration or remote endpoints were found in the included files.

功能分析

Type: OpenClaw Skill Name: voice-recognition Version: 1.0.0 The skill bundle is designed for local speech-to-text using the OpenAI Whisper CLI. The `SKILL.md` provides clear usage instructions without any prompt injection attempts or malicious directives for the agent. The Python script `scripts/voice识别_升级版.py` uses `subprocess.run` to execute the `whisper` command, passing user-controlled audio file paths as distinct arguments, which is the recommended secure method to prevent shell injection. Output files are written locally to the same directory as the input audio, aligning with the skill's stated purpose. There is no evidence of data exfiltration, persistence mechanisms, or other malicious behaviors.

能力评估

ℹ Purpose & Capability

The name/description (local Whisper-based speech-to-text) match the included Python script and the SKILL.md. The README asks you to install openai-whisper via Homebrew and use Python 3.10+, which is appropriate. Minor oddity: usage examples in SKILL.md hard-code an absolute path (/Users/liyi/.openclaw/workspace/...) pointing to a specific user's workspace — this is inconsistent with distributing the script and should be updated to relative or generic paths.

ℹ Instruction Scope

Runtime instructions simply run the included Python script which calls the external 'whisper' CLI (no shell=True). The script reads an audio file, writes a .txt transcript beside that file, and can generate a simple local summary. It does not read unrelated system files or environment variables, nor does it post data to remote endpoints. Note: first run will download model weights to ~/.cache/whisper (network and disk usage).

✓ Install Mechanism

There is no install spec (instruction-only skill). The SKILL.md recommends 'brew install openai-whisper' which is a reasonable, low-risk installation path for the Whisper CLI.

✓ Credentials

The skill requests no environment variables, no credentials, and no config paths. The behavior (invoking a local 'whisper' binary) is proportionate to the stated function. Reminder: because it calls an external binary by name, it depends on the 'whisper' in PATH being the expected implementation.

✓ Persistence & Privilege

The skill does not request permanent/always inclusion, does not modify other skills, and contains no code that attempts to change system-wide agent settings. It only suggests an optional shell alias for convenience (user action).

如何使用

确保已安装 OpenClaw（本地或 Docker 部署）
在对话框中输入安装命令：/install voice-recognition
安装完成后，直接呼叫该 Skill 的名称或使用 /voice-recognition 触发
根据 Skill 的参数说明提供必要输入，即可获得结构化输出

版本历史

v1.0.0

Initial release of voice-recognition skill. - Provides local speech-to-text using OpenAI Whisper CLI, no API key required. - Supports 100+ languages, including Chinese and English. - Offers translation to English and text summarization features. - Compatible with various audio formats: MP3, M4A, WAV, OGG, FLAC, WebM. - Easy command-line usage and quick alias setup instructions included.

元数据

Slug voice-recognition

版本 1.0.0

许可证 —

累计安装 8

当前安装数 6

历史版本数 1

常见问题

Voice Recognition 是什么？

Local speech-to-text with OpenAI Whisper CLI. Supports Chinese, English, 100+ languages with translation and summarization. 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件，目前累计下载 1940 次。

如何安装 Voice Recognition？

在 OpenClaw 或 Claude Code 对话框中运行命令「/install voice-recognition」即可一键安装，无需额外配置。

Voice Recognition 是免费的吗？

是的，Voice Recognition 完全免费（开源免费），可自由下载、安装和使用。

Voice Recognition 支持哪些平台？

Voice Recognition 跨平台运行，可在任意部署了 OpenClaw / Claude Code 的环境中使用（cross-platform）。

谁开发了 Voice Recognition？

由 gykdly（@gykdly）开发并维护，当前版本 v1.0.0。