← 返回 Skills 市场
134
总下载
0
收藏
0
当前安装
1
版本数
在 OpenClaw 中安装
/install kai-realtime-voice
功能描述
Real-time voice streaming via MiniMax WebSocket API. Use for low-latency voice conversations and streaming audio generation.
使用说明 (SKILL.md)
Kai Realtime Voice
Real-time voice streaming using MiniMax WebSocket API.
Setup
API key already configured in OpenClaw config. Uses same key as kai-minimax-tts.
Usage
Test WebSocket connection:
bash {baseDir}/scripts/kai_realtime.sh --test
Stream text to audio:
bash {baseDir}/scripts/kai_realtime.sh --stream "Hello world"
Requirements
- Python3 with websockets library
- WebSocket support
- MiniMax API key (auto-injected)
Output
Audio streamed to stdout or saved to file.
安全使用建议
This skill appears to do what it claims (call MiniMax t2a endpoint and save audio), but there are small inconsistencies you should understand before installing: (1) SKILL.md mentions WebSocket streaming and Python websockets, but the bundled script currently uses a REST curl fallback — so you will not get true WebSocket low-latency streaming until that is implemented; (2) the script reads optional env vars (KAI_MINIMAX_WORKSPACE, KAI_ENGLISH_VOICE_ID) that are not documented in requires.env — check or set these to avoid files being written where you don't expect; (3) it will send text and your MINIMAX_API_KEY to api.minimax.io, so verify that domain and consider using a scoped/test API key first. If you need real-time WebSocket streaming, request or inspect the implementation update (a Python/websocket client) before relying on low-latency behavior. If you accept the REST fallback and understand where output files are saved, the footprint is small and the install risk is low.
功能分析
Type: OpenClaw Skill
Name: kai-realtime-voice
Version: 1.0.0
The skill bundle provides a bash script (`scripts/kai_realtime.sh`) to interact with the MiniMax API for text-to-speech generation. It uses standard tools like `curl` and `xxd` to communicate with the legitimate service endpoint (api.minimax.io) and process audio data. No evidence of malicious intent, data exfiltration, or unauthorized execution was found.
能力评估
Purpose & Capability
Name/description (real‑time voice via MiniMax) lines up with the actual network calls (requests to https://api.minimax.io/v1/t2a_v2). The single required env var MINIMAX_API_KEY is expected for accessing the MiniMax API.
Instruction Scope
SKILL.md and README text advertise WebSocket-based real-time streaming and require 'Python3 with websockets', but the provided script uses curl and a REST fallback (no WebSocket implementation). The script also reads optional env vars KAI_MINIMAX_WORKSPACE and KAI_ENGLISH_VOICE_ID that are not listed in requires.env. These mismatches are scope/instruction inconsistencies that could confuse users and agents.
Install Mechanism
No install spec; this is instruction-only plus a shell script. No downloads or archive extraction. Low install risk (nothing is written outside the workspace when run, aside from created output file).
Credentials
Only MINIMAX_API_KEY is declared and required, which is appropriate. The script makes optional use of KAI_MINIMAX_WORKSPACE and KAI_ENGLISH_VOICE_ID (defaults provided) — these are not declared in metadata and should be documented. The skill does not request unrelated credentials.
Persistence & Privilege
always:false and no autonomously persistent behavior. The script writes output to a workspace path (default under $HOME/.openclaw) but does not modify other skills or global configuration.
如何使用
- 确保已安装 OpenClaw(本地或 Docker 部署)
- 在对话框中输入安装命令:
/install kai-realtime-voice - 安装完成后,直接呼叫该 Skill 的名称或使用
/kai-realtime-voice触发 - 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.0
- Initial release of kai-realtime-voice.
- Enables real-time voice streaming through the MiniMax WebSocket API.
- Supports low-latency voice conversations and streaming audio generation.
- Audio can be streamed to stdout or saved to a file.
- Requires Python3 with websockets, WebSocket support, and a MiniMax API key.
元数据
常见问题
Kai Realtime Voice 是什么?
Real-time voice streaming via MiniMax WebSocket API. Use for low-latency voice conversations and streaming audio generation. 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 134 次。
如何安装 Kai Realtime Voice?
在 OpenClaw 或 Claude Code 对话框中运行命令「/install kai-realtime-voice」即可一键安装,无需额外配置。
Kai Realtime Voice 是免费的吗?
是的,Kai Realtime Voice 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。
Kai Realtime Voice 支持哪些平台?
Kai Realtime Voice 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。
谁开发了 Kai Realtime Voice?
由 Blaze🔥(@ogdegenblaze)开发并维护,当前版本 v1.0.0。
推荐 Skills