
Kai Realtime Voice

Name: Kai Realtime Voice
Author: ogdegenblaze

By Blaze🔥 · v1.0.0 · MIT-0

cross-platform ⚠ suspicious

134 total downloads

Install in OpenClaw

/install kai-realtime-voice

Description

Real-time voice streaming via MiniMax WebSocket API. Use for low-latency voice conversations and streaming audio generation.

Usage instructions (SKILL.md)

Kai Realtime Voice

Real-time voice streaming using MiniMax WebSocket API.

Setup

The API key is already configured in the OpenClaw config. This skill uses the same key as kai-minimax-tts.

Usage

Test WebSocket connection:

bash {baseDir}/scripts/kai_realtime.sh --test

Stream text to audio:

bash {baseDir}/scripts/kai_realtime.sh --stream "Hello world"

Requirements

Python3 with websockets library
WebSocket support
MiniMax API key (auto-injected)

Output

Audio streamed to stdout or saved to file.

Security recommendations

This skill appears to do what it claims (call MiniMax t2a endpoint and save audio), but there are small inconsistencies you should understand before installing: (1) SKILL.md mentions WebSocket streaming and Python websockets, but the bundled script currently uses a REST curl fallback — so you will not get true WebSocket low-latency streaming until that is implemented; (2) the script reads optional env vars (KAI_MINIMAX_WORKSPACE, KAI_ENGLISH_VOICE_ID) that are not documented in requires.env — check or set these to avoid files being written where you don't expect; (3) it will send text and your MINIMAX_API_KEY to api.minimax.io, so verify that domain and consider using a scoped/test API key first. If you need real-time WebSocket streaming, request or inspect the implementation update (a Python/websocket client) before relying on low-latency behavior. If you accept the REST fallback and understand where output files are saved, the footprint is small and the install risk is low.
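The advice above can be turned into a small preflight step. This is a minimal sketch, not part of the skill: the `preflight` function and the `kai-output` directory name are hypothetical, and it assumes that `KAI_MINIMAX_WORKSPACE` controls where the script writes its output, as the review suggests.

```shell
# Preflight sketch (hypothetical helper, not shipped with the skill).
# Assumption: KAI_MINIMAX_WORKSPACE decides where output files are written.
preflight() {
  # Refuse to continue without an API key.
  if [ -z "${MINIMAX_API_KEY:-}" ]; then
    echo "MINIMAX_API_KEY is not set" >&2
    return 1
  fi
  # Pin an explicit output directory instead of relying on the script default.
  export KAI_MINIMAX_WORKSPACE="${KAI_MINIMAX_WORKSPACE:-$PWD/kai-output}"
  mkdir -p "$KAI_MINIMAX_WORKSPACE"
  echo "output directory: $KAI_MINIMAX_WORKSPACE"
}
```

A possible use is `preflight && bash {baseDir}/scripts/kai_realtime.sh --stream "Hello world"`, ideally with a scoped or test API key for the first run.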

Functional analysis

Type: OpenClaw Skill
Name: kai-realtime-voice
Version: 1.0.0

The skill bundle provides a bash script (`scripts/kai_realtime.sh`) to interact with the MiniMax API for text-to-speech generation. It uses standard tools like `curl` and `xxd` to communicate with the legitimate service endpoint (api.minimax.io) and process audio data. No evidence of malicious intent, data exfiltration, or unauthorized execution was found.
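To illustrate why `xxd` might appear next to `curl`: a plausible reading is that the API response carries the audio as a hex string, which is then decoded into bytes. The sketch below shows only that decoding step. The `hex_to_audio` helper is hypothetical, and the hex-encoded response format is an assumption, not something confirmed from the bundled script.

```shell
# hex_to_audio OUTFILE: read a plain hex string on stdin, write decoded bytes.
# Hypothetical helper; assumes the API returns audio as hex text.
hex_to_audio() {
  # -r reverses a hex dump; -p expects plain hex without offsets or columns.
  xxd -r -p > "$1"
}
```

For example, `printf '48656c6c6f' | hex_to_audio out.bin` writes the five bytes of `Hello` to `out.bin`.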

Capability assessment

✓ Purpose & Capability

Name/description (real‑time voice via MiniMax) lines up with the actual network calls (requests to https://api.minimax.io/v1/t2a_v2). The single required env var MINIMAX_API_KEY is expected for accessing the MiniMax API.

⚠ Instruction Scope

SKILL.md and README text advertise WebSocket-based real-time streaming and require 'Python3 with websockets', but the provided script uses curl and a REST fallback (no WebSocket implementation). The script also reads optional env vars KAI_MINIMAX_WORKSPACE and KAI_ENGLISH_VOICE_ID that are not listed in requires.env. These mismatches are scope/instruction inconsistencies that could confuse users and agents.
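One way to check the mismatch described above before relying on low-latency behavior is to look at what the script actually references. This is a rough heuristic sketch: `detect_transport` is a hypothetical helper, and matching on `wss://`, `websockets`, or `curl` only suggests the transport in use rather than proving it.

```shell
# detect_transport FILE: guess the transport a script uses from its text.
# Hypothetical heuristic; a keyword match is a hint, not proof.
detect_transport() {
  if grep -q -E 'wss://|websockets' "$1"; then
    echo "websocket"
  elif grep -q 'curl' "$1"; then
    echo "rest"
  else
    echo "unknown"
  fi
}
```

Running `detect_transport {baseDir}/scripts/kai_realtime.sh` would, if the review is accurate, print `rest` for the current release.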

✓ Install Mechanism

No install spec; this is instruction-only plus a shell script. No downloads or archive extraction. Low install risk (nothing is written outside the workspace when run, aside from created output file).

ℹ Credentials

Only MINIMAX_API_KEY is declared and required, which is appropriate. The script makes optional use of KAI_MINIMAX_WORKSPACE and KAI_ENGLISH_VOICE_ID (defaults provided) — these are not declared in metadata and should be documented. The skill does not request unrelated credentials.

✓ Persistence & Privilege

always:false and no autonomously persistent behavior. The script writes output to a workspace path (default under $HOME/.openclaw) but does not modify other skills or global configuration.

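How to use

Make sure OpenClaw is installed (locally or via Docker)
Enter the install command in the chat box: /install kai-realtime-voice
Once installation completes, call the Skill by name or trigger it with /kai-realtime-voice
Provide the required inputs according to the Skill's parameter description to get structured output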

Version history

v1.0.0

- Initial release of kai-realtime-voice.
- Enables real-time voice streaming through the MiniMax WebSocket API.
- Supports low-latency voice conversations and streaming audio generation.
- Audio can be streamed to stdout or saved to a file.
- Requires Python3 with websockets, WebSocket support, and a MiniMax API key.

Metadata

Slug kai-realtime-voice

Version 1.0.0

License MIT-0

Total installs 0

Current installs 0

Number of versions 1
