ogdegenblaze

Kai Realtime Voice

by Blaze🔥 · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ⚠ suspicious
Downloads: 134 · Stars: 0 · Active Installs: 0 · Versions: 1
Install in OpenClaw
/install kai-realtime-voice
Description
Real-time voice streaming via MiniMax WebSocket API. Use for low-latency voice conversations and streaming audio generation.
README (SKILL.md)

Kai Realtime Voice

Real-time voice streaming using MiniMax WebSocket API.

Setup

The API key is already configured in the OpenClaw config. This skill uses the same key as kai-minimax-tts.

Usage

Test WebSocket connection:

bash {baseDir}/scripts/kai_realtime.sh --test

Stream text to audio:

bash {baseDir}/scripts/kai_realtime.sh --stream "Hello world"

Requirements

  • Python3 with websockets library
  • WebSocket support
  • MiniMax API key (auto-injected)
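
The requirements above can be checked before the first run. The following is a minimal sketch, not part of the skill: the function names (`have_cmd`, `check_tools`, `has_websockets`) are hypothetical helpers. It also covers `curl` and `xxd`, since the Capability Analysis on this page reports that the bundled script calls them.

```shell
#!/usr/bin/env bash
# Sketch only: report which tools mentioned on this page are present on PATH.
# All function names here are hypothetical helpers, not part of the skill.

# have_cmd NAME: succeeds if NAME resolves to a command.
have_cmd() {
  command -v "$1" >/dev/null 2>&1
}

# check_tools NAME...: print one status line per tool; return 1 if any is missing.
check_tools() {
  local missing=0 tool
  for tool in "$@"; do
    if have_cmd "$tool"; then
      echo "ok: $tool"
    else
      echo "missing: $tool"
      missing=1
    fi
  done
  return "$missing"
}

# has_websockets: succeeds if python3 can import the websockets package.
has_websockets() {
  have_cmd python3 && python3 -c 'import websockets' >/dev/null 2>&1
}
```

A typical call would be `check_tools python3 curl xxd`, followed by `has_websockets || echo "websockets is not importable"`.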

Output

Audio is streamed to stdout or saved to a file.

Usage Guidance
This skill appears to do what it claims (call the MiniMax t2a endpoint and save audio), but there are small inconsistencies you should understand before installing:
  1. SKILL.md mentions WebSocket streaming and Python websockets, but the bundled script currently uses a REST curl fallback, so you will not get true WebSocket low-latency streaming until that is implemented.
  2. The script reads optional env vars (KAI_MINIMAX_WORKSPACE, KAI_ENGLISH_VOICE_ID) that are not documented in requires.env. Check or set these to avoid files being written where you don't expect.
  3. It will send text and your MINIMAX_API_KEY to api.minimax.io, so verify that domain and consider using a scoped/test API key first.
If you need real-time WebSocket streaming, request or inspect the implementation update (a Python/websocket client) before relying on low-latency behavior. If you accept the REST fallback and understand where output files are saved, the footprint is small and the install risk is low.
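
To act on the guidance above, the environment can be inspected before invoking the script. This is a minimal sketch under stated assumptions: `kai_preflight` is a hypothetical helper, and it only reports whether the three variables named in this review are set. It does not know the script's actual default values.

```shell
#!/usr/bin/env bash
# Sketch only: inspect the environment before running kai_realtime.sh.
# kai_preflight is a hypothetical helper, not part of the skill.

# kai_preflight: print what the script would see; fail if the API key is unset.
kai_preflight() {
  if [ -z "${MINIMAX_API_KEY:-}" ]; then
    echo "MINIMAX_API_KEY is not set" >&2
    return 1
  fi
  # Never echo the key itself; report only that it exists.
  echo "MINIMAX_API_KEY: set"
  echo "KAI_MINIMAX_WORKSPACE: ${KAI_MINIMAX_WORKSPACE:-<unset, script default applies>}"
  echo "KAI_ENGLISH_VOICE_ID: ${KAI_ENGLISH_VOICE_ID:-<unset, script default applies>}"
}
```

For example, after `export KAI_MINIMAX_WORKSPACE="$HOME/kai-audio"` (an illustrative path), running `kai_preflight && bash {baseDir}/scripts/kai_realtime.sh --stream "Hello world"` would only start the script once the key is present.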
Capability Analysis
Type: OpenClaw Skill
Name: kai-realtime-voice
Version: 1.0.0
The skill bundle provides a bash script (`scripts/kai_realtime.sh`) to interact with the MiniMax API for text-to-speech generation. It uses standard tools like `curl` and `xxd` to communicate with the legitimate service endpoint (api.minimax.io) and process audio data. No evidence of malicious intent, data exfiltration, or unauthorized execution was found.
Capability Assessment
Purpose & Capability
Name/description (real‑time voice via MiniMax) lines up with the actual network calls (requests to https://api.minimax.io/v1/t2a_v2). The single required env var MINIMAX_API_KEY is expected for accessing the MiniMax API.
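
For readers who want to see what such a REST call might look like, here is a hedged sketch. It is not the bundled script. The endpoint URL and the use of `curl` and `xxd` come from this page. The model name, the JSON field names, and the hex-encoded `audio` response field are assumptions that should be checked against the MiniMax API documentation. `json_escape`, `build_payload`, and `synthesize` are hypothetical names.

```shell
#!/usr/bin/env bash
# Sketch only: one REST text-to-audio request. Not the skill's actual script.

API_URL="https://api.minimax.io/v1/t2a_v2"   # endpoint named on this page
MODEL="${MODEL:-speech-02-hd}"               # assumption: model name

# json_escape STRING: escape backslashes and double quotes.
# Control characters (newlines, tabs) are not handled in this sketch.
json_escape() {
  printf '%s' "$1" | sed -e 's/\\/\\\\/g' -e 's/"/\\"/g'
}

# build_payload TEXT VOICE_ID: print the JSON request body.
# Assumption: the field names below match what the API expects.
build_payload() {
  local text voice
  text=$(json_escape "$1")
  voice=$(json_escape "$2")
  printf '{"model":"%s","text":"%s","stream":false,"voice_setting":{"voice_id":"%s"}}' \
    "$MODEL" "$text" "$voice"
}

# synthesize TEXT VOICE_ID OUTFILE: POST the request and decode audio to OUTFILE.
# Assumption: the response carries the audio as a hex string in an "audio" field.
synthesize() {
  if [ -z "${MINIMAX_API_KEY:-}" ]; then
    echo "MINIMAX_API_KEY is not set" >&2
    return 1
  fi
  build_payload "$1" "$2" |
    curl -sS --fail -X POST "$API_URL" \
      -H "Authorization: Bearer $MINIMAX_API_KEY" \
      -H "Content-Type: application/json" \
      --data-binary @- |
    sed -n 's/.*"audio"[[:space:]]*:[[:space:]]*"\([0-9a-fA-F]*\)".*/\1/p' |
    xxd -r -p > "$3"
}
```

Nothing is sent until `synthesize` is called, for example `synthesize "Hello world" "$KAI_ENGLISH_VOICE_ID" out.mp3` (the output filename and format are illustrative). Running `build_payload` on its own is a quick way to review exactly what text would leave the machine.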
Instruction Scope
SKILL.md and README text advertise WebSocket-based real-time streaming and require 'Python3 with websockets', but the provided script uses curl and a REST fallback (no WebSocket implementation). The script also reads optional env vars KAI_MINIMAX_WORKSPACE and KAI_ENGLISH_VOICE_ID that are not listed in requires.env. These mismatches are scope/instruction inconsistencies that could confuse users and agents.
Install Mechanism
No install spec; this is instruction-only plus a shell script. No downloads or archive extraction. Low install risk (nothing is written outside the workspace when run, aside from the created output file).
Credentials
Only MINIMAX_API_KEY is declared and required, which is appropriate. The script makes optional use of KAI_MINIMAX_WORKSPACE and KAI_ENGLISH_VOICE_ID (defaults provided) — these are not declared in metadata and should be documented. The skill does not request unrelated credentials.
Persistence & Privilege
always:false and no autonomously persistent behavior. The script writes output to a workspace path (default under $HOME/.openclaw) but does not modify other skills or global configuration.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install kai-realtime-voice
  3. After installation, invoke the skill by name or use /kai-realtime-voice
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
- Initial release of kai-realtime-voice.
- Enables real-time voice streaming through the MiniMax WebSocket API.
- Supports low-latency voice conversations and streaming audio generation.
- Audio can be streamed to stdout or saved to a file.
- Requires Python3 with websockets, WebSocket support, and a MiniMax API key.
Metadata
Slug kai-realtime-voice
Version 1.0.0
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 1
Frequently Asked Questions

What is Kai Realtime Voice?

Real-time voice streaming via MiniMax WebSocket API. Use for low-latency voice conversations and streaming audio generation. It is an AI Agent Skill for Claude Code / OpenClaw, with 134 downloads so far.

How do I install Kai Realtime Voice?

Run "/install kai-realtime-voice" in the OpenClaw or Claude Code chat to install it in one step. The install itself needs no extra setup, but the skill expects MINIMAX_API_KEY to be available when it runs.

Is Kai Realtime Voice free?

Yes, Kai Realtime Voice is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Kai Realtime Voice support?

Kai Realtime Voice is cross-platform and runs anywhere OpenClaw / Claude Code is available.

Who created Kai Realtime Voice?

It is built and maintained by Blaze🔥 (@ogdegenblaze); the current version is v1.0.0.
