Kai Realtime Voice

Name: Kai Realtime Voice
Author: ogdegenblaze

by Blaze🔥 · GitHub ↗ · v1.0.0 · MIT-0

cross-platform ⚠ suspicious

134 Downloads

Install in OpenClaw

/install kai-realtime-voice

Description

Real-time voice streaming via MiniMax WebSocket API. Use for low-latency voice conversations and streaming audio generation.

README (SKILL.md)

Kai Realtime Voice

Real-time voice streaming using MiniMax WebSocket API.

Setup

The API key is already configured in the OpenClaw config. This skill uses the same key as kai-minimax-tts.

Usage

Test WebSocket connection:

bash {baseDir}/scripts/kai_realtime.sh --test

Stream text to audio:

bash {baseDir}/scripts/kai_realtime.sh --stream "Hello world"

Requirements

Python3 with websockets library
WebSocket support
MiniMax API key (auto-injected)
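
The requirements above can be checked before the script is run. The following is a minimal sketch, not part of the skill: the `preflight` helper is a hypothetical name, and it assumes the key is exposed to the process as the `MINIMAX_API_KEY` environment variable, as the audit further down states.

```python
import importlib.util
import os


def preflight(env=None):
    """Return a list of human-readable problems; an empty list means ready.

    `preflight` is a hypothetical helper for illustration only.
    """
    env = os.environ if env is None else env
    problems = []
    # The README lists the Python `websockets` package as a requirement.
    if importlib.util.find_spec("websockets") is None:
        problems.append("Python package 'websockets' is not installed")
    # MINIMAX_API_KEY is the one env var the skill declares as required.
    if not env.get("MINIMAX_API_KEY"):
        problems.append("MINIMAX_API_KEY is not set")
    return problems


if __name__ == "__main__":
    for problem in preflight():
        print("missing:", problem)
```

Running it with no output suggests both requirements are met on the current machine.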

Output

Audio streamed to stdout or saved to file.

Usage Guidance

This skill appears to do what it claims (call the MiniMax t2a endpoint and save audio), but there are small inconsistencies you should understand before installing:

(1) SKILL.md mentions WebSocket streaming and Python websockets, but the bundled script currently uses a REST curl fallback, so you will not get true low-latency WebSocket streaming until that is implemented.

(2) The script reads optional env vars (KAI_MINIMAX_WORKSPACE, KAI_ENGLISH_VOICE_ID) that are not documented in requires.env. Check or set these to avoid files being written where you don't expect.

(3) It will send text and your MINIMAX_API_KEY to api.minimax.io, so verify that domain and consider using a scoped/test API key first.

If you need real-time WebSocket streaming, request or inspect the implementation update (a Python/websocket client) before relying on low-latency behavior. If you accept the REST fallback and understand where output files are saved, the footprint is small and the install risk is low.
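
To see where output would land before running the script, the two undocumented variables can be inspected up front. This is a rough sketch under stated assumptions: `resolve_settings` is a hypothetical helper, and the fallback directory is only a stand-in, since the audit says the default is "under $HOME/.openclaw" without giving the exact subdirectory.

```python
import os
from pathlib import Path


def resolve_settings(env=None, home=None):
    """Report which optional variables are set and where output may go.

    Hypothetical helper; the real script's default path may differ.
    """
    env = os.environ if env is None else env
    home = Path(home) if home is not None else Path.home()
    workspace = env.get("KAI_MINIMAX_WORKSPACE")
    return {
        # Assumption: fall back to $HOME/.openclaw when the variable is unset.
        "workspace": Path(workspace) if workspace else home / ".openclaw",
        "workspace_is_default": not workspace,
        # None means the script's own built-in default voice would be used.
        "voice_id": env.get("KAI_ENGLISH_VOICE_ID"),
    }


if __name__ == "__main__":
    for key, value in resolve_settings().items():
        print(f"{key}: {value}")
```

Setting `KAI_MINIMAX_WORKSPACE` explicitly is the simplest way to avoid surprises about where files are written.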

Capability Analysis

Type: OpenClaw Skill

Name: kai-realtime-voice

Version: 1.0.0

The skill bundle provides a bash script (`scripts/kai_realtime.sh`) to interact with the MiniMax API for text-to-speech generation. It uses standard tools like `curl` and `xxd` to communicate with the legitimate service endpoint (api.minimax.io) and process audio data. No evidence of malicious intent, data exfiltration, or unauthorized execution was found.
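
The use of `xxd` suggests the audio arrives as hex-encoded text that has to be converted back to bytes. That is an assumption, not something the audit states. Under that assumption, the decoding and output step might look roughly like the sketch below, where `decode_hex_audio` and `emit_audio` are hypothetical names.

```python
import sys


def decode_hex_audio(hex_payload: str) -> bytes:
    """Convert a hex string to raw bytes (assumed response encoding)."""
    # bytes.fromhex tolerates whitespace between byte pairs.
    return bytes.fromhex(hex_payload.strip())


def emit_audio(audio: bytes, out_path=None) -> None:
    """Write audio to a file, or to stdout when no path is given."""
    if out_path is None:
        # Use the binary buffer so the bytes are not re-encoded as text.
        sys.stdout.buffer.write(audio)
        sys.stdout.buffer.flush()
    else:
        with open(out_path, "wb") as fh:
            fh.write(audio)
```

For example, `decode_hex_audio("494433")` yields `b"ID3"`, the tag that commonly opens an MP3 file.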

Capability Assessment

✓ Purpose & Capability

Name/description (real‑time voice via MiniMax) lines up with the actual network calls (requests to https://api.minimax.io/v1/t2a_v2). The single required env var MINIMAX_API_KEY is expected for accessing the MiniMax API.
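
For readers who want to inspect what such a call carries, the sketch below builds a request to that endpoint without sending it. Only the URL and the use of `MINIMAX_API_KEY` come from the audit. The Bearer header scheme and the payload field names are assumptions for illustration and should be checked against the MiniMax documentation. `build_t2a_request` is a hypothetical helper.

```python
import json
import urllib.request

T2A_URL = "https://api.minimax.io/v1/t2a_v2"  # endpoint named in the audit


def build_t2a_request(text, api_key, voice_id=None):
    """Build, but do not send, a POST request for text-to-audio."""
    # Assumption: field names are illustrative; the real API may
    # require additional fields.
    payload = {"text": text}
    if voice_id:
        payload["voice_setting"] = {"voice_id": voice_id}
    return urllib.request.Request(
        T2A_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            # Assumption: the key is sent as a Bearer token.
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )
```

Because nothing is sent, `req.full_url`, `req.data`, and the headers can be reviewed to confirm that only the text and the key would leave the machine.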

⚠ Instruction Scope

SKILL.md and README text advertise WebSocket-based real-time streaming and require 'Python3 with websockets', but the provided script uses curl and a REST fallback (no WebSocket implementation). The script also reads optional env vars KAI_MINIMAX_WORKSPACE and KAI_ENGLISH_VOICE_ID that are not listed in requires.env. These mismatches are scope/instruction inconsistencies that could confuse users and agents.

✓ Install Mechanism

No install spec; this is instruction-only plus a shell script. No downloads or archive extraction. Low install risk (nothing is written outside the workspace when run, aside from the created output file).

ℹ Credentials

Only MINIMAX_API_KEY is declared and required, which is appropriate. The script makes optional use of KAI_MINIMAX_WORKSPACE and KAI_ENGLISH_VOICE_ID (defaults provided) — these are not declared in metadata and should be documented. The skill does not request unrelated credentials.

✓ Persistence & Privilege

always:false and no autonomously persistent behavior. The script writes output to a workspace path (default under $HOME/.openclaw) but does not modify other skills or global configuration.

How to Use

Make sure OpenClaw is installed (local or Docker)
Run the install command in chat: /install kai-realtime-voice
After installation, invoke the skill by name or use /kai-realtime-voice
Provide required inputs per the skill's parameter spec and get structured output

Version History

v1.0.0

- Initial release of kai-realtime-voice.
- Enables real-time voice streaming through the MiniMax WebSocket API.
- Supports low-latency voice conversations and streaming audio generation.
- Audio can be streamed to stdout or saved to a file.
- Requires Python3 with websockets, WebSocket support, and a MiniMax API key.

Metadata

Slug kai-realtime-voice

Version 1.0.0

License MIT-0

All-time Installs 0

Active Installs 0

Total Versions 1

Frequently Asked Questions

What is Kai Realtime Voice?

Kai Realtime Voice is an AI agent skill for Claude Code / OpenClaw that provides real-time voice streaming via the MiniMax WebSocket API. It is intended for low-latency voice conversations and streaming audio generation, and has 134 downloads so far.

How do I install Kai Realtime Voice?

Run "/install kai-realtime-voice" in the OpenClaw or Claude Code chat to install it in one step. The only prerequisite is a MiniMax API key (MINIMAX_API_KEY) in your OpenClaw config.

Is Kai Realtime Voice free?

Yes, Kai Realtime Voice is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Kai Realtime Voice support?

Kai Realtime Voice is cross-platform and runs anywhere OpenClaw / Claude Code is available.

Who created Kai Realtime Voice?

It is built and maintained by Blaze🔥 (@ogdegenblaze); the current version is v1.0.0.
