← 返回 Skills 市场

Coze Tts

Name: Coze Tts
Author: franklu0819-lang

作者 xiaofei · GitHub ↗ · v1.0.3 · MIT-0

cross-platform ✓ 安全检测通过

259

总下载

当前安装

版本数

在 OpenClaw 中安装

/install coze-tts

功能描述

Text-to-Speech (TTS) using Coze API. Convert text to natural-sounding speech audio files. Supports multiple voices and output formats (mp3, ogg_opus, wav, pcm).

使用说明 (SKILL.md)

Coze Text-to-Speech (TTS)

Convert text to natural-sounding speech using Coze API.

Setup

1. Get your API Key: Get a key from Coze Platform

2. Set it in your environment:

export COZE_API_KEY="your-key-here"

Supported Output Formats

MP3 - Default format, widely compatible
OGG_OPUS - Optimized for streaming and messaging
WAV - Uncompressed audio
PCM - Raw audio data

Usage

Basic TTS

Convert text to speech with default settings:

bash scripts/text_to_speech.sh "你好，这是测试语音"

Save to Specific File

bash scripts/text_to_speech.sh "你好世界" -o output.mp3

Use Different Voice

bash scripts/text_to_speech.sh "你好" -v 2

Change Output Format

bash scripts/text_to_speech.sh "你好" -f ogg_opus

Full Options

bash scripts/text_to_speech.sh "要转换的文本" -o output.mp3 -v 1 -f mp3

Parameters:

text (required): Text to convert to speech
-o, --output (optional): Output file path (default: auto-generated)
-v, --voice (optional): Voice ID (default: 1)
-f, --format (optional): Output format - mp3/ogg_opus/wav/pcm (default: mp3)

Output

The script saves the audio file and outputs:

File path
File size
Audio duration (if ffprobe is available)

Example output:

✓ Audio saved: coze_tts_20260324_235030_a1b2c3d4.mp3
  Size: 25.3 KB
  Duration: ~3 seconds

Workflow Examples

Generate Notification Audio

bash scripts/text_to_speech.sh "您有一条新消息" -o notification.mp3

Create Voice Greeting

bash scripts/text_to_speech.sh "欢迎使用 Coze 语音服务" -v 2 -o greeting.mp3

Generate OGG for Messaging

bash scripts/text_to_speech.sh "你好" -f ogg_opus -o message.ogg

Batch Generate

for text in "你好" "谢谢" "再见"; do
    bash scripts/text_to_speech.sh "$text" -o "${text}.mp3"
done

Integration with Other Skills

Combine with coze-asr for voice conversation:

# 1. User speaks -> ASR converts to text
bash coze-asr/scripts/speech_to_text.sh input.ogg

# 2. Process text with AI...

# 3. AI response -> TTS converts to speech
bash coze-tts/scripts/text_to_speech.sh "AI的回复" -o response.mp3

Troubleshooting

Authentication Error:

Check COZE_API_KEY is set correctly
Verify API key has TTS permissions

Invalid Voice ID:

Voice ID should be a number (int64 format)
Try voice_id: 1 as default

File Not Created:

Check write permissions in output directory
Ensure sufficient disk space

Limitations

Text length limits apply (check Coze documentation)
Rate limits may apply based on your plan
Some voices may not support all output formats

API Reference

Endpoint: POST https://api.coze.cn/v1/audio/speech
Authentication: Bearer token (COZE_API_KEY)
Content-Type: application/json

Required Environment Variables

Variable	Description	Required
`COZE_API_KEY`	Coze API authentication key	Yes

Required Tools

Tool	Purpose	Required
`jq`	JSON processing	Yes
`ffprobe`	Audio duration detection	Optional

License

MIT

安全使用建议

This skill is coherent with its TTS purpose, but review before installing: (1) Confirm the API endpoint (https://api.coze.cn) and that you trust Coze and your API key—the script will send the text you provide to that external service. (2) Note the mismatch between documented default voice (1) and the script's VOICE_ID=6—test and adjust the default if needed. (3) The metadata/version in _meta.json differs from registry metadata; this is likely packaging drift but worth noting. (4) The skill declares jq as required but the script also expects curl, md5sum, stat, bc (and optionally ffprobe); ensure those tools exist on your system. (5) Limit the scope of the COZE_API_KEY (use least privilege / appropriate plan) and do not expose it publicly. If any of these points worry you or you need the script to behave differently, inspect or modify the script locally before use.

功能分析

Type: OpenClaw Skill Name: coze-tts Version: 1.0.3 The skill is a legitimate Text-to-Speech utility that interfaces with the official Coze API (api.coze.cn). The primary script, scripts/text_to_speech.sh, uses standard tools like curl and jq to send user-provided text to the API and save the resulting audio file, with no evidence of data exfiltration, malicious execution, or prompt injection.

能力评估

ℹ Purpose & Capability

Name/description align with the files and behavior: the script posts text to https://api.coze.cn/v1/audio/speech and saves audio. However there are minor inconsistencies: SKILL.md and references state default voice_id is 1, while the script sets VOICE_ID=6 and help text claims default 1. _meta.json version (1.0.2) differs from registry metadata (1.0.3). These look like packaging/documentation drift, not maliciousness.

✓ Instruction Scope

SKILL.md instructs running the included shell script and only documents use of COZE_API_KEY and jq; the script's runtime actions are confined to building JSON, calling the documented Coze API endpoint, writing an audio file locally, and optionally using ffprobe. It does not attempt to read unrelated system files or other env vars.

✓ Install Mechanism

This is an instruction-only skill with a shipped shell script and no install spec or remote downloads. Nothing is pulled from arbitrary URLs or executed during install.

ℹ Credentials

The only required env var is COZE_API_KEY which is appropriate for calling the Coze service. One minor proportionality issue: required binaries lists only jq, but the script also uses common utilities (curl, md5sum, stat, bc, date, ffprobe optional). These are typical but should be documented explicitly.

✓ Persistence & Privilege

The skill does not request elevated or persistent platform privileges (always:false). It does not modify other skills or system-wide settings.

如何使用

确保已安装 OpenClaw（本地或 Docker 部署）
在对话框中输入安装命令：/install coze-tts
安装完成后，直接呼叫该 Skill 的名称或使用 /coze-tts 触发
根据 Skill 的参数说明提供必要输入，即可获得结构化输出

版本历史

v1.0.3

默认音色改为 6

v1.0.2

修复脚本语法错误，添加API参考文档

v1.0.1

优化：移除未声明的飞书集成脚本，更新文档以准确反映功能

v1.0.0

首次发布：Coze 语音合成技能，支持 mp3/ogg_opus/wav/pcm 格式，支持语速调整

元数据

Slug coze-tts

版本 1.0.3

许可证 MIT-0

累计安装 0

当前安装数 0

历史版本数 4

常见问题

Coze Tts 是什么？

Text-to-Speech (TTS) using Coze API. Convert text to natural-sounding speech audio files. Supports multiple voices and output formats (mp3, ogg_opus, wav, pcm). 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件，目前累计下载 259 次。

如何安装 Coze Tts？

在 OpenClaw 或 Claude Code 对话框中运行命令「/install coze-tts」即可一键安装，无需额外配置。

Coze Tts 是免费的吗？

是的，Coze Tts 完全免费，采用 MIT-0 许可证，可自由下载、安装和使用。

Coze Tts 支持哪些平台？

Coze Tts 跨平台运行，可在任意部署了 OpenClaw / Claude Code 的环境中使用（cross-platform）。

谁开发了 Coze Tts？

由 xiaofei（@franklu0819-lang）开发并维护，当前版本 v1.0.3。