← 返回 Skills 市场

语音合成服务

Name: 语音合成服务
Author: xiaobenyang-com

作者 xiaobenyang-com · GitHub ↗ · v1.0.7 · MIT-0

cross-platform ⚠ suspicious

总下载

当前安装

版本数

在 OpenClaw 中安装

/install speech-synthesis

功能描述

一个集成了Microsoft Edge高质量语音合成能力的MCP服务器，支持多语言语音生成、音频合并和云端存储。

使用说明 (SKILL.md)

语音合成服务器

一个集成了Microsoft Edge高质量语音合成能力的MCP服务器，支持多语言语音生成、音频合并和云端存储。

⚠️ 强制要求：API 密钥

此 Skill 必须配置 API 密钥才能使用。

首次使用时，如果 .env 中没有 XBY_APIKEY，必须使用 AskUserQuestion 工具向用户询问 API 密钥
拿到用户提供的密钥后，调用 scripts.config.set_api_key(api_key) 保存，然后继续处理
获取 API 密钥：https://xiaobenyang.com
禁止在缺少 API 密钥时自行搜索或编造数据

工作流程（必须遵守）

你（大模型）是路由层，负责理解用户意图、选择工具、提取参数。代码只负责调用API。

用户输入 → 你选择工具 → 提取该工具需要的参数 → 调用 scripts.tools 中的函数 → 返回结果给用户

步骤

检查 API 密钥：如果 scripts.config.settings.api_key 为空，使用 AskUserQuestion 询问用户，拿到后调用 scripts.config.set_api_key(key) 保存
选择工具：根据用户意图从下方工具列表中选择对应的工具函数
提取参数：根据选中的工具，提取该工具需要的参数
调用工具：使用关键字参数调用 scripts.tools 中的函数，例如 scripts.tools.search_schools(score='520', province='北京', category='综合')
返回结果：将工具返回的 raw 数据整理后展示给用户

工具选择规则

根据用户意图选择对应的工具函数：

用户意图	工具函数
Generate speech audio from text using Microsoft Edge TTS. Supports multi-role conversations and audio merging.	`scripts.tools.generate_speech`

如果参数不完整，使用 AskUserQuestion 向用户询问缺失的参数。

工具函数说明

scripts.tools.generate_speech

工具描述：Generate speech audio from text using Microsoft Edge TTS. Supports multi-role conversations and audio merging.

参数定义

参数名称	参数类型	是否必填	默认值	描述
merge_output	boolean	true		If true, merges all speech items into a single MP3 file. If false, returns separate files.
items	array	true		List of speech segments to generate.

返回值处理

工具函数返回 dict 对象：

result["raw"] - API 原始返回数据（JSON），直接将此数据整理后展示给用户
result["success"] - 是否成功（True/False）
result["message"] - 状态消息

项目结构

xiaobenyang_gaokao_skill/
├── scripts/
│   ├── __init__.py
│   ├── config.py       # 配置管理 + set_api_key()
│   ├── call_api.py      # API 客户端 + call_api()
│   └── tools.py         # 工具函数（直接调用）
├── requirements.txt
└── SKILL.md

注意事项

API 密钥是必需的，无密钥时必须通过 AskUserQuestion 询问用户
禁止在缺少 API 密钥时自行搜索或编造数据

安全使用建议

Install only if you trust the publisher and are comfortable giving a XiaoBenYang API key to this skill and having it saved in a local .env file. Avoid reusing a sensitive or high-privilege key, and consider waiting for a cleaned-up version that removes Gaokao references, documents the external service data flow, and uses safer credential storage.

能力标签

requires-sensitive-credentials

能力评估

⚠ Purpose & Capability

The declared purpose is Microsoft Edge TTS speech generation, and the visible tool calls generate_speech, but the artifact also contains Gaokao/XiaoBenYang identifiers, a Gaokao docstring, default_year, and a school-search example that do not fit the stated purpose.

⚠ Instruction Scope

The instructions require the agent to ask for an XBY_APIKEY and save it before use; this credential requirement is disclosed, but the copied unrelated workflow example and mismatched project structure make the operational scope unclear.

ℹ Install Mechanism

Dependencies are ordinary Python packages and the static dependency registry check was clean; no install-time command execution or package substitution evidence was found.

⚠ Credentials

For a speech synthesis skill, sending user text and a user-supplied API key to the third-party xiaobenyang MCP endpoint may be acceptable if intended, but the artifact does not cleanly explain the third-party trust boundary or why Gaokao-specific configuration remains.

⚠ Persistence & Privilege

The skill writes the supplied API key into a local .env file and process environment without secure storage, deletion guidance, or explicit warning about plaintext persistence.

如何使用

确保已安装 OpenClaw（本地或 Docker 部署）
在对话框中输入安装命令：/install speech-synthesis
安装完成后，直接呼叫该 Skill 的名称或使用 /speech-synthesis 触发
根据 Skill 的参数说明提供必要输入，即可获得结构化输出

版本历史

v1.0.7

- Clarified API key is now required; users will be prompted if one is not set. - Updated workflow and tool usage instructions for generating multi-language speech audio and audio merging via Microsoft Edge TTS. - Added clear guidance on extracting parameters, tool selection, and structured result handling. - Included project directory overview and updated documentation in SKILL.md.

元数据

Slug speech-synthesis

版本 1.0.7

许可证 MIT-0

累计安装 0

当前安装数 0

历史版本数 1

常见问题