← 返回 Skills 市场
scikkk

Clone Wizard

作者 scikkk · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ✓ 安全检测通过
224
总下载
0
收藏
0
当前安装
1
版本数
在 OpenClaw 中安装
/install clone-wizard
功能描述
Guided voice cloning workflow — from recording tips to first playback. Use when users want to clone their voice, create a custom voice, or ask "怎么克隆声音", "我想用...
使用说明 (SKILL.md)

SenseAudio Clone Wizard

A guided experience for voice cloning. The API itself is simple — the hard part is getting a clean recording. This skill handles that.

Note: SenseAudio does not support cloning via API. The actual cloning happens on the platform at https://senseaudio.cn. This skill guides the user through the full process and handles the steps that can be automated.


Phase 1: Before Recording

When a user says they want to clone their voice, show this guide before they record anything:


录音前请注意:

  • 环境:找一个安静的房间,关掉空调、风扇、电视
  • 距离:嘴巴距离麦克风约 20cm,不要太近(会爆音)也不要太远(声音发虚)
  • 时长:3–30 秒,朗读一段自然的文字效果最好
  • 格式:MP3、WAV 或 AAC,文件不超过 50MB
  • 内容:朗读一段话,语速自然,避免长时间停顿

推荐朗读内容(约 15 秒):

"大家好,我是 [你的名字]。今天天气不错,阳光明媚。我喜欢在这样的日子里出去走走,感受大自然的美好。希望每一天都能这样轻松愉快。"

录好后把文件发给我,我来帮你检测质量。


Phase 2: Audio Quality Check

When the user uploads an audio file, run the quality check:

RESULT=$(curl -s -X POST https://api.senseaudio.cn/v1/audio/analysis \
  -H "Authorization: Bearer $SENSEAUDIO_API_KEY" \
  -F "model=sense-asr-check" \
  -F "file=@\x3CAUDIO_FILE>")

Parse the response and translate it into plain language:

Noise score interpretation:

noise_score What to tell the user
0.0 – 0.2 录音质量很好,可以直接提交克隆
0.2 – 0.4 录音质量不错,有轻微噪声,克隆效果可能略有影响,建议在条件允许时重录
0.4 – 0.6 录音噪声较明显,建议重录以获得更好的克隆效果
0.6 – 1.0 录音噪声严重,强烈建议重录,否则克隆出的音色可能失真

Noise type translations:

noise_types value 告诉用户
background_noise 背景噪声(周围环境声)
hum 嗡嗡声(可能是空调、电器)
static 静电杂音(可能是线材或设备问题)
wind 风噪(话筒离嘴太近或有风)
echo / reverb 回声/混响(房间太空旷,建议换小房间或靠近软装)

Example diagnosis output:

你的录音检测结果:
- 时长:14.3 秒 ✓
- 噪声评分:0.52(中等)
- 检测到:嗡嗡声(可能是空调)

建议:关掉空调后重新录一段。如果条件不允许,可以继续提交,
但克隆出的音色可能带有轻微底噪。要继续还是重录?

If has_noise is false or score \x3C 0.2, skip the warning and proceed directly to Phase 3.

Phase 3: Platform Cloning

The cloning itself must be done on the SenseAudio platform — the API does not support direct cloning requests.

Guide the user:


录音质量通过!接下来去平台完成克隆:

  1. 打开 https://senseaudio.cn,登录你的账号
  2. 进入「音色克隆」页面
  3. 点击「上传音频」,选择你刚才的录音文件
  4. 系统会自动训练,通常几秒内完成
  5. 训练完成后,在「我的音色」中找到新音色,复制它的 voice_id
  6. 把 voice_id 发给我,我来帮你生成第一段试听

注意槽位: 每个账号有有限的克隆槽位。如果槽位已满,需要先删除旧音色才能克隆新的。


Phase 4: First Playback

Once the user provides their voice_id, synthesize a welcome message so they can hear their AI voice for the first time:

Default preview text (warm and personal):

"你好!这是我的 AI 声音。从今天起,我可以用这个声音说任何我想说的话了。"

curl -s -X POST https://api.senseaudio.cn/v1/t2a_v2 \
  -H "Authorization: Bearer $SENSEAUDIO_API_KEY" \
  -H "Content-Type: application/json" \
  -d "{
    \"model\": \"SenseAudio-TTS-1.0\",
    \"text\": \"你好!这是我的 AI 声音。从今天起,我可以用这个声音说任何我想说的话了。\",
    \"stream\": false,
    \"voice_setting\": { \"voice_id\": \"\x3CVOICE_ID>\" },
    \"audio_setting\": { \"format\": \"mp3\" }
  }" -o preview.json

jq -r '.data.audio' preview.json | xxd -r -p > my_voice_preview.mp3

After generating: tell the user the file is ready, and mention they can now use this voice_id with any SenseAudio TTS feature — including the senseaudio-tts-quick skill for quick synthesis or senseaudio-polyglot-tts for precise pronunciation control.

安全使用建议
This skill is coherent with its stated purpose, but consider the following before installing or using it: - Privacy: voice recordings are biometric data. Only upload recordings you control and are comfortable sharing with SenseAudio. Review senseaudio.cn's privacy policy and terms for voice cloning and retention. - API key safety: the skill needs your SENSEAUDIO_API_KEY. Keep that key secret and do not paste it into unfamiliar places. If the agent stores files or logs, the key should not be written to logs or public locations. - Files written to disk: the skill's examples write preview.json and my_voice_preview.mp3; verify where the agent stores these files and who can access them on your device or workspace. - Authenticity: the endpoints used are api.senseaudio.cn and senseaudio.cn which match the Homepage in metadata. If you obtained a different SDK or plugin, confirm it points to the official domain. - Consent & legal: cloning voices may have legal/ethical implications (consent from the recorded person). Ensure you have the right to clone the voice. If you want stronger assurance, ask the skill author for: a privacy/security statement from senseaudio.cn, or concrete details on how the agent handles temporary files and logs. Otherwise this skill appears internally consistent and proportionate to its stated function.
功能分析
Type: OpenClaw Skill Name: clone-wizard Version: 1.0.0 The skill provides a legitimate guided workflow for voice cloning via the SenseAudio platform. It uses standard tools (curl, jq, xxd) to perform audio quality analysis and generate voice previews through the official API (api.senseaudio.cn), with no evidence of malicious intent or data exfiltration.
能力评估
Purpose & Capability
Name/description, required env var (SENSEAUDIO_API_KEY), required binaries (curl, jq, xxd), and referenced endpoints (senseaudio.cn, api.senseaudio.cn) are consistent with a guided workflow that calls SenseAudio APIs for analysis and TTS preview. No unrelated services or credentials are requested.
Instruction Scope
SKILL.md instructs the agent to POST user audio for analysis, to call the TTS endpoint with a voice_id, and to write/read local files (preview.json, my_voice_preview.mp3). These are within the stated purpose, but they involve handling users' audio files (sensitive biometric data) and writing output to disk — the user should be aware of where files are stored and how they are shared.
Install Mechanism
No install spec and no code files (instruction-only) — lowest-risk install model. The skill relies on standard CLI tools (curl, jq, xxd) already declared and used in examples; nothing is downloaded from arbitrary URLs.
Credentials
Only a single credential (SENSEAUDIO_API_KEY) is required and is used by the example curl commands. No unrelated secrets or system config paths are requested. This is proportionate to calling the provider's API.
Persistence & Privilege
Skill is user-invocable and not always-enabled; it does not request permanent presence or modify other skills/system settings. Autonomous invocation is allowed by default, which is normal and not by itself a concern here.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install clone-wizard
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /clone-wizard 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.0
senseaudio-clone-wizard 1.0.0 - Introduces a step-by-step guided workflow for voice cloning via SenseAudio, supporting both English and Chinese requests. - Helps users prepare optimal recordings with practical tips and sample texts. - Automates audio quality checks and provides user-friendly feedback and actionable suggestions. - Guides users through the required steps on the SenseAudio platform for actual cloning. - Assists with first-time playback by using the new cloned voice for an instant preview and recommends next steps to use their custom voice.
元数据
Slug clone-wizard
版本 1.0.0
许可证 MIT-0
累计安装 0
当前安装数 0
历史版本数 1
常见问题

Clone Wizard 是什么?

Guided voice cloning workflow — from recording tips to first playback. Use when users want to clone their voice, create a custom voice, or ask "怎么克隆声音", "我想用... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 224 次。

如何安装 Clone Wizard?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install clone-wizard」即可一键安装,无需额外配置。

Clone Wizard 是免费的吗?

是的,Clone Wizard 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。

Clone Wizard 支持哪些平台?

Clone Wizard 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 Clone Wizard?

由 scikkk(@scikkk)开发并维护,当前版本 v1.0.0。

💬 留言讨论