← Back to Skills Marketplace

SOTA Zero-shot Voice Cloning TTS

Name: SOTA Zero-shot Voice Cloning TTS
Author: xiami2019

by Qinyuan Cheng · GitHub ↗ · v1.0.0

cross-platform ⚠ suspicious

421

Downloads

Stars

Active Installs

Versions

Install in OpenClaw

/install moss-tts

Description

Voice-first OpenClaw skill powered by MOSS APIs. Use when a user wants spoken replies in a preferred timbre, either from an existing voice_id or from a refer...

Usage Guidance

Before installing, confirm the provider and provenance: ask the publisher to (1) update registry metadata to list the required MOSI_API_KEY and MOSI_BASE_URL, (2) provide a homepage or source repo so you can verify the service and privacy policy, and (3) clarify whether 'MOSS' vs 'MOSI' is a typo or different provider. Treat the MOSI_API_KEY like any secret — only grant a key scoped to minimal permissions and avoid reusing high-privilege keys. Understand that using local audio will upload user files to the external service (privacy risk); if that is unacceptable, do not enable local-file uploads. If you proceed, consider restricting the skill's network access or using an API key with limited quota/expiration, and verify logs to ensure keys are never echoed.

Capability Analysis

Type: OpenClaw Skill Name: moss-tts Version: 1.0.0 The skill defines a workflow that involves fetching content from a user-provided `reference_audio` URL and uploading files specified by a `local audio path` to an external API (`https://studio.mosi.cn`). While these actions are necessary for the skill's stated purpose (TTS with custom voices), they introduce potential Server-Side Request Forgery (SSRF) and arbitrary file upload vulnerabilities if the AI agent's underlying execution environment does not properly sanitize or restrict these inputs. The `SKILL.md` file does not contain explicit malicious instructions, but the capabilities themselves are high-risk if exploited due to implementation flaws in the agent.

Capability Assessment

⚠ Purpose & Capability

The skill claims to be a 'MOSS' TTS integration but the SKILL.md uses 'MOSI' names (MOSI_API_KEY, MOSI_BASE_URL) and a default host https://studio.mosi.cn; registry metadata lists no required env vars or homepage/source. Asking for an API key and the ability to upload local audio is coherent with a TTS/cloning skill, but the mismatch between the declared registry metadata and the runtime requirements (missing required env var in metadata, no homepage/source) is an incoherence that needs explanation.

ℹ Instruction Scope

Instructions are specific: accept text and a voice source, upload local audio via POST /api/v1/files/upload, call voice clone and TTS endpoints, poll for status, decode base64 audio, and return a file path and metadata. These actions are consistent with voice-cloning TTS functionality but include transmitting local audio files to an external service — users should be aware of privacy implications. The SKILL.md does instruct to not log API keys, which is good.

✓ Install Mechanism

No install spec and no code files (instruction-only) — lowest installation risk. Nothing is written to disk by an installer here; runtime network calls and file uploads are the main runtime surface.

⚠ Credentials

The SKILL.md requires MOSI_API_KEY (and optionally MOSI_BASE_URL), which is proportionate for an external TTS API. However, the registry metadata claims no required env vars — that discrepancy is a red flag. Also there is no documented owner homepage or source to verify what MOSI_API_KEY grants or what data is retained by the service.

✓ Persistence & Privilege

The skill is not always-enabled and uses normal autonomous invocation defaults. It does not request any system-level persistence or modify other skill configurations (based on SKILL.md).

How to Use

Make sure OpenClaw is installed (local or Docker)
Run the install command in chat: /install moss-tts
After installation, invoke the skill by name or use /moss-tts
Provide required inputs per the skill's parameter spec and get structured output

Version History

v1.0.0

Initial release

Metadata

Slug moss-tts

Version 1.0.0

License —

All-time Installs 0

Active Installs 0

Total Versions 1

Frequently Asked Questions

What is SOTA Zero-shot Voice Cloning TTS?

Voice-first OpenClaw skill powered by MOSS APIs. Use when a user wants spoken replies in a preferred timbre, either from an existing voice_id or from a refer... It is an AI Agent Skill for Claude Code / OpenClaw, with 421 downloads so far.

How do I install SOTA Zero-shot Voice Cloning TTS?

Run "/install moss-tts" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is SOTA Zero-shot Voice Cloning TTS free?

Yes, SOTA Zero-shot Voice Cloning TTS is completely free (open-source). You can download, install and use it at no cost.

Which platforms does SOTA Zero-shot Voice Cloning TTS support?

SOTA Zero-shot Voice Cloning TTS is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created SOTA Zero-shot Voice Cloning TTS?

It is built and maintained by Qinyuan Cheng (@xiami2019); the current version is v1.0.0.

More Skills