← Back to Skills Marketplace
xiami2019

SOTA Zero-shot Voice Cloning TTS

by Qinyuan Cheng · GitHub ↗ · v1.0.0
cross-platform ⚠ suspicious
421
Downloads
1
Stars
0
Active Installs
1
Versions
Install in OpenClaw
/install moss-tts
Description
Voice-first OpenClaw skill powered by MOSS APIs. Use when a user wants spoken replies in a preferred timbre, either from an existing voice_id or from a refer...
Usage Guidance
Before installing, confirm the provider and provenance: ask the publisher to (1) update registry metadata to list the required MOSI_API_KEY and MOSI_BASE_URL, (2) provide a homepage or source repo so you can verify the service and privacy policy, and (3) clarify whether 'MOSS' vs 'MOSI' is a typo or different provider. Treat the MOSI_API_KEY like any secret — only grant a key scoped to minimal permissions and avoid reusing high-privilege keys. Understand that using local audio will upload user files to the external service (privacy risk); if that is unacceptable, do not enable local-file uploads. If you proceed, consider restricting the skill's network access or using an API key with limited quota/expiration, and verify logs to ensure keys are never echoed.
Capability Analysis
Type: OpenClaw Skill Name: moss-tts Version: 1.0.0 The skill defines a workflow that involves fetching content from a user-provided `reference_audio` URL and uploading files specified by a `local audio path` to an external API (`https://studio.mosi.cn`). While these actions are necessary for the skill's stated purpose (TTS with custom voices), they introduce potential Server-Side Request Forgery (SSRF) and arbitrary file upload vulnerabilities if the AI agent's underlying execution environment does not properly sanitize or restrict these inputs. The `SKILL.md` file does not contain explicit malicious instructions, but the capabilities themselves are high-risk if exploited due to implementation flaws in the agent.
Capability Assessment
Purpose & Capability
The skill claims to be a 'MOSS' TTS integration but the SKILL.md uses 'MOSI' names (MOSI_API_KEY, MOSI_BASE_URL) and a default host https://studio.mosi.cn; registry metadata lists no required env vars or homepage/source. Asking for an API key and the ability to upload local audio is coherent with a TTS/cloning skill, but the mismatch between the declared registry metadata and the runtime requirements (missing required env var in metadata, no homepage/source) is an incoherence that needs explanation.
Instruction Scope
Instructions are specific: accept text and a voice source, upload local audio via POST /api/v1/files/upload, call voice clone and TTS endpoints, poll for status, decode base64 audio, and return a file path and metadata. These actions are consistent with voice-cloning TTS functionality but include transmitting local audio files to an external service — users should be aware of privacy implications. The SKILL.md does instruct to not log API keys, which is good.
Install Mechanism
No install spec and no code files (instruction-only) — lowest installation risk. Nothing is written to disk by an installer here; runtime network calls and file uploads are the main runtime surface.
Credentials
The SKILL.md requires MOSI_API_KEY (and optionally MOSI_BASE_URL), which is proportionate for an external TTS API. However, the registry metadata claims no required env vars — that discrepancy is a red flag. Also there is no documented owner homepage or source to verify what MOSI_API_KEY grants or what data is retained by the service.
Persistence & Privilege
The skill is not always-enabled and uses normal autonomous invocation defaults. It does not request any system-level persistence or modify other skill configurations (based on SKILL.md).
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install moss-tts
  3. After installation, invoke the skill by name or use /moss-tts
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
Initial release
Metadata
Slug moss-tts
Version 1.0.0
License
All-time Installs 0
Active Installs 0
Total Versions 1
Frequently Asked Questions

What is SOTA Zero-shot Voice Cloning TTS?

Voice-first OpenClaw skill powered by MOSS APIs. Use when a user wants spoken replies in a preferred timbre, either from an existing voice_id or from a refer... It is an AI Agent Skill for Claude Code / OpenClaw, with 421 downloads so far.

How do I install SOTA Zero-shot Voice Cloning TTS?

Run "/install moss-tts" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is SOTA Zero-shot Voice Cloning TTS free?

Yes, SOTA Zero-shot Voice Cloning TTS is completely free (open-source). You can download, install and use it at no cost.

Which platforms does SOTA Zero-shot Voice Cloning TTS support?

SOTA Zero-shot Voice Cloning TTS is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created SOTA Zero-shot Voice Cloning TTS?

It is built and maintained by Qinyuan Cheng (@xiami2019); the current version is v1.0.0.

💬 Comments