guoqiao

MLX TTS

by guoqiao · GitHub ↗ · v0.0.3
darwin ⚠ suspicious
Downloads: 1251
Stars: 0
Active Installs: 3
Versions: 3
Install in OpenClaw
/install mlx-tts
Description
Text-To-Speech with MLX (Apple Silicon) and open-source models (default QWen3-TTS) locally.
README (SKILL.md)

MLX TTS

Text-To-Speech with MLX (Apple Silicon) and open-source models (default QWen3-TTS) locally.

Free and Fast. No API key required. No server required.

Requirements

  • mlx: macOS with Apple Silicon
  • brew: used to install deps if not available

Installation

bash ${baseDir}/install.sh

This script will use brew to install these CLI tools if not available:

  • uv: install python package and run python script
  • mlx_audio: do the real job

Usage

To generate audio from text, run this script:

bash ${baseDir}/mlx-tts.sh "<text>"

Agent Instructions

  1. Run the script: Pass the text to be spoken as an argument.
  2. Handle Output: The script will output the path to an audio file. Use the message tool to send the audio file to the user as a voice message:
{
   "action": "send",
   "filePath": "<filepath>"
}

Example:

User: "Say hello world"

Agent:

  1. Runs bash path/to/mlx-tts.sh "hello world"
  2. Receives output: /tmp/folder/audio.ogg
  3. Calls message(action="send", filePath="/tmp/folder/audio.ogg", ...)
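
The three steps above can be sketched as a small wrapper. This is a minimal illustration, not part of the skill: the helper names json_escape, build_send_payload, and speak_to_payload are hypothetical, and the sketch assumes the script prints the audio path as the last line of its standard output.

```shell
#!/usr/bin/env bash
# Illustrative sketch only; these helper names are hypothetical and are
# not shipped with the skill.

# Escape backslashes and double quotes so a path is safe inside a JSON string.
json_escape() {
  local s=$1
  s=${s//\\/\\\\}
  s=${s//\"/\\\"}
  printf '%s' "$s"
}

# Print the "send" payload for a given audio file path.
build_send_payload() {
  printf '{"action": "send", "filePath": "%s"}\n' "$(json_escape "$1")"
}

# Run a TTS command, treat the last line of its stdout as the audio path
# (an assumption), and emit the payload only if that file exists.
speak_to_payload() {
  local path
  path=$("$@" | tail -n 1) || return 1
  if [ ! -f "$path" ]; then
    echo "no audio file at: $path" >&2
    return 1
  fi
  build_send_payload "$path"
}
```

With the skill installed, a call might look like: speak_to_payload bash "${baseDir}/mlx-tts.sh" "hello world". Checking that the file exists before building the payload avoids sending a message that points at a missing file when generation fails.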
Usage Guidance
This skill likely does what it says (generates local TTS), but it will download and install a third-party tool ('mlx-audio') as a pre-release package, and it downloads models at runtime. Key things to consider before installing:

  1. always:true is set unnecessarily. Disable or remove it unless you understand why the skill must be always-present.
  2. Inspect the sources for the 'uv' tool and the 'mlx-audio' package (where does 'uv' fetch from?), and prefer signed/official releases rather than forced pre-release installs.
  3. Run the install in a sandbox/VM or an isolated macOS user account and review what gets written to disk.
  4. Be prepared for large network and disk usage when models are downloaded.
  5. If you cannot verify the upstream repositories or the package provenance, avoid enabling the skill on machines with sensitive data.

If you want to proceed safely, run install.sh manually, audit installed files, and avoid granting broader privileges.
Capability Analysis
Type: OpenClaw Skill
Name: mlx-tts
Version: 0.0.3

The skill bundle is designed for local Text-To-Speech functionality using MLX on Apple Silicon. The `install.sh` script uses `brew` and `uv` to install necessary dependencies (`ffmpeg`, `uv`, `mlx-audio`), which is standard for setting up a local environment. The `mlx-tts.sh` script processes user text, generates audio using `mlx_audio.tts.generate`, converts it with `ffmpeg`, and outputs the file path. The `SKILL.md` provides clear instructions for the agent to execute these scripts and handle the audio output. There is no evidence of data exfiltration, malicious execution (beyond installing legitimate tools for the stated purpose), persistence mechanisms, prompt injection attempts against the agent, or obfuscation.
Capability Assessment
Purpose & Capability
The name/description (local TTS on Apple Silicon) aligns with the included scripts which call a local 'mlx_audio' tool and ffmpeg. Requiring brew is reasonable. However, the skill installs an external tool ('mlx-audio') and pulls models by name at runtime — this is consistent with the purpose but worth noting because it requires downloading third-party code and potentially large model weights.
Instruction Scope
Runtime instructions are narrowly scoped: run install.sh and mlx-tts.sh which produce a local audio file and return its path. The scripts do not read unrelated system files or request credentials. They do, however, execute commands that fetch and run external code and download models (the model string is passed to the tool), which is within the TTS goal but expands the attack surface.
Install Mechanism
No formal install spec in registry; install.sh uses brew and then runs 'uv tool install --force "mlx-audio" --prerelease=allow'. That command force-installs a pre-release third-party tool from the network without provenance. Installing remote/unverified code and pre-release packages increases risk — the install will write code to disk and enable execution of code not present in the registry.
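
One way to review these steps before anything is written to disk is a dry-run plan. The sketch below is an assumption-laden illustration, not the skill's actual install.sh: the helpers run and plan_install are hypothetical, and the per-tool brew install calls are inferred from this review's description. Only the uv tool install command is quoted from the installer.

```shell
#!/usr/bin/env bash
# Hypothetical dry-run helper for auditing; NOT the skill's real install.sh.

# Print a command instead of executing it unless DRY_RUN=0 is set.
run() {
  if [ "${DRY_RUN:-1}" = "1" ]; then
    printf 'would run: %s\n' "$*"
  else
    "$@"
  fi
}

# List the install steps this review describes: brew-install uv and ffmpeg
# when they are missing (assumed shape), then force-install the pre-release
# mlx-audio tool through uv (command quoted from the review).
plan_install() {
  local tool
  for tool in uv ffmpeg; do
    if ! command -v "$tool" >/dev/null 2>&1; then
      run brew install "$tool"
    fi
  done
  run uv tool install --force "mlx-audio" --prerelease=allow
}
```

Calling plan_install with the default DRY_RUN=1 only prints the planned commands, which makes it easy to compare them against the real script before running anything.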
Credentials
The skill does not request environment variables, credentials, or config paths. It only requires brew (and will install uv and ffmpeg). There are no obvious requests for unrelated secrets or system configuration.
Persistence & Privilege
The skill is marked always:true, meaning it will be force-included in every agent run. There is no clear justification for always:true for a user-invoked TTS utility. Combined with the installer behavior (downloading and installing a third-party tool), this increases the blast radius if the tool or package supply chain is malicious or compromised.
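
A quick way to confirm the flag before enabling the skill is to search its metadata. The helper below is a hypothetical sketch: has_always_true is not part of OpenClaw, and because the exact layout of the SKILL.md metadata is not shown here, it accepts both a YAML-style (always: true) and a JSON-style ("always": true) spelling as an assumption.

```shell
#!/usr/bin/env bash
# Hypothetical audit helper; the metadata layout it matches is an assumption.

# Succeed if the given file sets a key named "always" to true, in either
# YAML style (always: true) or JSON style ("always": true).
has_always_true() {
  grep -Eq '(^|[^[:alnum:]_])"?always"?[[:space:]]*:[[:space:]]*true' "$1"
}
```

For example: has_always_true path/to/SKILL.md && echo "always:true is set". A text search like this is only a first pass; reading the metadata directly remains the reliable check.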
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install mlx-tts
  3. After installation, invoke the skill by name or use /mlx-tts
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v0.0.3
  • Added more usage triggers (e.g. "Say <text>", "Reply with voice message ...") for broader activation.
  • Expanded documentation with agent instructions for handling and sending the generated audio files as voice messages.
  • Updated wording and formatting in documentation for clarity and consistency.
  • Removed the version field from SKILL.md metadata for simplified versioning.
v0.0.2
  • Updated SKILL.md to reflect new usage instructions and correct script name.
  • Added note about initial model download possibly causing slower first run.
  • Updated metadata homepage link.
  • Bumped version to 0.0.2.
v0.0.1
  • Initial release of mlx-tts: local text-to-speech using MLX on Apple Silicon.
  • Utilizes open-source models (default: QWen3-TTS).
  • No API key or external server required.
  • Supports automatic installation of required CLI tools via Homebrew.
  • Provides simple CLI triggers for converting text to audio.
Metadata
Slug mlx-tts
Version 0.0.3
License
All-time Installs 3
Active Installs 3
Total Versions 3
Frequently Asked Questions

What is MLX TTS?

Text-To-Speech with MLX (Apple Silicon) and open-source models (default QWen3-TTS) locally. It is an AI Agent Skill for Claude Code / OpenClaw, with 1251 downloads so far.

How do I install MLX TTS?

Run "/install mlx-tts" in the OpenClaw or Claude Code chat to install it. The skill requires macOS on Apple Silicon and Homebrew; its install.sh script then uses brew to install any missing CLI tools.

Is MLX TTS free?

Yes, MLX TTS is completely free (open-source). You can download, install and use it at no cost.

Which platforms does MLX TTS support?

MLX TTS supports macOS (darwin) only, and it requires Apple Silicon because it runs on MLX.

Who created MLX TTS?

It is built and maintained by guoqiao (@guoqiao); the current version is v0.0.3.
