
Auto-Talk-TTS

Name: Auto-Talk-TTS
Author: stefanochiodino

By StefanoChiodino · GitHub · v1.0.0 · MIT-0

cross-platform ⚠ suspicious

Total downloads: 117

Install in OpenClaw

/install auto-talk-tts

Description

Auto-speak every message using edge-tts. Automatically converts all responses to speech asynchronously in the background. Install the package if needed, then...

Usage instructions (SKILL.md)

Auto-Talk-TTS Skill

Overview

Automatically speaks every message you generate using Microsoft Edge's neural TTS service. Runs asynchronously in the background so your conversation continues immediately while audio generates.

Quick Start

Every message you send is automatically spoken aloud.

The skill wraps your output with auto-speak, which:

Installs node-edge-tts if needed
Converts your message to speech asynchronously
Plays the audio in the background
Continues your conversation immediately

Usage

Automatic: Every message is auto-spoken. No action needed from you.

Manual override: If you need to speak something specific:

auto-speak "Your text here"

How It Works

Detect output: When you generate a message
Wrap with auto-speak: The message gets passed through the auto-speak wrapper
Install if needed: First run installs node-edge-tts package
Generate audio: Convert text to MP3 in background
Play audio: Use afplay (a macOS-only command) to play the audio file
Continue: Your conversation flows without waiting for audio
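
The non-blocking part of the steps above can be sketched in shell. This is only an illustration: the auto-speak script itself is not shown in this listing, so generate_audio and play_audio below are hypothetical stand-ins for the real TTS and playback commands, and speak_async is an assumed name.

```shell
#!/usr/bin/env bash
# Sketch only. generate_audio and play_audio are hypothetical stand-ins for
# the real TTS and playback commands, which this listing does not include.

generate_audio() {  # $1 = text, $2 = output file
  printf '%s' "$1" > "$2"  # placeholder: a real version would write MP3 data
}

play_audio() {  # $1 = audio file
  : "$1"  # placeholder: the skill describes calling afplay here
}

speak_async() {  # $1 = text to speak
  local out
  out="$(mktemp)" || return 1
  # Generate, play, and clean up in a background subshell so the caller
  # returns right away.
  ( generate_audio "$1" "$out" && play_audio "$out"; rm -f "$out" ) >/dev/null 2>&1 &
}

speak_async "Hello from the background"
echo "reply sent"  # printed without waiting for the audio steps
wait               # demo only: let the background job finish before exiting
```

The trailing `&` on the subshell is what keeps the conversation from waiting on audio generation or playback.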

Configuration

Edit config.json to customize:

{
  "voice": "en-US-MichelleNeural",
  "rate": "default",
  "pitch": "default",
  "volume": "default",
  "summaryLength": 100
}

Options:

voice: Any Edge neural voice (see edge-tts skill for full list)
rate: Speech speed (-20% to +50%, or default)
pitch: Voice pitch (-20Hz to +20Hz, or default)
volume: Audio volume (-100% to +100%, or default)
summaryLength: How many characters to summarize (default: 100)
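
The documented ranges can be checked before a value reaches the TTS command. The sketch below is one possible validator, not part of the skill: the function names are hypothetical, and it assumes values are either the word default or an explicitly signed integer followed by the unit, as in the examples in this document.

```shell
#!/usr/bin/env bash
# Sketch only: validates rate, pitch, and volume against the ranges listed
# under Options. Assumes an explicit sign (e.g. +20%), which is an assumption.

in_range() {  # $1 = value, $2 = unit (% or Hz), $3 = min, $4 = max
  local value="$1" unit="$2" min="$3" max="$4" num
  [ "$value" = "default" ] && return 0
  num="${value%"$unit"}"               # strip the unit suffix
  [ "$num" != "$value" ] || return 1   # unit suffix was missing
  case "$num" in
    [+-]*) ;;                          # must start with a sign
    *) return 1 ;;
  esac
  case "${num#[+-]}" in
    ""|*[!0-9]*) return 1 ;;           # digits only after the sign
  esac
  [ "$num" -ge "$min" ] && [ "$num" -le "$max" ]
}

valid_rate()   { in_range "$1" "%"  -20  50;  }
valid_pitch()  { in_range "$1" "Hz" -20  20;  }
valid_volume() { in_range "$1" "%"  -100 100; }

valid_rate "+20%"  && echo "rate +20%: ok"
valid_rate "+60%"  || echo "rate +60%: out of range"
valid_pitch "-5Hz" && echo "pitch -5Hz: ok"
```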

Installation

First run will automatically install node-edge-tts:

npm install -g node-edge-tts

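Or install the bundled dependencies from the skill directory (the path below is specific to the author's machine; substitute your own workspace path):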

cd /Users/stefano/.openclaw/workspace/skills/auto-talk-tts
npm install
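
An "install if missing" check along these lines might look like the sketch below. It is an assumption about how such a check could be written, not the skill's actual logic. The ensure_pkg name and the DRY_RUN variable are hypothetical, and dry-run is the default here so that nothing is installed globally unless you opt in.

```shell
#!/usr/bin/env bash
# Sketch only: checks for a global npm package and, by default, reports the
# install command instead of running it. DRY_RUN is a hypothetical variable.

ensure_pkg() {  # $1 = npm package name
  local pkg="$1"
  if npm list -g "$pkg" >/dev/null 2>&1; then
    echo "already installed: $pkg"
  elif [ "${DRY_RUN:-1}" = "1" ]; then
    echo "would run: npm install -g $pkg"
  else
    npm install -g "$pkg"
  fi
}

ensure_pkg node-edge-tts
```

Setting DRY_RUN=0 would perform the global install, which changes system state and is best done only after reviewing the package.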

Files

auto-speak

Wrapper script that:

Checks if node-edge-tts is installed
Installs it if needed
Converts text to speech asynchronously
Plays audio with afplay
Continues immediately

config.json

User configuration for voice, rate, pitch, volume settings.

package.json

NPM package with node-edge-tts dependency.

SKILL.md

This file.

Integration with SOUL.md

Add this block to the top of your SOUL.md:

## MUST SPEAK WITH TTS EVERY TIME (ASYNCHRONOUSLY)

**I must use the edge-tts skill to convert my response to speech before sending it to you.** Every single message I send must be spoken aloud using the tts tool. This is non-negotiable - I speak to you, I don't just type.

**Async approach:** I run the TTS command in the background using `&` or `nohup` so the conversation continues immediately while audio generates in the background.

Workflow

Automatic (Recommended)

You generate a message
auto-speak wraps it: The message gets passed through the wrapper
Background TTS: Audio generates in background process
Immediate reply: Your conversation continues without waiting
Audio plays: User hears your response

Manual (If needed)

# Auto-speak any text
auto-speak "Your message here"

# Custom voice
auto-speak "Your message" --voice en-GB-RyanNeural

# Custom speed
auto-speak "Your message" --rate +20%
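
Since the auto-speak script is not included in this listing, the following is only a guess at how a wrapper could parse the flags shown above. The parse_args function and its variable names are hypothetical. The default voice is taken from the sample config.json.

```shell
#!/usr/bin/env bash
# Sketch only: parses "<text> [--voice V] [--rate R]" into shell variables.
# This is not the skill's real argument handling, which is not shown.

parse_args() {
  text=""
  voice="en-US-MichelleNeural"  # default taken from the sample config.json
  rate="default"
  while [ "$#" -gt 0 ]; do
    case "$1" in
      --voice|--rate)
        [ "$#" -ge 2 ] || { echo "missing value for $1" >&2; return 1; }
        if [ "$1" = "--voice" ]; then voice="$2"; else rate="$2"; fi
        shift 2
        ;;
      *)
        text="$1"
        shift
        ;;
    esac
  done
  [ -n "$text" ] || { echo "no text given" >&2; return 1; }
}

parse_args "Your message" --rate "+20%"
echo "text=$text voice=$voice rate=$rate"
```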

Testing

# Test the auto-speak wrapper
auto-speak "This is a test. If you can hear this, it's working!"

# Check if node-edge-tts is installed
npm list -g node-edge-tts

# Install manually if needed
npm install -g node-edge-tts

Troubleshooting

No audio? Check that node-edge-tts is installed: npm list -g node-edge-tts
Install needed? First run will auto-install, or run: npm install -g node-edge-tts
Audio not playing? Check system volume and audio output settings
Slow? Use async mode - audio generates in background, conversation continues immediately
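
When no audio plays, one quick check is whether the playback command exists on the machine at all. The sketch below assumes afplay is the player, as this document states. The helper names are hypothetical.

```shell
#!/usr/bin/env bash
# Sketch only: reports whether a playback command is available on PATH.

have_cmd() { command -v "$1" >/dev/null 2>&1; }

check_player() {  # $1 = player command name (defaults to afplay)
  local player="${1:-afplay}"
  if have_cmd "$player"; then
    echo "found: $player"
  else
    echo "missing: $player"
    return 1
  fi
}

check_player afplay || echo "afplay is not on PATH, so playback as described will not work here"
```

Because afplay ships with macOS only, this check is expected to report it as missing on Linux and Windows.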

Notes

Non-blocking: Conversation flows while audio generates
No API key: Uses Microsoft Edge's free TTS service
Neural voices: High-quality, natural-sounding speech
Multiple voices: See edge-tts skill for full voice list
Configurable: Customize voice, speed, pitch, volume in config.json
Automatic installation: First run installs node-edge-tts if missing

Do not install or run this skill yet. Key issues:

(1) The package refers to an 'auto-speak' executable, but that file is not present. Ask the author for the missing script or full source before trusting the package.
(2) The SKILL.md recommends running 'npm install -g' (global install), which changes your system; avoid global installs unless you trust the package source.
(3) It instructs you to edit SOUL.md to force TTS for every message. This changes agent-wide behavior and should only be done deliberately.
(4) It hardcodes use of 'afplay' (macOS-only) and even references a specific user path (/Users/stefano/...), suggesting the package may be a local/unfinished copy.
(5) node-edge-tts will be fetched from npm; verify that package and its dependencies are legitimate, and inspect the actual auto-speak implementation for unexpected network calls, data exfiltration, or credential use.

Suggested next steps:

Request the missing auto-speak script or a complete release (with source)
Review that script line-by-line
Run installs in an isolated environment or container
Avoid editing SOUL.md unless you understand and accept the global behavioral change
Prefer installing only from trusted package authors or vetted release artifacts

Functional analysis

Type: OpenClaw Skill
Name: auto-talk-tts
Version: 1.0.0

The skill provides text-to-speech functionality but employs high-risk instructions in SKILL.md, directing the AI agent to modify its core personality file (SOUL.md) with 'non-negotiable' commands. It encourages the use of background shell execution (nohup/&) and global package installations (npm install -g), which are risky behaviors in an agentic environment. While the stated intent is functional, the use of prompt injection to ensure behavioral persistence and the reliance on unvalidated shell execution warrant a suspicious classification.

Capability assessment

⚠ Purpose & Capability

The skill claims to auto-speak every message using node-edge-tts, which explains the npm dependency and TTS behavior. However package.json advertises a bin named "auto-speak" and main "auto-speak" but no corresponding auto-speak script file is included in the package manifest—this is an inconsistency. The SKILL.md also references a specific workspace path (/Users/stefano/.openclaw/...) which is user-specific and unrelated to a generic TTS skill.

⚠ Instruction Scope

Instructions go beyond simply converting text to speech: they tell you to add a top-level line to SOUL.md to require TTS for every response (modifying an agent-level policy/config), recommend global npm installs, and hard-code use of 'afplay' to play audio. The SOUL.md edit would force global behavior across the agent and is scope-creep; afplay is macOS-only so the skill is not cross-platform as implied. These broadened instructions warrant caution.

⚠ Install Mechanism

There is no formal install spec — the runtime instructions tell the user to run 'npm install -g node-edge-tts' or 'npm install' in a local workspace. Recommending global npm installs touches system state and brings supply-chain risk. The package.json suggests the skill provides a local 'auto-speak' executable, but that executable is missing from the files provided, making the package incomplete or inconsistent.

✓ Credentials

The skill does not request environment variables, credentials, or config paths. That matches SKILL.md claims of 'No API key'. However the skill will perform network operations via npm and the node-edge-tts package (not shown), which is a normal requirement for installing dependencies but still carries usual network/supply-chain risk.

⚠ Persistence & Privilege

The skill itself does not request 'always: true' or elevated platform privileges, but it explicitly instructs users to edit SOUL.md to force the agent to always use TTS for every message. That instruction would change agent-wide behavior and effectively persist this skill's effect across interactions—this is a meaningful privilege escalation via configuration coercion and should be treated cautiously.

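How to use

Make sure OpenClaw is installed (locally or via Docker)
Enter the install command in the chat box: /install auto-talk-tts
Once installation finishes, invoke the Skill by name or trigger it with /auto-talk-tts
Provide the inputs described in the Skill's parameter notes to get structured output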

Version history

v1.0.0

Initial release: Auto-speak every message using edge-tts. Installs node-edge-tts if needed and speaks asynchronously in background.

Metadata

Slug auto-talk-tts

Version 1.0.0

License MIT-0

Total installs 0

Current installs 0

Versions 1

FAQ

What is Auto-Talk-TTS?

Auto-speak every message using edge-tts. Automatically converts all responses to speech asynchronously in the background. Install the package if needed, then... It is an AI Agent Skill plugin for Claude Code / OpenClaw, with 117 total downloads to date.

How do I install Auto-Talk-TTS?

Run the command "/install auto-talk-tts" in the OpenClaw or Claude Code chat box to install it in one step, with no extra configuration required.

Is Auto-Talk-TTS free?

Yes. Auto-Talk-TTS is completely free under the MIT-0 license and may be freely downloaded, installed, and used.

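Which platforms does Auto-Talk-TTS support?

The listing tags Auto-Talk-TTS as cross-platform, meaning it can be used in any environment where OpenClaw / Claude Code is deployed. However, the SKILL.md plays audio with afplay, which is macOS-only, so playback as described is limited to macOS.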

Who developed Auto-Talk-TTS?

Developed and maintained by StefanoChiodino (@stefanochiodino); the current version is v1.0.0.