voice-output

Name: voice-output
Author: olddirtybikertony

by OlddirtybikerTony · GitHub ↗ · v1.0.1 · MIT-0

cross-platform ⚠ suspicious

131 Downloads

Install in OpenClaw

/install voice-output

Description

Use when Tony says voice reply or asks to speak. Speaks the response aloud via Doubao TTS to MOMAX BS6.

Usage Guidance

This skill looks like a legitimate TTS helper but has a few issues you should consider before installing:

- Hard-coded credential: The script contains an ACCESS_TOKEN string and APPID embedded in source. That token could be abused by whoever has it, may consume someone else's quota, or may be invalid. Ask the author to remove embedded secrets and require the user to provide their own token (e.g., via an environment variable) before use. If you install it, treat the embedded token as untrusted and consider rotating your own service credentials if you test it.
- Data sent to external service: All text passed to this skill is transmitted to Doubao/ByteDance's TTS endpoint. Do not send sensitive, private, or regulated data through this skill unless you control the service account and have reviewed privacy/compliance.
- Path and OS assumptions: SKILL.md uses an absolute user path (/Users/tony/...) and the code uses macOS 'afplay'. If you are not on macOS or your skill workspace is elsewhere, update the invocation instructions and the player command.
- Recommended actions before installing:
  (1) Request that the maintainer replace the hard-coded ACCESS_TOKEN with a requirement to set an env var (and update SKILL.md).
  (2) Verify who owns the embedded token (if you must use it temporarily) and whether it should be rotated.
  (3) Run the script in a sandbox or isolated account first and confirm it only sends the expected TTS requests and deletes temp files.
  (4) If you need cross-platform playback, update the code to detect the OS and choose an appropriate player.

If the maintainer cannot or will not remove the embedded token and document how credentials are set, treat this package as risky and prefer a version that requires user-supplied credentials.
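Recommended action (4) could be approached roughly as follows. This is a minimal sketch, not the skill's actual code: only 'afplay' is confirmed by this listing, while the Linux and Windows candidates (mpg123, ffplay) and the function name pick_player are assumptions chosen for illustration.

```python
import platform
import shutil

# Candidate player commands per OS. Only "afplay" comes from the listing;
# the mpg123 and ffplay entries are illustrative assumptions.
_CANDIDATES = {
    "Darwin": [["afplay"]],
    "Linux": [
        ["mpg123", "-q"],
        ["ffplay", "-nodisp", "-autoexit", "-loglevel", "quiet"],
    ],
    "Windows": [["ffplay", "-nodisp", "-autoexit", "-loglevel", "quiet"]],
}


def pick_player(system=None, which=shutil.which):
    """Return the first installed player command for this OS, or None."""
    system = system or platform.system()
    for cmd in _CANDIDATES.get(system, []):
        # `which` returns a path if the executable is on PATH, else None.
        if which(cmd[0]):
            return list(cmd)
    return None
```

The returned list can be extended with the audio file path and passed to subprocess.run as an argument list, which keeps playback free of shell execution. A None result lets the caller fail with a clear message instead of a missing-command error.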

Capability Analysis

Type: OpenClaw Skill
Name: voice-output
Version: 1.0.1

The skill provides text-to-speech functionality by integrating with the Doubao (ByteDance) TTS API and using the macOS 'afplay' utility for audio playback. The implementation in 'voice_speak.py' is functionally sound, employing a PID-based lock file to prevent overlapping audio and safely handling subprocess calls without shell execution. While the bundle contains hardcoded API credentials (APPID and ACCESS_TOKEN) and a user-specific directory path in 'SKILL.md', these appear to be artifacts of a local development environment rather than intentional vulnerabilities or malicious features. The data flow is restricted to the stated purpose of converting text to audio via a legitimate third-party service.
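For readers unfamiliar with the PID-based lock pattern mentioned here, the sketch below shows one common way to implement it. The listing does not reproduce voice_speak.py, so this is not the author's code: the lock path and the function names (acquire_lock, release_lock, pid_alive) are hypothetical, and the liveness check assumes a POSIX system.

```python
import os
import tempfile

# Hypothetical lock location; the listing only says the lock lives under /tmp.
LOCK_PATH = os.path.join(tempfile.gettempdir(), "voice_output_example.lock")


def pid_alive(pid):
    """POSIX only: signal 0 checks for existence without sending a signal."""
    try:
        os.kill(pid, 0)
    except ProcessLookupError:
        return False
    except PermissionError:
        return True  # process exists but belongs to another user
    return True


def _create(path):
    # O_EXCL makes creation atomic: it fails if the file already exists.
    return os.open(path, os.O_CREAT | os.O_EXCL | os.O_WRONLY, 0o644)


def acquire_lock(path=LOCK_PATH):
    """Return True if we now hold the lock, False if a live process holds it."""
    try:
        fd = _create(path)
    except FileExistsError:
        try:
            with open(path) as f:
                owner = int(f.read().strip())
        except (OSError, ValueError):
            owner = None  # unreadable or corrupt lock file: treat as stale
        if owner is not None and pid_alive(owner):
            return False
        try:
            os.remove(path)  # stale lock left by a dead process
        except FileNotFoundError:
            pass
        try:
            fd = _create(path)
        except FileExistsError:
            return False  # another process won the race
    with os.fdopen(fd, "w") as f:
        f.write(str(os.getpid()))
    return True


def release_lock(path=LOCK_PATH):
    try:
        os.remove(path)
    except FileNotFoundError:
        pass
```

A caller would typically wrap playback in try/finally so that release_lock runs even if the player exits with an error.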

Capability Assessment

⚠ Purpose & Capability

The skill's stated purpose (speak responses via Doubao TTS to MOMAX BS6) matches the included code and docs. However, the code embeds an ACCESS_TOKEN and APPID inside the script instead of declaring/asking for credentials via environment variables as the SKILL.md/metadata imply. Embedding an external service token in distributed code is disproportionate and unexpected.

⚠ Instruction Scope

SKILL.md instructs calling a hard-coded user-specific path (/Users/tony/.openclaw/...), and the script assumes macOS 'afplay'. The runtime runs network calls to https://openspeech.bytedance.com/api/v1/tts and writes temporary mp3 files and a /tmp lock file. Those operations are consistent with TTS but the absolute path and OS-specific player are brittle and reveal a path- and OS-specific assumption that is not declared in metadata.
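When sandbox-testing, one thing worth confirming is that temporary audio files are removed even if playback fails. A minimal sketch of that cleanup pattern follows; the helper name temp_audio is hypothetical and the skill's own handling may differ.

```python
import os
import tempfile
from contextlib import contextmanager


@contextmanager
def temp_audio(data, suffix=".mp3"):
    """Write audio bytes to a temp file, yield its path, always delete it."""
    fd, path = tempfile.mkstemp(suffix=suffix)
    try:
        with os.fdopen(fd, "wb") as f:
            f.write(data)
        yield path
    finally:
        # Runs on normal exit and on exceptions raised inside the with-block.
        try:
            os.remove(path)
        except FileNotFoundError:
            pass
```

Using tempfile.mkstemp instead of a fixed file name also avoids collisions between concurrent runs and keeps the location independent of any user-specific path.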

✓ Install Mechanism

This is an instruction-only skill with a bundled Python script and no install specification. No additional packages are pulled or arbitrary download URLs used, which keeps install risk low.

⚠ Credentials

The package requests no environment variables, yet the script contains a hard-coded ACCESS_TOKEN (token string present) and APPID. This is inconsistent and risky: the token in the package could be leaked, abused, expired, or belong to someone else. Also, SKILL.md mentions Doubao TTS credentials but does not instruct the user to supply or rotate an ACCESS_TOKEN via env vars.
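A user-supplied credential flow might look like the sketch below. The variable names DOUBAO_APPID and DOUBAO_ACCESS_TOKEN and the function load_credentials are assumptions for illustration; the skill does not currently define them.

```python
import os

# Hypothetical environment variable names; not defined by the skill today.
REQUIRED_VARS = ("DOUBAO_APPID", "DOUBAO_ACCESS_TOKEN")


def load_credentials(env=os.environ):
    """Return (appid, access_token) from the environment or raise clearly."""
    missing = [name for name in REQUIRED_VARS if not env.get(name)]
    if missing:
        raise RuntimeError(
            "Missing required environment variables: " + ", ".join(missing)
        )
    return env["DOUBAO_APPID"], env["DOUBAO_ACCESS_TOKEN"]
```

Failing fast with the names of the missing variables keeps secrets out of the distributed source and tells each user exactly what they need to set.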

✓ Persistence & Privilege

The skill does not request permanent system presence (always=false), does not modify other skills' configs, and only creates temporary files and a lock file under /tmp. No elevated privileges or persistent system-wide changes are requested.

How to Use

Make sure OpenClaw is installed (local or Docker)
Run the install command in chat: /install voice-output
After installation, invoke the skill by name or use /voice-output
Provide required inputs per the skill's parameter spec and get structured output

Version History

v1.0.1

Fixed YAML frontmatter format, registered to openclaw-managed

v1.0.0

voice-output skill v1.0.0

- Adds voice reply capability using Doubao TTS, played aloud via afplay to MOMAX BS6.
- Triggers on specific phrases (e.g., "语音回复", "voice reply") or explicit requests from Tony; automatically suppresses if Tony asks for text-only.
- Outputs both a detailed written response and a shorter, conversational spoken version.
- Spoken content uses natural, summarized speech rather than reading the written reply verbatim.
- Integrates with Doubao TTS 2.0 API and uses a default, natural-sounding female voice.

Metadata

Slug voice-output

Version 1.0.1

License MIT-0

All-time Installs 0

Active Installs 0

Total Versions 2

Frequently Asked Questions

What is voice-output?

Use when Tony says voice reply or asks to speak. Speaks the response aloud via Doubao TTS to MOMAX BS6. It is an AI Agent Skill for Claude Code / OpenClaw, with 131 downloads so far.

How do I install voice-output?

Run "/install voice-output" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is voice-output free?

Yes, voice-output is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does voice-output support?

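voice-output is listed as cross-platform and runs anywhere OpenClaw / Claude Code is available. Note, however, that the bundled script plays audio with the macOS 'afplay' utility, so playback on other systems requires changing the player command.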

Who created voice-output?

It is built and maintained by OlddirtybikerTony (@olddirtybikertony); the current version is v1.0.1.
