Jarvis Vocal

by Kishen35 (@kishen35) · v1.0.0 · MIT-0

cross-platform · ✓ Security Clean · 142 downloads

Install in OpenClaw

/install jarvis-vocal

Description

Authentic J.A.R.V.I.S. voice synthesis using Piper TTS with HuggingFace-trained model. Generates movie-accurate voice locally and can push to connected Andro...

README (SKILL.md)

jarvis-vocal

Uses the authentic J.A.R.V.I.S. voice model from HuggingFace (trained on actual movie lines) via Piper TTS. No audio effects needed — the voice is naturally cinematic and British.

Credit: Voice model by jgkawell — see the discussion for details on training and samples.

Usage

Generate a WAV file:

{baseDir}/bin/jarvis-tts "Text to speak" ./output.wav

Stream directly to an Android device (if ADB connected):

{baseDir}/bin/jarvis-tts "Text to speak" - | adb shell 'cat > /sdcard/Download/temp.wav'
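Before pushing a generated file anywhere, it can help to confirm that it really is a WAV container. The helper below is a minimal sketch, not part of the skill: the name is_wav is hypothetical, and it only inspects the 12-byte RIFF/WAVE header rather than validating the audio itself.

```shell
# Return success if the file starts with "RIFF" and carries the "WAVE" tag
# at bytes 8-11. This is a header sanity check only.
is_wav() {
  [ "$(head -c 4 "$1" 2>/dev/null)" = "RIFF" ] &&
  [ "$(head -c 12 "$1" 2>/dev/null | tail -c 4)" = "WAVE" ]
}

# Example use after generating a file:
#   is_wav ./output.wav && echo "output.wav looks like a WAV file"
```

A missing or empty file fails the check too, so the same call also catches a generation step that silently produced nothing.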

Installation

Prerequisites

pipx install piper-tts
sudo apt install ffmpeg  # or equivalent
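A quick way to see which prerequisites are still missing is to probe PATH for each tool. This is a sketch under the assumption that the relevant commands are piper (installed by piper-tts), ffmpeg, hf (used in the model download step) and adb (only needed for the Android push); the function name missing_tools is made up for this example.

```shell
# Print each named command that is not found on PATH, one per line.
missing_tools() {
  for tool in "$@"; do
    command -v "$tool" >/dev/null 2>&1 || printf '%s\n' "$tool"
  done
}

# Tool list taken from the commands used in this README.
missing=$(missing_tools piper ffmpeg hf adb)
if [ -n "$missing" ]; then
  echo "Missing tools:" $missing
else
  echo "All prerequisites found"
fi
```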

Install Voice Model

# Create voice directory
mkdir -p ~/.local/share/piper/voices/en_GB

# Download models via HuggingFace CLI
cd ~/.local/share/piper/voices/en_GB
hf download jgkawell/jarvis en/en_GB/jarvis/high/jarvis-high.onnx --local-dir .
hf download jgkawell/jarvis en/en_GB/jarvis/high/jarvis-high.onnx.json --local-dir .
# Optional: medium quality model
hf download jgkawell/jarvis en/en_GB/jarvis/medium/jarvis-medium.onnx --local-dir .
hf download jgkawell/jarvis en/en_GB/jarvis/medium/jarvis-medium.onnx.json --local-dir .
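Piper expects each .onnx voice to have its .onnx.json config next to it, and hf download keeps the repository's folder layout under --local-dir, so the files may land in a nested subfolder rather than directly in en_GB. The sketch below searches the whole voice directory and reports any model whose config is missing; check_voices is a hypothetical helper name.

```shell
# List every .onnx voice under a directory and report whether its
# matching .onnx.json config file sits next to it.
check_voices() {
  find "$1" -type f -name '*.onnx' | sort | while IFS= read -r model; do
    if [ -f "$model.json" ]; then
      printf 'ok %s\n' "$model"
    else
      printf 'missing-config %s\n' "$model"
    fi
  done
}

# Example use:
#   check_voices ~/.local/share/piper/voices/en_GB
```

If the printed paths are deeper than the wrapper scripts expect, moving each .onnx and .onnx.json pair together keeps them usable.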

Integration

Works with OpenClaw Android nodes via ADB over Tailscale. Use the jarvis-speak wrapper for one-command push and play:

jarvis-speak "Systems at your service, Sir."

Or use streaming mode (faster, ephemeral):

jarvis-speak "Message" --stream
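The source of jarvis-speak is not shown here, so the following is only a guess at what a push-and-play step could look like. It assumes the remote path /sdcard/Download/jarvis-current.wav mentioned under Troubleshooting and assumes playback is triggered with a standard Android VIEW intent; the ADB variable and the push_and_play name are hypothetical. Setting ADB=echo previews the commands without touching a device.

```shell
# ADB can be overridden (for example ADB=echo) to preview the commands.
ADB="${ADB:-adb}"
REMOTE_WAV="/sdcard/Download/jarvis-current.wav"

# Copy a local WAV to the phone, then ask Android to open it with whatever
# app handles audio/wav. Assumed flow, not the real jarvis-speak script.
push_and_play() {
  "$ADB" push "$1" "$REMOTE_WAV" &&
  "$ADB" shell am start -a android.intent.action.VIEW \
    -d "file://$REMOTE_WAV" -t audio/wav
}

# Example use:
#   push_and_play ./output.wav
```

Whether the intent actually starts playback depends on the Android version and the installed media player, so treat it as a starting point for reading the real wrapper.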

Configuration

| Setting | Default | Description |
|---|---|---|
| Model | `jarvis-high` | Voice quality: `high` (114 MB) or `medium` (63 MB) |
| Speed | 1.0 (native) | Piper length-scale; values below 1.0 speed speech up, values above 1.0 slow it down |
| Volume | 1.0 | Post-processing volume boost |

Edit the jarvis-speak script to change these defaults.
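To make the three settings concrete, the sketch below prints the kind of Piper and ffmpeg invocations they could translate into. The JARVIS_* and VOICE_DIR variable names, the flat model path and the intermediate file names are all assumptions; the real script may be organized differently, and flag spellings are worth confirming with piper --help for the installed version.

```shell
# Hypothetical setting names; the real jarvis-speak script may differ.
JARVIS_MODEL="${JARVIS_MODEL:-jarvis-high}"
JARVIS_SPEED="${JARVIS_SPEED:-1.0}"
JARVIS_VOLUME="${JARVIS_VOLUME:-1.0}"
VOICE_DIR="${VOICE_DIR:-$HOME/.local/share/piper/voices/en_GB}"

# Print (not run) the two commands the settings would map to:
# Piper synthesis with a length scale, then an ffmpeg volume filter.
show_commands() {
  printf 'piper --model %s/%s.onnx --length-scale %s --output-file raw.wav\n' \
    "$VOICE_DIR" "$JARVIS_MODEL" "$JARVIS_SPEED"
  printf 'ffmpeg -y -i raw.wav -filter:a volume=%s out.wav\n' "$JARVIS_VOLUME"
}

show_commands
```

Piper reads the text to speak from standard input, so a real run would pipe the sentence into the first command.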

Troubleshooting

"Model not found" → Download models to ~/.local/share/piper/voices/en_GB/jarvis-*

ADB connection refused → Ensure ADB over Wi-Fi is enabled on the phone and that the phone is paired with the host machine (port 5555)

Audio doesn't play → Check Android receives the file at /sdcard/Download/jarvis-current.wav and has a WAV-capable media player
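When diagnosing connection problems, it is useful to know which devices adb considers ready. The filter below is a small sketch that reads the output of adb devices and keeps only entries in the "device" state, skipping ones reported as offline or unauthorized; ready_devices is a hypothetical helper name.

```shell
# Read `adb devices` output on stdin and print the serials of devices
# that are in the "device" (ready) state.
ready_devices() {
  awk 'NF == 2 && $2 == "device" { print $1 }'
}

# Example use:
#   adb devices | ready_devices
```

An empty result with the phone listed as unauthorized usually means the pairing prompt on the phone has not been accepted yet.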

License

MIT — The voice model is MIT licensed by jgkawell.

Credits

Voice model: jgkawell/jarvis on HuggingFace — trained on Marvel movie lines
TTS engine: Piper by Rhasspy
Integration: OpenClaw by Aidan Park

Usage Guidance

This skill appears coherent, but before installing or running anything:

1) Inspect and verify the upstream packages it asks you to install (piper-tts, HuggingFace CLI, ffmpeg) and install them manually via trusted channels.
2) Confirm the HuggingFace model license and provenance: the README claims movie-line training (legal/ethical risk, even if model metadata says MIT).
3) Understand ADB/Tailscale implications: pairing a phone gives the host permission to push files and trigger playback; only pair with devices you trust.
4) Review any wrapper scripts (jarvis-speak/jarvis-tts) before running to ensure they do only generation/push/cleanup and contain no unexpected commands.
5) Because installation is manual, nothing in the skill will automatically exfiltrate credentials, but installed third-party tools will run on your machine; treat them as you would any pip-installed software.
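As a first pass on the script review, a word search can surface lines that deserve a closer read. This is a rough sketch: the function name and the command list are illustrative assumptions, the list is far from complete, and a clean result does not replace reading the script.

```shell
# Print lines (with line numbers) that mention networking, privilege or
# deletion commands as whole words, so they can be read in context.
flag_risky_lines() {
  grep -nwE 'curl|wget|ssh|scp|nc|eval|sudo|rm' "$1"
}

# Example use:
#   flag_risky_lines "$(command -v jarvis-speak)"
```

A hit is not automatically a problem; a wrapper that cleans up its own temporary WAV will legitimately contain rm.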

Capability Analysis

Type: OpenClaw Skill
Name: jarvis-vocal
Version: 1.0.0

The jarvis-vocal skill provides J.A.R.V.I.S. voice synthesis using the Piper TTS engine and a specific HuggingFace-hosted model (jgkawell/jarvis). The instructions in SKILL.md and README.md describe legitimate use cases for generating audio and pushing it to Android devices via ADB. No evidence of data exfiltration, malicious execution, or prompt injection was found; the use of ADB is transparently documented as a requirement for the skill's core functionality.

Capability Assessment

✓ Purpose & Capability

Name/description (J.A.R.V.I.S. voice via Piper TTS and HuggingFace model) match the instructions and package metadata: the SKILL.md and README show how to install Piper, download the model, generate WAVs, and push them to Android devices via ADB/Tailscale. No unrelated binaries, env vars, or config paths are required by the skill itself.

ℹ Instruction Scope

Instructions are narrowly scoped to: installing piper-tts/ffmpeg, using the HuggingFace CLI to download model files into ~/.local/share/piper/voices, generating audio, and optionally streaming/pushing via adb. They do not instruct reading unrelated system files or exporting arbitrary data. Note: using ADB/Tailscale gives device-level access to paired Android devices (expected for the advertised capability); README contains a sample IP which is just illustrative but could be misleading if copied without understanding.

ℹ Install Mechanism

This is an instruction-only skill (no automated install). It tells the user to pipx install piper-tts and to use the HuggingFace CLI and ffmpeg. Those are reasonable for TTS but do require installing third-party software that will run locally—verify the upstream packages (piper-tts, hf CLI) before installing. Because install is manual, the skill itself does not download or execute code automatically.

✓ Credentials

The skill declares no required environment variables or credentials. Operationally, the workflow may prompt for HuggingFace auth if model access requires it and requires an ADB-paired Android device (device pairing grants access to push/play files). There are no unrelated secret requests in the manifest or instructions.

✓ Persistence & Privilege

The skill is not force-included (always: false) and has no install step that persists code automatically. It does not request elevated agent privileges or modify other skills. Note that the platform default allows autonomous invocation; that is expected and not by itself a concern here.

How to Use

Make sure OpenClaw is installed (local or Docker)
Run the install command in chat: /install jarvis-vocal
After installation, invoke the skill by name or use /jarvis-vocal
Provide required inputs per the skill's parameter spec and get structured output

Version History

v1.0.0

Initial release: Authentic J.A.R.V.I.S. voice using Piper TTS with HuggingFace model. Wireless delivery to Android via ADB over Tailscale.

Metadata

Slug jarvis-vocal

Version 1.0.0

License MIT-0

All-time Installs 0

Active Installs 0

Total Versions 1

Frequently Asked Questions

What is Jarvis Vocal?

Authentic J.A.R.V.I.S. voice synthesis using Piper TTS with HuggingFace-trained model. Generates movie-accurate voice locally and can push to connected Andro... It is an AI Agent Skill for Claude Code / OpenClaw, with 142 downloads so far.

How do I install Jarvis Vocal?

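Run "/install jarvis-vocal" in the OpenClaw or Claude Code chat to add the skill. The skill is instruction-only, so piper-tts, ffmpeg, and the voice model still have to be installed manually as described in the README.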

Is Jarvis Vocal free?

Yes, Jarvis Vocal is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Jarvis Vocal support?

Jarvis Vocal is cross-platform and runs anywhere OpenClaw / Claude Code is available.

Who created Jarvis Vocal?

It is built and maintained by Kishen35 (@kishen35); the current version is v1.0.0.
