← Back to Skills Marketplace
kishen35

Jarvis Vocal

by Kishen35 · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ✓ Security Clean
142
Downloads
0
Stars
0
Active Installs
1
Versions
Install in OpenClaw
/install jarvis-vocal
Description
Authentic J.A.R.V.I.S. voice synthesis using Piper TTS with HuggingFace-trained model. Generates movie-accurate voice locally and can push to connected Andro...
README (SKILL.md)

jarvis-vocal

Uses the authentic J.A.R.V.I.S. voice model from HuggingFace (trained on actual movie lines) via Piper TTS. No audio effects needed — the voice is naturally cinematic and British.

Credit: Voice model by jgkawell — see the discussion for details on training and samples.

Usage

Generate a WAV file:

{baseDir}/bin/jarvis-tts "Text to speak" ./output.wav

Stream directly to an Android device (if ADB connected):

{baseDir}/bin/jarvis-tts "Text to speak" - | adb push - /sdcard/Download/temp.wav

Installation

Prerequisites

pipx install piper-tts
sudo apt install ffmpeg  # or equivalent

Install Voice Model

# Create voice directory
mkdir -p ~/.local/share/piper/voices/en_GB

# Download models via HuggingFace CLI
cd ~/.local/share/piper/voices/en_GB
hf download jgkawell/jarvis en/en_GB/jarvis/high/jarvis-high.onnx --local-dir .
hf download jgkawell/jarvis en/en_GB/jarvis/high/jarvis-high.onnx.json --local-dir .
# Optional: medium quality model
hf download jgkawell/jarvis en/en_GB/jarvis/medium/jarvis-medium.onnx --local-dir .
hf download jgkawell/jarvis en/en_GB/jarvis/medium/jarvis-medium.onnx.json --local-dir .

Integration

Works with OpenClaw Android nodes via ADB over Tailscale. Use jarvis-speak wrapper for one-command push+play:

jarvis-speak "Systems at your service, Sir."

Or use streaming mode (faster, ephemeral):

jarvis-speak "Message" --stream

Configuration

Setting Default Description
Model jarvis-high Voice quality: high (114MB) or medium (63MB)
Speed 1.0 (native) Piper length-scale — adjust for faster/slower speech
Volume 1.0 Post-processing volume boost

Edit jarvis-speak script to change defaults.

Troubleshooting

"Model not found" → Download models to ~/.local/share/piper/voices/en_GB/jarvis-*

ADB connection refused → Ensure phone's ADB over WiFi is enabled and paired with laptop (port 5555)

Audio doesn't play → Check Android receives the file at /sdcard/Download/jarvis-current.wav and has a WAV-capable media player

License

MIT — The voice model is MIT licensed by jgkawell.

Credits

  • Voice model: jgkawell/jarvis on HuggingFace — trained on Marvel movie lines
  • TTS engine: Piper by Rhasspy
  • Integration: OpenClaw by Aidan Park
Usage Guidance
This skill appears coherent, but before installing or running anything: 1) Inspect and verify the upstream packages it asks you to install (piper-tts, HuggingFace CLI, ffmpeg) and install them manually via trusted channels. 2) Confirm the HuggingFace model license and provenance — README claims movie-line training (legal/ethical risk, even if model metadata says MIT). 3) Understand ADB/Tailscale implications — pairing a phone gives the host permission to push files and trigger playback; only pair with devices you trust. 4) Review any wrapper scripts (jarvis-speak/jarvis-tts) before running to ensure they do only generation/push/cleanup and contain no unexpected commands. 5) Because installation is manual, nothing in the skill will automatically exfiltrate credentials, but installed third-party tools will run on your machine—treat them as you would any pip-installed software.
Capability Analysis
Type: OpenClaw Skill Name: jarvis-vocal Version: 1.0.0 The jarvis-vocal skill provides J.A.R.V.I.S. voice synthesis using the Piper TTS engine and a specific HuggingFace-hosted model (jgkawell/jarvis). The instructions in SKILL.md and README.md describe legitimate use cases for generating audio and pushing it to Android devices via ADB. No evidence of data exfiltration, malicious execution, or prompt injection was found; the use of ADB is transparently documented as a requirement for the skill's core functionality.
Capability Assessment
Purpose & Capability
Name/description (J.A.R.V.I.S. voice via Piper TTS and HuggingFace model) match the instructions and package metadata: the SKILL.md and README show how to install Piper, download the model, generate WAVs, and push them to Android devices via ADB/Tailscale. No unrelated binaries, env vars, or config paths are required by the skill itself.
Instruction Scope
Instructions are narrowly scoped to: installing piper-tts/ffmpeg, using the HuggingFace CLI to download model files into ~/.local/share/piper/voices, generating audio, and optionally streaming/pushing via adb. They do not instruct reading unrelated system files or exporting arbitrary data. Note: using ADB/Tailscale gives device-level access to paired Android devices (expected for the advertised capability); README contains a sample IP which is just illustrative but could be misleading if copied without understanding.
Install Mechanism
This is an instruction-only skill (no automated install). It tells the user to pipx install piper-tts and to use the HuggingFace CLI and ffmpeg. Those are reasonable for TTS but do require installing third-party software that will run locally—verify the upstream packages (piper-tts, hf CLI) before installing. Because install is manual, the skill itself does not download or execute code automatically.
Credentials
The skill declares no required environment variables or credentials. Operationally, the workflow may prompt for HuggingFace auth if model access requires it and requires an ADB-paired Android device (device pairing grants access to push/play files). There are no unrelated secret requests in the manifest or instructions.
Persistence & Privilege
The skill is not force-included (always: false) and has no install step that persists code automatically. It does not request elevated agent privileges or modify other skills. Note that the platform default allows autonomous invocation; that is expected and not by itself a concern here.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install jarvis-vocal
  3. After installation, invoke the skill by name or use /jarvis-vocal
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
Initial release: Authentic J.A.R.V.I.S. voice using Piper TTS with HuggingFace model. Wireless delivery to Android via ADB over Tailscale.
Metadata
Slug jarvis-vocal
Version 1.0.0
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 1
Frequently Asked Questions

What is Jarvis Vocal?

Authentic J.A.R.V.I.S. voice synthesis using Piper TTS with HuggingFace-trained model. Generates movie-accurate voice locally and can push to connected Andro... It is an AI Agent Skill for Claude Code / OpenClaw, with 142 downloads so far.

How do I install Jarvis Vocal?

Run "/install jarvis-vocal" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Jarvis Vocal free?

Yes, Jarvis Vocal is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Jarvis Vocal support?

Jarvis Vocal is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Jarvis Vocal?

It is built and maintained by Kishen35 (@kishen35); the current version is v1.0.0.

💬 Comments