← Back to Skills Marketplace
pengling9405

Telegram Multilingual Voice Reply

by 0x1 · GitHub ↗ · v0.1.0
cross-platform ⚠ suspicious
642
Downloads
0
Stars
0
Active Installs
1
Versions
Install in OpenClaw
/install telegram-multilingual-voice-reply
Description
Smart Telegram reply workflow for OpenClaw: if the user sends text, reply with text; if the user sends a voice note/audio, transcribe locally using the insta...
Usage Guidance
This skill appears internally consistent and implements local ASR/TTS as described. Before installing or running it, make sure you: (1) install mlx_audio and ffmpeg from trusted sources and be aware those installs will likely download large model files (network + disk usage); (2) run the scripts in an environment you control (container/venv) because they will execute local binaries and create temporary files; (3) avoid passing paths to sensitive files as the scripts operate on file paths you provide; (4) note the scripts call external CLIs found on PATH or ~/.local/bin — ensure those executables are the genuine tools you intend to use to prevent accidental execution of a replaced/malicious binary. If you need the agent to run in a more restricted environment, prefer isolating execution or denying network/model downloads.
Capability Analysis
Type: OpenClaw Skill Name: telegram-multilingual-voice-reply Version: 0.1.0 The `SKILL.md` file instructs the AI agent to execute `python3` scripts (`scripts/mlx_asr.py` and `scripts/mlx_tts_voice.py`) with arguments directly derived from user input (e.g., audio file paths, language, generated reply text). This creates a significant shell injection vulnerability if the OpenClaw agent does not rigorously sanitize or escape user-controlled input when constructing and executing these shell commands. While the Python scripts themselves use `subprocess.run` with lists (generally safer), the risk lies in how the agent constructs the initial command string, which could lead to arbitrary code execution on the host system.
Capability Assessment
Purpose & Capability
The name/description (Telegram multilingual voice reply) align with the included artifacts: SKILL.md describes local ASR/TTS behavior and the repository includes two helper scripts (mlx_asr.py and mlx_tts_voice.py) that implement those functions using mlx_audio and ffmpeg. The models named (Qwen3-ASR, Qwen3-TTS) are consistent with the stated approach.
Instruction Scope
SKILL.md explicitly limits actions to: transcribing audio with the provided scripts, generating TTS, and sending a Telegram voice message with a caption that matches the voice. The instructions do not ask the agent to read unrelated files, access secrets, or call unknown external endpoints. The scripts operate on audio files provided to them and create temporary files as expected.
Install Mechanism
There is no install spec (instruction-only), which reduces risk. However the scripts depend on external tooling (mlx_audio CLI or importable mlx_audio package and ffmpeg). Installing or running those tools may pull large model files from upstream (HuggingFace/other), which implies network activity and substantial disk use; this is expected for local ASR/TTS but worth noting.
Credentials
The skill requests no environment variables or credentials. The scripts look for a fallback CLI under the user's home (~/.local/bin) and use temporary files — both are reasonable for this use case. They do not read arbitrary config paths or secret env vars.
Persistence & Privilege
The skill is not always-on and does not modify other skills or system-wide config. It runs as-invoked and does not request elevated persistence or cross-skill access.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install telegram-multilingual-voice-reply
  3. After installation, invoke the skill by name or use /telegram-multilingual-voice-reply
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v0.1.0
Voice-first Telegram smart replies with local MLX-Audio STT, multilingual follow, and single-message voice+caption output.
Metadata
Slug telegram-multilingual-voice-reply
Version 0.1.0
License
All-time Installs 0
Active Installs 0
Total Versions 1
Frequently Asked Questions

What is Telegram Multilingual Voice Reply?

Smart Telegram reply workflow for OpenClaw: if the user sends text, reply with text; if the user sends a voice note/audio, transcribe locally using the insta... It is an AI Agent Skill for Claude Code / OpenClaw, with 642 downloads so far.

How do I install Telegram Multilingual Voice Reply?

Run "/install telegram-multilingual-voice-reply" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Telegram Multilingual Voice Reply free?

Yes, Telegram Multilingual Voice Reply is completely free (open-source). You can download, install and use it at no cost.

Which platforms does Telegram Multilingual Voice Reply support?

Telegram Multilingual Voice Reply is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Telegram Multilingual Voice Reply?

It is built and maintained by 0x1 (@pengling9405); the current version is v0.1.0.

💬 Comments