← Back to Skills Marketplace
aurora2035

xeon_asr

by aurora2035 · GitHub ↗ · v0.1.4 · MIT-0
cross-platform ⚠ suspicious
488
Downloads
0
Stars
0
Active Installs
11
Versions
Install in OpenClaw
/install xeonasr
Description
Automatically converts received voice messages to text via an external ASR service, supporting multiple audio formats and integrating with OpenClaw.
Usage Guidance
Do not install this blindly. Key things to consider before proceeding: (1) The package is mislabeled — it claims to be an ASR skill but installs a persistent TTS service that modifies ~/.openclaw/openclaw.json and enables systemd services. If you expected ASR/transcription, do not install. (2) The installer sets HF_ENDPOINT=https://hf-mirror.com (an undocumented external mirror) — if you must install, inspect or remove that setting and prefer official Hugging Face endpoints. (3) Run install.sh with --skip-start and review generated config.json and tts_config.json before starting services. (4) Inspect xdp-tts-service package source (pip target) and any model repos before allowing automatic model downloads. (5) Consider running the installer in a sandbox/container or a throwaway VM first, or only install if you explicitly want a local TTS service that will run continuously. (6) If you do install, keep backups of your OpenClaw config (install script does a backup but verify) and review the created systemd user unit files before enabling them.
Capability Analysis
Type: OpenClaw Skill Name: xeonasr Version: 0.1.4 The skill bundle implements a local TTS and voice cloning service but exhibits high-risk behaviors during its installation and configuration phases. Specifically, `setup_env.sh` attempts to install system-level dependencies using `sudo apt-get/yum`, and `configure_openclaw_integration.sh` programmatically modifies the core `openclaw.json` configuration file. While these actions are plausibly necessary for the stated purpose of setting up a local OpenVINO inference environment, the requirement for elevated privileges and the automated tampering with global configuration files represent a significant attack surface. No clear evidence of intentional malice, such as data exfiltration or backdoors, was found.
Capability Assessment
Purpose & Capability
Metadata at the top says 'xeon_asr' and describes ASR/transcription, but the files, SKILL.md, README, and server.js implement a TTS (xeontts) service and explicitly say it must NOT handle ASR. The registry metadata (slug owner, _meta.json) and file names all point to xeontts/xeon-tts. This is a fundamental mislabeling/incoherence: an ASR-labelled skill that actually installs a TTS service.
Instruction Scope
SKILL.md and scripts instruct the agent to modify the user's OpenClaw config (~/.openclaw/openclaw.json), create backups, write a channels.qqbot.xeonTts block, create runtime/output/reference dirs, and start two services (Flask on 5002 and Node on 9002). The code also invokes external commands (child_process execFile imported in server.js, used for media probing like ffprobe). These instructions go beyond local, ephemeral actions and change user config and run persistent services — appropriate for a TTS skill but inconsistent with the ASR description.
Install Mechanism
There is no formal install spec in registry metadata (instruction-only), but the bundle includes install.sh/setup_env.sh that download and install Miniconda, create a Python venv, pip-install a package (xdp-tts-service), run npm install, and register and enable user-level systemd services. Most sources are standard (repo.anaconda.com, PyPI), but setup_env.sh unconditionally sets HF_ENDPOINT=https://hf-mirror.com (an external mirror) which is not documented in SKILL.md/README — pointing model downloads to an unexpected mirror is a risk.
Credentials
Registry metadata declared no required environment variables, but the scripts rely on many optional env vars (BASE_MODEL_PATH, CUSTOM_MODEL_PATH, BASE_MODEL_REPO, CUSTOM_MODEL_REPO, XDP_TTS_PIP_SPEC, BASE_CHECKPOINT_PATH) and setup_env.sh exports HF_ENDPOINT and HF_HUB_ENABLE_HF_TRANSFER. The unadvertised HF_ENDPOINT redirect to https://hf-mirror.com is unexpected and not justified in the top-level metadata or SKILL.md. No secrets are requested, but environment behavior differs from declared requirements.
Persistence & Privilege
The installer registers and enables two systemd user services, starts background processes, and modifies the user's OpenClaw configuration file (backing it up first). While not 'always: true', this gives the skill persistent presence and automatic restart on login — reasonable for a local long-running TTS service but a significant privilege and should be explicit to the user. It doesn't alter other skills' credentials, but it does modify the OpenClaw config file.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install xeonasr
  3. After installation, invoke the skill by name or use /xeonasr
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v0.1.4
**xeonasr 0.1.4 Changelog** - Replaced ASR (speech-to-text) service scripts with new TTS (text-to-speech) service scripts and configuration files. - Added: `start_tts_service.sh`, `stop_tts.sh`, and `tts_config.example.json`. - Removed: `start_asr.sh`, `start_asr_service.sh`, and `stop_asr.sh`. - Updated documentation to reflect the transition from ASR (Qwen3-ASR) to TTS (Qwen3-TTS) with new usage instructions, service ports, and workflow details. - Ensured TTS and ASR services do not overlap or conflict in configuration.
v1.1.1
xeonasr 1.1.1 - No file changes detected in this version. - Documentation and configuration workflow remain unchanged and consistent.
v1.1.0
xeonasr v1.1.0 - 增加了 install_systemd_services.sh 和 self_check.sh 脚本,便于服务自检与管理。 - 改进安装流程:不再假设固定用户目录,路径默认以 $HOME 变量为准。 - OpenClaw 配置与补丁流程优化,自动为全局 audio 配置和 Feishu 多副本场景打补丁。 - 更新文档,更加清晰地描述标准流程与依赖,明确分离本地运行内容与仓库追踪内容。
v1.0.9
xeonasr 1.0.9 - Added three new files: .clawhub/origin.json, _meta.json, and configure_openclaw_integration.sh. - Improved OpenClaw integration: automatic configuration for both QQBot and Feishu STT channels. - Setup now supports automatic model download and detection, with improved instructions for manual downloads. - Installation script now auto-starts services and restarts gateway after health checks. - Documentation updated to include Feishu Bot support and clarify model path usage.
v1.0.6
update readme
v1.0.5
xeonasr 1.0.5 Changelog - Rewrote documentation to clarify dual-service architecture and setup flow. - Simplified and reordered installation instructions for improved usability. - Detailed configuration steps for QQ Bot speech-to-text (STT) integration. - Added health check and management commands for both service ports. - Included troubleshooting section for common issues.
v1.0.4
update xdp service
v1.0.3
- Added shell scripts for environment setup, starting, and stopping the ASR service: setup_env.sh, start_all.sh, start_asr.sh, start_asr_service.sh, and stop_asr.sh. - No changes to SKILL.md or existing documentation. - Enhances usability by providing script-based service management.
v1.0.2
- 新增“智能启动”说明:运行 npm start 会检测配置,缺失时提示用户手动确认后再执行安装脚本。 - 强调配置缺失时不会自动执行安装,需用户输入 y 确认。 - 其余内容保持不变,仅补充和澄清启动与安装体验。
v1.0.1
- Added install.sh script for one-click setup. - Updated documentation for automatic installation and configuration. - Integrated xdp-audio-service for local ASR. - Now recommends HuggingFace for Qwen3-ASR model download. - Simplified setup process and clarified usage steps.
v1.0.0
Initial release of xeonasr: ASR Voice-to-Text skill - Automatically transcribes incoming voice messages to text using an external ASR service (e.g., Flask + Whisper). - Listens on a specified port to receive audio files and returns recognized text to OpenClaw. - Easy installation via ClawHub or manual setup. - Supports multiple audio formats: .silk, .slk, .amr, .wav, .mp3, .ogg, .pcm. - Simple configuration with customizable ASR model and service endpoint. - Designed for seamless integration with QQ channels and OpenClaw sessions.
Metadata
Slug xeonasr
Version 0.1.4
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 11
Frequently Asked Questions

What is xeon_asr?

Automatically converts received voice messages to text via an external ASR service, supporting multiple audio formats and integrating with OpenClaw. It is an AI Agent Skill for Claude Code / OpenClaw, with 488 downloads so far.

How do I install xeon_asr?

Run "/install xeonasr" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is xeon_asr free?

Yes, xeon_asr is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does xeon_asr support?

xeon_asr is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created xeon_asr?

It is built and maintained by aurora2035 (@aurora2035); the current version is v0.1.4.

💬 Comments