← Back to Skills Marketplace
310
Downloads
0
Stars
1
Active Installs
1
Versions
Install in OpenClaw
/install xunfei-voice-reply
Description
语音回复技能 - 使用讯飞 TTS 生成语音并发送到飞书。当需要用语音回复用户消息时使用。触发词:用语音、语音回复、切换语音模式、语音模式。
Usage Guidance
This skill appears to implement Xunfei TTS correctly, but review these before installing:
- Metadata mismatch: the registry claims no env vars required but SKILL.md and the code require XUNFEI_APP_ID, XUNFEI_API_KEY, and XUNFEI_API_SECRET. Don't proceed unless you're prepared to provide those secrets and trust the skill's author. Use least-privilege credentials and rotate them if possible.
- Workspace edits: the skill expects the agent or you to edit USER.md (to store reply mode) and AGENTS.md (to add the voice flow). Confirm you want those persistent changes in your workspace.
- Dependencies: you must install ffmpeg and the npm 'ws' package; ensure ffmpeg on your system is from a trusted source.
- Shell use: the code uses child_process.execSync with interpolated file paths when calling ffmpeg. Although paths are internal (/tmp/openclaw) and not user-supplied, unquoted interpolation increases risk. Review the code and consider sanitizing/quoting paths if you run untrusted inputs.
If you decide to install: (1) inspect the code yourself or run in an isolated/test environment first, (2) supply Xunfei credentials via a secure mechanism, and (3) limit exposure of those credentials to the minimum required.
Capability Analysis
Type: OpenClaw Skill
Name: xunfei-voice-reply
Version: 1.0.0
The skill provides legitimate text-to-speech (TTS) functionality using the Xunfei WebSocket API and ffmpeg for audio conversion. It follows standard OpenClaw patterns, such as using /tmp/openclaw for temporary files and updating USER.md to persist user preferences (text vs. voice mode). While it uses execSync to call ffmpeg, the arguments are derived from internal paths and configuration values rather than direct user input, and the text to be synthesized is safely base64-encoded before being sent to the API.
Capability Assessment
Purpose & Capability
The code and docs implement a Xunfei TTS-to-Feishu voice reply flow which matches the skill name/description. However the registry metadata lists no required environment variables while SKILL.md and the code clearly expect XUNFEI_APP_ID, XUNFEI_API_KEY, and XUNFEI_API_SECRET — an inconsistency between declared metadata and actual requirements.
Instruction Scope
Runtime instructions direct the agent to generate text, run scripts/voice-reply.js, send media via the platform 'message' tool, and update workplace files (USER.md and AGENTS.md). Reading/writing USER.md and requiring AGENTS.md edits is reasonable for a mode toggle, but it does expand the skill's scope to editing workspace policy/config files outside its own folder — you should be comfortable with that behavior.
Install Mechanism
No remote install/downloads are performed by the skill bundle. It relies on system ffmpeg and the npm 'ws' package (not bundled) which the README instructs to install. This is a low-risk install model, but you must install third-party binaries (ffmpeg) and an npm dependency yourself.
Credentials
The skill legitimately requires Xunfei API credentials (APP_ID / API_KEY / API_SECRET) to function; those are sensitive secrets and their need is proportionate to TTS functionality. The concern is the skill's registry metadata omitted these required env vars (declared as 'none'), causing a coherence problem and potential surprise for users. The skill does not request unrelated credentials or broad system secrets.
Persistence & Privilege
always:false and no special platform privileges — good. The skill knowingly instructs modifications to USER.md and advises adding flow text to AGENTS.md; this is persistent workspace modification (not a hidden persistent agent privilege) and should be OK if you accept those file changes. The skill does not set itself as always-enabled or modify other skills' configs.
How to Use
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat:
/install xunfei-voice-reply - After installation, invoke the skill by name or use
/xunfei-voice-reply - Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
xunfei-voice-reply 1.0.0 初始版本
- 提供讯飞 TTS 语音生成并发送到飞书的语音回复能力
- 支持文字/语音模式切换及模式持久化,含多种触发词
- 配置项包括多种发音人、环境变量覆盖和标准临时音频路径
- 详细的错误处理与依赖提示
- 集成流程示例,需在 AGENTS.md 明确接入语音回复流程
Metadata
Frequently Asked Questions
What is XunFei Voice Reply?
语音回复技能 - 使用讯飞 TTS 生成语音并发送到飞书。当需要用语音回复用户消息时使用。触发词:用语音、语音回复、切换语音模式、语音模式。 It is an AI Agent Skill for Claude Code / OpenClaw, with 310 downloads so far.
How do I install XunFei Voice Reply?
Run "/install xunfei-voice-reply" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.
Is XunFei Voice Reply free?
Yes, XunFei Voice Reply is completely free, licensed under MIT-0. You can download, install and use it at no cost.
Which platforms does XunFei Voice Reply support?
XunFei Voice Reply is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).
Who created XunFei Voice Reply?
It is built and maintained by wglnngt (@wglnngt); the current version is v1.0.0.
More Skills