Feishu Voice Bubble Generator (飞书语音气泡生成)
by shihuaijun777 · v1.0.1 · MIT-0
98 downloads · 0 stars · 0 active installs · 2 versions
Install in OpenClaw
/install feishu-voice-skill-v2
Description
Generates speech with edge-tts and sends it to Feishu as a voice bubble. Supports multiple voices, including XiaoxiaoNeural.
README (SKILL.md)
feishu-voice-skill
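Generates Chinese speech with edge-tts and sends the audio file to a specified user as a Feishu voice bubble message.
Features
- 🎙️ Uses the Microsoft Edge TTS engine, with multiple Chinese voices available
- 🔄 Automatically converts the audio to opus (the format Feishu voice bubbles require)
- 📤 Sends the voice bubble message directly, with no manual upload
- ⚙️ Supports a custom voice and target user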
Supported voices
| Voice name | Description |
|---|---|
| zh-CN-XiaoxiaoNeural | Xiaoxiao (female, default) |
| zh-CN-YunxiNeural | Yunxi (male) |
| zh-CN-YunyangNeural | Yunyang (male, news style) |
| zh-CN-XiaoyiNeural | Xiaoyi (female) |
| zh-CN-liaoning-XiaobaiNeural | Liaoning Xiaobai (female) |
| zh-CN-shaanxi-XiaoniNeural | Shaanxi Xiaoni (female) |
Requirements
- Python 3.8+
- edge-tts package
- requests package
- ffmpeg (must be on PATH)
Install dependencies:
pip install edge-tts requests
Usage
python skill.py --text "你好" --voice "zh-CN-XiaoxiaoNeural"
Parameters
| Parameter | Required | Default | Description |
|---|---|---|---|
| --text | Yes | - | Text to convert to speech |
| --voice | No | zh-CN-XiaoxiaoNeural | Voice name |
| --open-id | No | ou_********************************** | Target user's open_id |
| --app-id | No | cli_************ | Feishu App ID |
| --app-secret | No | (built-in default) | Feishu App Secret |
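The parameter table maps naturally onto an `argparse` parser. The following is a minimal sketch of what such a parser could look like; it is not taken from `skill.py`, and `build_parser` is a hypothetical helper name. Because the published defaults for `--open-id`, `--app-id`, and `--app-secret` are masked, the sketch leaves them unset rather than guessing.

```python
import argparse

DEFAULT_VOICE = "zh-CN-XiaoxiaoNeural"  # default voice from the parameter table


def build_parser() -> argparse.ArgumentParser:
    """Hypothetical parser mirroring the documented options."""
    parser = argparse.ArgumentParser(
        description="Send text to Feishu as a voice bubble"
    )
    parser.add_argument("--text", required=True, help="Text to convert to speech")
    parser.add_argument("--voice", default=DEFAULT_VOICE, help="edge-tts voice name")
    # Assumption: the real defaults are masked in the README, so this
    # sketch uses None instead of inventing values.
    parser.add_argument("--open-id", default=None, help="Target user's open_id")
    parser.add_argument("--app-id", default=None, help="Feishu App ID")
    parser.add_argument("--app-secret", default=None, help="Feishu App Secret")
    return parser


# Parse an explicit argument list, as if the script had been called
# with: --text "你好"
args = build_parser().parse_args(["--text", "你好"])
print(args.voice)
```

Only `--text` is required, so the call above succeeds and falls back to the default voice.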
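Workflow
- Call edge-tts to convert the text to mp3 audio
- Use ffmpeg to convert the mp3 to opus (48 kHz, 64 kbps)
- Obtain a Feishu tenant_access_token
- Upload the audio file to Feishu to get a file_key
- Send the voice bubble message to the target user
- Clean up temporary files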
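The local parts of this workflow can be sketched roughly as follows. This is an illustration under assumptions, not the skill's actual code: the function names are hypothetical, the ffmpeg flags are one common way to get 48 kHz / 64 kbps opus, and the message body reflects my understanding of Feishu's audio message format, which should be checked against the official API documentation. The token and upload requests are omitted because they need live credentials.

```python
import json
import subprocess
from typing import Dict, List


async def synthesize(text: str, voice: str, mp3_path: str) -> None:
    """Step 1: text -> mp3 via edge-tts (third-party, imported lazily)."""
    import edge_tts

    await edge_tts.Communicate(text, voice).save(mp3_path)


def build_ffmpeg_cmd(mp3_path: str, opus_path: str) -> List[str]:
    """Step 2: command line for mp3 -> opus at 48 kHz / 64 kbps."""
    return [
        "ffmpeg", "-y",      # overwrite the output file if it exists
        "-i", mp3_path,      # input mp3 produced by edge-tts
        "-c:a", "libopus",   # requires an ffmpeg build with libopus
        "-ar", "48000",      # 48 kHz sample rate
        "-b:a", "64k",       # 64 kbps audio bitrate
        opus_path,
    ]


def convert_to_opus(mp3_path: str, opus_path: str) -> None:
    """Run ffmpeg and raise CalledProcessError if the conversion fails."""
    subprocess.run(build_ffmpeg_cmd(mp3_path, opus_path),
                   check=True, capture_output=True)


def build_audio_message(open_id: str, file_key: str) -> Dict[str, str]:
    """Step 5: request body for the voice bubble message.

    Assumption: the file_key is carried as a JSON-encoded string in
    `content`, with `msg_type` set to "audio".
    """
    return {
        "receive_id": open_id,
        "msg_type": "audio",
        "content": json.dumps({"file_key": file_key}),
    }
```

Keeping the command and payload construction in pure functions makes them easy to inspect before any file is converted or any message is sent. The coroutine would be driven with something like `asyncio.run(synthesize(text, voice, "out.mp3"))`.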
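Notes
- Temporary audio files are cleaned up automatically after sending
- Keeping each input to 300 characters or fewer is recommended
- Make sure ffmpeg is installed and can be called from the command line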
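These notes can be turned into a quick preflight check before running the skill. The sketch below is an assumption-laden illustration (all function names are hypothetical): it looks for `ffmpeg` on PATH, scans the output of `ffmpeg -encoders` for a libopus entry, and checks the suggested 300-character limit.

```python
import shutil
import subprocess

MAX_CHARS = 300  # suggested upper bound per request, from the notes


def has_libopus(encoders_output: str) -> bool:
    """True if any line of `ffmpeg -encoders` output names the libopus encoder."""
    return any("libopus" in line.split() for line in encoders_output.splitlines())


def check_ffmpeg() -> bool:
    """True if ffmpeg is on PATH and reports a libopus encoder."""
    exe = shutil.which("ffmpeg")
    if exe is None:
        return False
    result = subprocess.run(
        [exe, "-hide_banner", "-encoders"],
        capture_output=True, text=True, check=False,
    )
    return has_libopus(result.stdout)


def text_within_limit(text: str, limit: int = MAX_CHARS) -> bool:
    """True if the text length does not exceed the suggested limit."""
    return len(text) <= limit
```

Calling `check_ffmpeg()` and `text_within_limit(text)` before synthesis lets a wrapper fail early with a clear message instead of partway through the upload.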
Usage Guidance
This skill appears to implement TTS → ffmpeg → Feishu upload correctly, but there are a few red flags to consider before installing or running it:
- The script forcibly attempts to kill common browser processes (taskkill for chrome/msedge/firefox/iexplore). This behavior is not documented in SKILL.md, is unnecessary for normal operation, and can disrupt users. Inspect or remove the close_browser_processes() call if you don't want browsers terminated.
- The script accepts App Secret via a CLI argument (and has a placeholder default). Passing secrets on the command line can expose them to other local users via process listings. Prefer using environment variables or a secure secret store and modify the script to read from them instead of CLI args.
- Ensure the Feishu App ID/Secret you provide have the minimum required permissions (message upload/send) and are not shared broadly. The script uploads files and sends messages using your tenant token — a compromised credential could be abused.
- Confirm that the ffmpeg build installed on your system includes libopus support, which the script requires.
If you are comfortable reviewing/removing the browser-kill code and handling secrets securely (or running the script in a controlled environment), the skill's behavior is coherent with its description. If you cannot inspect or modify the code, treat it as risky to run on a personal or multi-user machine.
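One way to act on the secret-handling advice is to read the credentials from environment variables instead of CLI arguments. This is a minimal sketch, not part of the skill: the variable names `FEISHU_APP_ID` and `FEISHU_APP_SECRET` and the helper `load_credentials` are assumptions chosen for illustration.

```python
import os
from typing import Mapping, Tuple


def load_credentials(env: Mapping[str, str] = os.environ) -> Tuple[str, str]:
    """Return (app_id, app_secret) from the environment.

    Assumption: FEISHU_APP_ID and FEISHU_APP_SECRET are hypothetical
    variable names; the published skill does not define them.
    """
    app_id = env.get("FEISHU_APP_ID", "")
    app_secret = env.get("FEISHU_APP_SECRET", "")
    if not app_id or not app_secret:
        raise RuntimeError(
            "Set FEISHU_APP_ID and FEISHU_APP_SECRET in the environment"
        )
    return app_id, app_secret
```

Values read this way do not appear in the command line shown by process listings, which is the exposure the guidance warns about.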
Capability Analysis
Type: OpenClaw Skill
Name: feishu-voice-skill-v2
Version: 1.0.1
The skill contains an undocumented and highly irregular function `close_browser_processes` in `skill.py` that attempts to forcefully terminate all running instances of Chrome, Edge, Firefox, and Internet Explorer using the `taskkill` command. This behavior is entirely unrelated to the stated purpose of Text-to-Speech generation and Feishu integration, and it could lead to data loss or disruption of the user's environment without warning.
Capability Assessment
Purpose & Capability
Name/description match the code and instructions: uses edge-tts, ffmpeg, and Feishu APIs. Required binary (python3) and Python dependencies are appropriate for the stated functionality.
Instruction Scope
The runtime instructions and code limit themselves to generating audio, converting it, uploading to Feishu, and cleaning temp files — which is expected — but the code also forcibly attempts to kill browser processes (taskkill for chrome/msedge/firefox/iexplore). Closing user browsers is not documented in SKILL.md, is unnecessary for normal TTS/upload flow, and is intrusive (and platform-specific).
Install Mechanism
No install spec (instruction-only for dependencies). This is low risk; pip/ffmpeg instructions are normal. Nothing is downloaded from untrusted URLs or written permanently to system locations by an installer.
Credentials
No required environment variables declared. The tool accepts app_id and app_secret as CLI args (defaults in code are placeholders). Passing secrets on the command line can expose them via process listings; the skill does not document or require safer secret handling (env files, stdin). The number of credentials requested is proportionate to the Feishu integration, but secret-handling is weak.
Persistence & Privilege
Skill does not request permanent presence (always:false) and does not modify other skills or global config. It only writes temporary files which it cleans up by default.
How to Use
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat: /install feishu-voice-skill-v2
- After installation, invoke the skill by name or use /feishu-voice-skill-v2
- Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.1
- Masked the default values of the Feishu user open_id and app_id with asterisks to improve security
- No other changes
v1.0.0
Initial release with voice-to-Feishu message functionality:
- Generates Chinese speech using Microsoft Edge TTS with multiple available voices.
- Converts audio to opus format for Feishu compatibility.
- Sends voice messages as Feishu voice bubbles to specified users directly.
- Allows customization of voice, recipient, and app configuration via command-line.
- Automates cleanup of temporary audio files after message delivery.
Frequently Asked Questions
What is 飞书语音气泡生成 (Feishu Voice Bubble Generator)?
It generates speech with edge-tts and sends it to Feishu as a voice bubble, with support for multiple voices including XiaoxiaoNeural. It is an AI Agent Skill for Claude Code / OpenClaw, with 98 downloads so far.
How do I install it?
Run "/install feishu-voice-skill-v2" in the OpenClaw or Claude Code chat. The skill's own dependencies (edge-tts, requests, ffmpeg) still need to be installed as described in the README.
Is it free?
Yes. It is licensed under MIT-0 and can be downloaded, installed, and used at no cost.
Which platforms does it support?
The listing describes it as cross-platform, running anywhere OpenClaw / Claude Code is available. Note that the `taskkill` call flagged in the capability analysis is a Windows command.
Who created it?
It is built and maintained by shihuaijun777 (@shihuaijun777); the current version is v1.0.1.