yangqibin-caibi

OmniVoice

Author: Yang Qibin · GitHub · v1.0.0 · MIT-0
cross-platform ⚠ suspicious
Total downloads: 121
Favorites: 0
Current installs: 0
Versions: 1
Install in OpenClaw
/install omnivoice
Description
All-in-one voice identity toolkit: speaker identification, voice library management, voice cloning, and speech-to-text. The only OpenClaw skill with speaker...
Safe-use recommendations
Key things to consider before installing/using OmniVoice:
- Metadata mismatch: The registry metadata claims no required env vars, but the skill requires SF_API_KEY for the SiliconFlow cloning API and FEISHU_APP_ID/FEISHU_APP_SECRET to send messages to Feishu. Confirm you are comfortable providing those secrets and update metadata expectations.
- Privacy risk: Voice cloning sends reference audio (base64 or a remote URL) to https://api.siliconflow.cn. Any audio you provide (including recordings of other people) will be transmitted to that third party. Do not upload recordings you do not have permission to share. Review SiliconFlow's privacy/TOS before use.
- Manual code edits: The documentation instructs you to register speakers by editing SPEAKER_MAP in scripts/voice_identify.py. This means the workflow relies on modifying source files; consider instead keeping references in a separate metadata file to avoid altering shipped code, or be aware that the skill expects write access to its own files.
- Resource use: The speaker-identification model downloads ~360MB to /tmp on first run and requires CPU/GPU resources; ensure your runtime environment has sufficient disk and compute.
- Feishu integration: The provided shell script will exchange your FEISHU_APP_ID/SECRET for a tenant token and upload audio. Limit credential scope for the app and confirm you trust the destination Feishu tenant.
- Operational safety: If you need to evaluate the skill, run it in an isolated/sandboxed environment, inspect network traffic to confirm where audio is uploaded, and avoid giving production credentials until you trust the behavior.
If you want, I can: (1) list the exact environment variables and commands you must run to test the skill safely in a sandbox, (2) suggest a safer workflow that avoids editing code (store speaker metadata in TOOLS.md and read it at runtime), or (3) help craft a minimal wrapper that blocks external uploads for local-only testing.
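The recommendation to keep speaker references in a separate metadata file, rather than editing SPEAKER_MAP in the shipped script, could look roughly like the sketch below. It is not taken from the skill itself: the function names, the JSON layout, and the file name speakers.json are all hypothetical, and how scripts/voice_identify.py would consume the mapping is left open.

```python
import json
import tempfile
from pathlib import Path


def load_speaker_map(path: Path) -> dict[str, str]:
    """Read a speaker-name -> reference-audio mapping from a JSON file.

    Returns an empty mapping when the file does not exist yet.
    """
    if not path.exists():
        return {}
    with path.open(encoding="utf-8") as fh:
        data = json.load(fh)
    if not isinstance(data, dict):
        raise ValueError(f"{path} must contain a JSON object")
    return {str(name): str(ref) for name, ref in data.items()}


def register_speaker(path: Path, name: str, reference_audio: str) -> dict[str, str]:
    """Add or update one speaker and write the mapping back to disk."""
    speakers = load_speaker_map(path)
    speakers[name] = reference_audio
    path.write_text(
        json.dumps(speakers, ensure_ascii=False, indent=2), encoding="utf-8"
    )
    return speakers


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        registry = Path(tmp) / "speakers.json"  # hypothetical file name
        register_speaker(registry, "alice", "voice-refs/alice.wav")
        print(load_speaker_map(registry))
```

Keeping the registry as data means the skill's own files never need write access, and a speaker can be added or removed without touching code.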
Functional analysis
Type: OpenClaw Skill
Name: omnivoice
Version: 1.0.0
The 'omnivoice' skill is a comprehensive voice processing toolkit providing speaker identification, voice cloning, and transcription. It utilizes legitimate third-party services (SiliconFlow API for cloning and Feishu API for messaging) and standard AI libraries (Whisper, Transformers). The included scripts (voice_identify.py, cosyvoice_clone.py, and feishu_send_audio.sh) are transparently written to perform their stated functions, and there is no evidence of malicious intent, data exfiltration, or unauthorized system access.
Capability assessment
Purpose & Capability
Functionality (speaker ID, library management, cloning, Feishu delivery) matches the included scripts. However the registry metadata claims no required environment variables while the SKILL.md and scripts require SF_API_KEY for SiliconFlow and FEISHU_APP_ID/FEISHU_APP_SECRET for sending audio. That omission is an incoherence in declared purpose/requirements: the skill legitimately needs those secrets for cloning and Feishu sending, but the package metadata does not declare them.
Instruction Scope
SKILL.md stays largely within the stated domain (local voice refs, transcribe, identify, clone). Two points to watch: (1) it instructs manual edits to SPEAKER_MAP inside scripts/voice_identify.py to register speakers (i.e., modify the skill's code to add speakers), which is unusual and grants the agent or user permission to change shipped code; (2) voice cloning sends reference audio (possibly private) to an external API (SiliconFlow) which is necessary for cloning but is a privacy/exfiltration risk. The skill also downloads a ~360MB model to /tmp on first run (resource/disk considerations).
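Since the model download lands in /tmp on first run, a quick free-space check before launching the skill may save a failed download. The following is a minimal sketch using only the standard library; the ~360MB figure comes from the review above, while the 1.5x safety margin and the function name are assumptions made for illustration.

```python
import shutil
import tempfile

MODEL_SIZE_BYTES = 360 * 1024 * 1024  # ~360MB, as stated in the review
SAFETY_MARGIN = 1.5  # assumed headroom for partial/temporary files


def has_room_for_model(
    directory: str,
    required_bytes: int = MODEL_SIZE_BYTES,
    margin: float = SAFETY_MARGIN,
) -> bool:
    """Return True when `directory` has at least required_bytes * margin free."""
    free_bytes = shutil.disk_usage(directory).free
    return free_bytes >= required_bytes * margin


if __name__ == "__main__":
    target = tempfile.gettempdir()  # typically /tmp on Linux
    status = "enough" if has_room_for_model(target) else "NOT enough"
    print(f"{target}: {status} free space for the speaker-identification model")
```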
Install Mechanism
This is instruction-only (no automated install spec). Dependencies are standard for the tasks (whisper/transformers/librosa/ffmpeg). No installers or external arbitrary downloads beyond model weights from HuggingFace (expected for UniSpeech-SAT).
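Because nothing is installed automatically, it can help to confirm the listed dependencies are present before the first run. The sketch below checks for them without importing anything heavy. The import names (`whisper`, `transformers`, `librosa`) are assumed to match the package names mentioned above, and the helper name is hypothetical.

```python
import importlib.util
import shutil

# Assumed import names for the Python dependencies named in the review.
PYTHON_MODULES = ("whisper", "transformers", "librosa")
# External command-line tools expected on PATH.
BINARIES = ("ffmpeg",)


def missing_dependencies(modules=PYTHON_MODULES, binaries=BINARIES) -> list[str]:
    """Return the names of modules and binaries that cannot be found."""
    missing = [m for m in modules if importlib.util.find_spec(m) is None]
    missing += [b for b in binaries if shutil.which(b) is None]
    return missing


if __name__ == "__main__":
    absent = missing_dependencies()
    if absent:
        print("Missing dependencies:", ", ".join(absent))
    else:
        print("All listed dependencies were found.")
```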
Credentials
The skill requires SF_API_KEY (SiliconFlow) and Feishu credentials (FEISHU_APP_ID and FEISHU_APP_SECRET) according to the SKILL.md and scripts, but the registry metadata lists no required env vars. Requiring third-party API keys is proportionate to voice cloning and Feishu message sending, but the metadata omission is misleading. Also, sending reference audio to an external service (SiliconFlow) means sensitive audio data will leave your environment, so evaluate both the need for SF_API_KEY and the endpoint it is sent to before use.
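Since the registry metadata does not declare these variables, a small preflight check can make the requirement explicit before any script runs. This is a sketch only: the variable names come from the review above, while the grouping by feature and the function name are assumptions. It reports which names are unset and never prints their values.

```python
import os

# Environment variables named in the review, grouped by the feature needing them.
REQUIRED_ENV = {
    "voice cloning (SiliconFlow)": ("SF_API_KEY",),
    "Feishu delivery": ("FEISHU_APP_ID", "FEISHU_APP_SECRET"),
}


def missing_env(environ=None) -> dict[str, list[str]]:
    """Map each feature to the variables it needs that are unset or empty."""
    env = os.environ if environ is None else environ
    report: dict[str, list[str]] = {}
    for feature, names in REQUIRED_ENV.items():
        absent = [name for name in names if not env.get(name)]
        if absent:
            report[feature] = absent
    return report


if __name__ == "__main__":
    for feature, names in missing_env().items():
        print(f"{feature}: set {', '.join(names)} before use")
```

Grouping by feature lets someone who only needs local identification or transcription skip the credentials they have no use for.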
Persistence & Privilege
always:false and no OS restrictions — the skill does not request permanent, universal inclusion. It will write files into workspace directories (voice-refs/, TOOLS.md) and may modify its own SPEAKER_MAP if the user follows the instructions; these are local operations and not system-wide privilege escalations. No indication it modifies other skills or global agent config.
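How to use
  1. Make sure OpenClaw is installed (locally or via Docker).
  2. Enter the install command in the chat box: /install omnivoice
  3. Once installation finishes, invoke the skill by name or trigger it with /omnivoice
  4. Provide the required inputs as described in the skill's parameter documentation to get structured output.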
Version history
v1.0.0
- Initial release of OmniVoice: an all-in-one voice identity toolkit.
- Provides ten operations including speaker identification, voice library management, voice cloning, speech-to-text, and persona voice replies.
- Unique feature: speaker identification, which recognizes WHO is speaking, not just what they said.
- Supports audio transcription, voice swap, and CRUD operations on a voice library.
- Works with English, Chinese, and Japanese input; compatible with Feishu message sending.
Metadata
Slug: omnivoice
Version: 1.0.0
License: MIT-0
Total installs: 0
Current installs: 0
Versions: 1
FAQ

What is OmniVoice?

All-in-one voice identity toolkit: speaker identification, voice library management, voice cloning, and speech-to-text. The only OpenClaw skill with speaker... It is an AI Agent Skill plugin for Claude Code / OpenClaw, with 121 total downloads to date.

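How do I install OmniVoice?

Run the command "/install omnivoice" in the OpenClaw or Claude Code chat box to install it in one step. The install itself needs no extra configuration, although the capability assessment above notes that voice cloning requires SF_API_KEY and Feishu delivery requires FEISHU_APP_ID and FEISHU_APP_SECRET.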

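Is OmniVoice free?

Yes. OmniVoice is completely free and released under the MIT-0 license, so it can be freely downloaded, installed, and used.

Which platforms does OmniVoice support?

OmniVoice is cross-platform and can be used in any environment where OpenClaw / Claude Code is deployed.

Who developed OmniVoice?

It is developed and maintained by Yang Qibin (@yangqibin-caibi); the current version is v1.0.0.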
