← 返回 Skills 市场

Audio Cog

Name: Audio Cog
Author: nitishgargiitd

作者 CellCog · GitHub ↗ · v1.0.12 · MIT-0

darwinlinuxwindows ✓ 安全检测通过

5929

总下载

当前安装

版本数

在 OpenClaw 中安装

/install audio-cog

功能描述

AI audio generation and text-to-speech powered by CellCog. Voiceover, narration, voice cloning, avatar voices, sound effects, music, podcasts, dialogue. Thre...

安全使用建议

Install this skill only if you trust CellCog and are comfortable sending prompts and audio-generation requests to its service. Use a controlled API key, monitor usage, avoid submitting secrets, and use cloned voices only with consent and clear disclosure.

功能分析

Type: OpenClaw Skill Name: audio-cog Version: 1.0.12 The audio-cog skill provides documentation and usage instructions for an AI audio generation service via the CellCog SDK. The SKILL.md file outlines legitimate features such as text-to-speech, voice cloning, and music generation using providers like OpenAI and ElevenLabs. There are no indicators of malicious intent, data exfiltration, or harmful prompt injection; the instructions are strictly aligned with the stated purpose of professional audio production.

能力评估

ℹ Purpose & Capability

The stated purpose and documented capabilities align: AI narration, sound effects, music, and cloned/avatar voices. Voice cloning is disclosed, but users should treat it as a sensitive capability.

ℹ Instruction Scope

The skill provides SDK usage snippets and delegates full operational details to the separate CellCog skill/service. This is purpose-aligned, but users should understand what data is sent and how tasks are managed.

ℹ Install Mechanism

There is no install script or code file, but the SKILL.md declares a CellCog dependency. Users should verify the CellCog package/skill source if their environment installs or invokes it.

ℹ Credentials

The CELLCOG_API_KEY requirement is expected for a CellCog API integration, and the artifacts do not show hardcoded keys or credential leakage.

ℹ Persistence & Privilege

The OpenClaw example uses a fire-and-forget remote chat task, so jobs may continue asynchronously after invocation; this is disclosed and consistent with media-generation workflows.

如何使用

确保已安装 OpenClaw（本地或 Docker 部署）
在对话框中输入安装命令：/install audio-cog
安装完成后，直接呼叫该 Skill 的名称或使用 /audio-cog 触发
根据 Skill 的参数说明提供必要输入，即可获得结构化输出

版本历史

v1.0.12

- Added environment requirements for the skill: now specifies needed binaries (python3) and environment variables (CELLCOG_API_KEY) in metadata. - No user-facing feature or functionality changes; documentation in SKILL.md updated to reflect setup prerequisites.

v1.0.11

- Updated documentation for clarity and accuracy in SKILL.md - Improved and expanded description, highlighting support for podcasts and dialogue - Clarified agent usage: now specifies “all agents except OpenClaw” for blocking chat integration example - Refined instructions and language throughout for easier onboarding and provider selection - No code or functional changes—documentation update only

v1.0.10

- Improved documentation in SKILL.md with more concise descriptions and clearer usage instructions. - Expanded SDK usage code sample to explicitly show client initialization. - Updated skill description to highlight avatar voices and music generation up to 10 minutes. - Enhanced formatting and clarified steps for using voice providers, avatars, sound effects, and music features.

v1.0.9

- Simplified and clarified the description to emphasize major features and use cases. - Reorganized usage instructions for easier onboarding, highlighting SDK references up front. - Added new "If CellCog is not installed" section with agent-specific installation guidance. - Streamlined sections for voice providers and capabilities; removed duplicative wording. - Made instructions for different agent types (OpenClaw, Cursor, etc.) more concise and highlighted code output. - Shortened and focused provider feature explanations for improved readability.

v1.0.8

- Expanded SKILL.md with detailed guidance on provider selection (OpenAI, ElevenLabs, MiniMax) and their strengths. - Added new tables outlining voice provider scenarios, voice options, customization tips, and emotion tag usage. - Provided explicit examples for avatar/cloned voices and usage of custom avatars for personalized narration. - Enhanced sections on sound effects and music generation, including sample prompts and best practice tips. - Clarified multi-language support and practical agent usage for audio generation. - Included a new "Tips for Better Audio" section to help users optimize results with the skill.

v1.0.7

- Expanded description and usage details for TTS, music, SFX, and podcast production features - Clarified role of supported providers: OpenAI, ElevenLabs, and MiniMax, with highlights for multi-voice and avatar voices - Documented new features: multi-voice dialogue, podcast pipeline, 160+ voices, and output in MP3/WAV - Updated related skills section for audio, music, podcast, and video generation - Refined and reorganized documentation for easier discovery of key capabilities

v1.0.6

audio-cog 1.0.6 - Added explicit Python code examples for agent-based and blocking audio generation using the SDK. - Clarified that OpenClaw agent mode is recommended for long tasks and provided notify_session_key details. - Referenced the main cellcog skill for advanced SDK usage, delivery modes, and file handling. - Documentation updates only; no functional or interface changes.

v1.0.5

- Added OS compatibility metadata for Darwin, Linux, and Windows. - Updated skill description for improved clarity and SEO. - Added homepage link to CellCog website. - Improved formatting and metadata structure in SKILL.md. - No changes to core functionality; documentation and metadata only.

v1.0.4

- Adds support for three voice providers: OpenAI, ElevenLabs, and MiniMax, each with unique capabilities. - Introduces avatar/cloned voice generation via MiniMax, allowing users to create audio in their own voice. - Expands features to include standalone sound effects (up to 30 seconds) and longer music generation (up to 10 minutes). - Clarifies provider recommendations by scenario, with detailed guidance on emotional tags (ElevenLabs) and fine-grained controls (MiniMax). - Updates documentation with new usage examples, usage tips, and multi-language support across all providers.

v1.0.3

- Added author and dependencies fields to SKILL.md for clearer metadata. - Updated prerequisite instructions to refer to the cellcog skill by name. - Minor edits for clarity and consistency in setup and usage guidance.

v1.0.2

audio-cog 1.0.2 - Added detailed documentation of all 8 available CellCog voices, including usage recommendations and voice characteristics. - Included guidance on choosing voices by content type and how to customize styles (accent, emotion, pacing, etc.). - Added music licensing statement: all generated music is royalty-free and usable for any commercial purpose. - Expanded multi-language support list and updated multi-language example prompts. - Updated example prompts and tips to reflect voice selection and new usage patterns.

v1.0.1

- Adds metadata (including an emoji) for enhanced identification. - Updates quick-start usage pattern to use `create_chat` with simplified, fire-and-forget execution and notification model (v1.0+). - Clearly standardizes on `chat_mode="agent"` as optimal for all audio tasks, deprecating previous agent team recommendations. - Updates guidance to reflect new workflow and best practices. - No change in feature set or audio capabilities.

v1.0.0

- Initial release of audio-cog: Professional AI audio generation powered by CellCog. - Supports text-to-speech, voice synthesis, narration, voiceovers, podcast production, music creation, and sound design. - Offers voice customization (gender, age, emotion, accent, pacing, tone). - Enables music and background audio generation with detailed control (genre, tempo, mood, instruments, duration). - Multi-language speech generation and various audio output formats. - Includes prompt examples, guidance for agent team mode, and detailed usage tips.

元数据

Slug audio-cog

版本 1.0.12

许可证 MIT-0

累计安装 223

当前安装数 38

历史版本数 13

常见问题

Audio Cog 是什么？

AI audio generation and text-to-speech powered by CellCog. Voiceover, narration, voice cloning, avatar voices, sound effects, music, podcasts, dialogue. Thre... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件，目前累计下载 5929 次。

如何安装 Audio Cog？

在 OpenClaw 或 Claude Code 对话框中运行命令「/install audio-cog」即可一键安装，无需额外配置。

Audio Cog 是免费的吗？

是的，Audio Cog 完全免费，采用 MIT-0 许可证，可自由下载、安装和使用。

Audio Cog 支持哪些平台？

Audio Cog 跨平台运行，可在任意部署了 OpenClaw / Claude Code 的环境中使用（darwin, linux, windows）。

谁开发了 Audio Cog？

由 CellCog（@nitishgargiitd）开发并维护，当前版本 v1.0.12。