← Back to Skills Marketplace
senseaudio-voice-ab-lab
by
Wu Ruixiao
· GitHub ↗
· v1.0.1
· MIT-0
255
Downloads
0
Stars
0
Active Installs
2
Versions
Install in OpenClaw
/install senseaudio-voice-ab-lab
Description
Use when a team wants to generate multiple ad, spoken-copy, sales, or promo voice variants from one typed or spoken creative brief, transcribe voice memos wi...
Usage Guidance
This skill performs the advertised tasks but the package metadata understates what it needs. Before installing or running it: 1) Expect to provide a SenseAudio API key (SENSEAUDIO_API_KEY) and possibly a SENSEAUDIO_PLATFORM_TOKEN for platform uploads; the skill will call https://api.senseaudio.cn and https://platform.senseaudio.cn. 2) If you want Feishu delivery, the skill expects Feishu app credentials/config (app_id/app_secret) accessible via its local config path — review where those are stored and how tenant tokens are fetched. 3) Review the missing shared helpers (audioclaw_paths, senseaudio_env, senseaudio_api_guard, feishu_audio_sender) before trusting runtime behavior — they may be in a parent repo in expected deployments but are not included in the manifest. 4) Limit API key scopes and use test/isolated credentials first; avoid giving production-wide keys until you audit the code paths. 5) Be aware the scripts will transcode (ffmpeg) and may call system utilities (afinfo); run in an environment where those binaries are safe and available. 6) If you need a definitive safety assessment, ask the publisher for corrected metadata listing required env vars and for the missing _shared modules, or run the skill in an isolated container and observe network endpoints it contacts.
Capability Analysis
Type: OpenClaw Skill
Name: senseaudio-voice-ab-lab
Version: 1.0.1
The senseaudio-voice-ab-lab skill bundle is a legitimate tool designed for marketing teams to generate and A/B test spoken ad variants. It provides a complete pipeline including speech-to-text (ASR) via senseaudio_asr.py, text variant generation in build_voice_ab_variants.py, batch text-to-speech synthesis in batch_tts_variants.py, and distribution to Feishu (Lark) via send_ab_variants_to_feishu.py. The scripts use standard Python libraries and interact with expected service endpoints (api.senseaudio.cn and open.feishu.cn) to perform their stated functions. No evidence of malicious intent, data exfiltration, or harmful prompt injection was found.
Capability Assessment
Purpose & Capability
Name/description (generate A/B voice variants, transcribe briefs) matches the code's core behavior. However the registry metadata claims no required environment variables or config paths, while the scripts clearly use SENSEAUDIO_API_KEY, optionally SENSEAUDIO_PLATFORM_TOKEN, SENSEAUDIO_ASR_MODEL, and rely on local config (audioclaw_paths.get_config_path/get_workspace_root) and a Feishu helper to fetch tenant tokens. Those credentials and config access are expected for the stated purpose, but the metadata omission is a coherence problem that could mislead users about what secrets/config are needed.
Instruction Scope
SKILL.md instructs the agent to save user audio, run ASR, build variants, synthesize via SenseAudio TTS, and (optionally) send audio into Feishu. The code implements exactly that. Concerns: (1) the SKILL.md and agent prompt encourage automatically sending variants to Feishu when the user asks to '试听/发语音/飞书', which will post user audio to an external chat service; (2) scripts read local config and helper modules (audioclaw_paths, _shared/*, feishu sender) that are not listed in the metadata; (3) scripts call system tools (ffmpeg, afinfo) and run subprocesses. These actions are within the stated purpose but expand the skill's access surface and require explicit credentials/config that are not declared.
Install Mechanism
No install spec (instruction-only) and all bundled code is local. There are no remote downloads in the install. That lowers supply-chain risk. However the package depends on helper modules in a parent _shared directory and on local environment/tooling (ffmpeg, afinfo), so runtime failures or implicit path traversal may occur if the expected repository layout isn't present.
Credentials
Registry shows 'no required env vars', but the code uses and/or checks: SENSEAUDIO_API_KEY (default for TTS/ASR open API), SENSEAUDIO_PLATFORM_TOKEN (platform upload mode), SENSEAUDIO_ASR_MODEL, and expects Feishu app_id/app_secret via a feishu config loaded from get_config_path(). The skill will fetch tenant tokens and upload audio to Feishu and post to SenseAudio endpoints (https://api.senseaudio.cn and https://platform.senseaudio.cn). Requesting these secrets is reasonable for the described functionality, but the metadata omission is misleading and increases risk if users supply broad-scoped credentials without understanding where they go.
Persistence & Privilege
This skill is not always:true and is user-invocable; it does not request persistent platform privileges. It can be invoked autonomously (default allowed) which is normal for skills; combine that with the credential/config mismatches above if you want extra caution, but there is no evidence it modifies other skills or system-wide settings.
How to Use
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat:
/install senseaudio-voice-ab-lab - After installation, invoke the skill by name or use
/senseaudio-voice-ab-lab - Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.1
AudioClaw/branding and API support updates.
- Updated branding: renamed all documentation and references from SenseAudio to AudioClaw.
- Adjusted TTS-related scripts to default to using SENSEAUDIO_API_KEY for AudioClaw API calls.
- Improved API key handling: TTS scripts now auto-replace public tokens with the actual key for synthesis.
- Updated documentation to reflect new naming, platform steps, and authentication rules.
- No major API or core workflow logic changes; workflows and commands remain compatible.
v1.0.0
Initial release of SenseAudio Voice AB Lab.
- Generate multiple spoken script variants from a single creative brief for ad, promo, sales, or regional experiments.
- Transcribe voice memos using SenseAudio ASR and extract structured briefs.
- Synthesize all variants with the same SenseAudio voice_id for controlled A/B testing.
- Supports direct audio review and sending results to Feishu with proper transcoding.
- Designed for fast creative iteration while keeping the voice consistent across all variants.
Metadata
Frequently Asked Questions
What is senseaudio-voice-ab-lab?
Use when a team wants to generate multiple ad, spoken-copy, sales, or promo voice variants from one typed or spoken creative brief, transcribe voice memos wi... It is an AI Agent Skill for Claude Code / OpenClaw, with 255 downloads so far.
How do I install senseaudio-voice-ab-lab?
Run "/install senseaudio-voice-ab-lab" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.
Is senseaudio-voice-ab-lab free?
Yes, senseaudio-voice-ab-lab is completely free, licensed under MIT-0. You can download, install and use it at no cost.
Which platforms does senseaudio-voice-ab-lab support?
senseaudio-voice-ab-lab is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).
Who created senseaudio-voice-ab-lab?
It is built and maintained by Wu Ruixiao (@kikidouloveme79); the current version is v1.0.1.
More Skills