← Back to Skills Marketplace
xinhao-96

有声读物生成助手

by XinHao-96 · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ✓ Security Clean
180
Downloads
1
Stars
0
Active Installs
1
Versions
Install in OpenClaw
/install story-audio-adapter
Description
Use when: 用户希望把带有 `[角色]文本` 标记的小说、剧本、故事台词转成多角色有声作品时触发。 适用于旁白、人物对白、角色 ID 已标注清楚的文本内容。Skill 会读取可编辑音色库,分析角色数量与性格特征,匹配最接近的音色,逐段调用 SenseAudio TTS,最后拼接为完整音频并以 `MEDIA...
Usage Guidance
This skill appears coherent: it needs only your SenseAudio API key to call the vendor's TTS endpoint and produces per-segment WAV output. Before installing, consider: (1) The script will try to pip-install the 'requests' package at runtime if missing — review/allow that network install on your environment. (2) The SENSEAUDIO_API_KEY you provide will be sent to the external SenseAudio API (default host api.senseaudio.cn); only supply a key you trust with this service and consider scoping/rotating it. (3) SKILL.md mentions SENSEAUDIO_API_BASE (optional) though it's not declared as required—if you need a custom endpoint, set that env var. (4) Outputs are written to the workspace outputs/ directory and local files (voice library, analysis JSON) are read/written; verify you are comfortable with that. If you want extra assurance, review scripts/main.py fully in your environment (it is included) before running, and consider running in a sandbox or with a dedicated API key with limited permissions.
Capability Analysis
Type: OpenClaw Skill Name: story-audio-adapter Version: 1.0.0 The skill bundle is a legitimate tool designed to convert structured story text into multi-role audio using the SenseAudio TTS API. The Python script `scripts/main.py` handles role parsing, heuristic voice matching, and audio concatenation using standard libraries and `ffmpeg`. While the script includes an automatic dependency installer (`pip install requests`) and executes `ffmpeg` via subprocess, these actions are transparently documented and implemented safely without shell injection risks. There is no evidence of data exfiltration, credential theft, or malicious prompt injection in `SKILL.md` or the supporting markdown files.
Capability Assessment
Purpose & Capability
Name/description (multi-role story→audio) align with the actual code and declared requirement (SENSEAUDIO_API_KEY). The script implements voice-library reading, role analysis, per-segment TTS calls, and audio concatenation as advertised.
Instruction Scope
SKILL.md limits scope to reading the editable voice library, role analysis prompt, and the provided story text, then calling SenseAudio TTS and returning MEDIA:./...; the code follows that flow. It does not read unrelated system files or other environment variables beyond the API key/base.
Install Mechanism
There is no formal install spec (instruction-only + included Python script). The script will auto-install the 'requests' package at runtime using pip if missing (ensure_python_package), which performs a network package install. This is expected for a Python CLI but means the runtime will attempt to modify the local Python environment.
Credentials
Only SENSEAUDIO_API_KEY is declared as required (primary credential) which is proportional for a cloud TTS service. SKILL.md also suggests exporting SENSEAUDIO_API_BASE, though that variable is not listed in requires.env — it's optional in code (defaults to https://api.senseaudio.cn). No other secrets or unrelated env vars are requested.
Persistence & Privilege
The skill is not 'always: true' and uses normal agent invocation. It does not request system-wide persistent privileges, nor does it modify other skills' configs. It writes outputs beneath the workspace outputs/ directory as expected.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install story-audio-adapter
  3. After installation, invoke the skill by name or use /story-audio-adapter
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
Initial release of story-audio-adapter. - Converts structured `[角色]内容` story/novel scripts into multi-role audio using SenseAudio TTS. - Reads an editable voice library for role-to-voice matching; default voices provided. - Full workflow: config check, voice listing, role/voice analysis, segment synthesis, output `MEDIA:./...` for Feishu/OpenClaw. - Strict fallback and error handling for missing API keys, input limits, or empty voice library. - Outputs character analysis, voice mapping, and always returns a media reference for channel consumption.
Metadata
Slug story-audio-adapter
Version 1.0.0
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 1
Frequently Asked Questions

What is 有声读物生成助手?

Use when: 用户希望把带有 `[角色]文本` 标记的小说、剧本、故事台词转成多角色有声作品时触发。 适用于旁白、人物对白、角色 ID 已标注清楚的文本内容。Skill 会读取可编辑音色库,分析角色数量与性格特征,匹配最接近的音色,逐段调用 SenseAudio TTS,最后拼接为完整音频并以 `MEDIA... It is an AI Agent Skill for Claude Code / OpenClaw, with 180 downloads so far.

How do I install 有声读物生成助手?

Run "/install story-audio-adapter" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is 有声读物生成助手 free?

Yes, 有声读物生成助手 is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does 有声读物生成助手 support?

有声读物生成助手 is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created 有声读物生成助手?

It is built and maintained by XinHao-96 (@xinhao-96); the current version is v1.0.0.

💬 Comments