
有声读物生成助手 (Audiobook Generation Assistant)

Name: 有声读物生成助手
Author: xinhao-96

by XinHao-96 · v1.0.0 · MIT-0

cross-platform ✓ Security Clean

Downloads: 180

Install in OpenClaw

/install story-audio-adapter

Description

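Use when: the user wants to turn a novel, script, or story dialogue marked up as `[角色]文本` (`[role]text`) into a multi-role audio production. It suits text in which narration, character dialogue, and role IDs are already clearly labeled. The skill reads the editable voice library, analyzes the number of roles and their personality traits, matches each role to the closest voice, calls SenseAudio TTS segment by segment, and finally concatenates the segments into one complete audio file and as `MEDIA...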
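The `[role]text` input convention described above can be illustrated with a small parser. This is a minimal sketch, not the skill's actual code: the regular expression, the function names `parse_segments` and `list_roles`, and the rule that an untagged line continues the previous segment are all assumptions made for illustration.

```python
import re
from typing import List, Tuple

# Hypothetical pattern: a role tag in square brackets, then that role's text.
ROLE_LINE = re.compile(r"^\[(?P<role>[^\[\]]+)\](?P<text>.*)$")


def parse_segments(script: str) -> List[Tuple[str, str]]:
    """Split a `[role]text` script into (role, text) pairs."""
    segments: List[Tuple[str, str]] = []
    for raw in script.splitlines():
        line = raw.strip()
        if not line:
            continue  # skip blank lines
        match = ROLE_LINE.match(line)
        if match:
            segments.append((match.group("role").strip(),
                             match.group("text").strip()))
        elif segments:
            # Assumption: an untagged line continues the previous segment.
            role, text = segments[-1]
            segments[-1] = (role, f"{text} {line}".strip())
        # Untagged lines before the first tag are ignored in this sketch.
    return segments


def list_roles(segments: List[Tuple[str, str]]) -> List[str]:
    """Return the distinct roles in order of first appearance."""
    seen: List[str] = []
    for role, _ in segments:
        if role not in seen:
            seen.append(role)
    return seen
```

For example, `parse_segments("[Narrator]Night fell.\n[Ming]Hello?")` yields one `(role, text)` pair per tagged line, and `list_roles` gives the role count that a later voice-matching step would need.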

Usage Guidance

This skill appears coherent: it needs only your SenseAudio API key to call the vendor's TTS endpoint, and it produces per-segment WAV output. Before installing, consider:
(1) The script will try to pip-install the 'requests' package at runtime if it is missing; review and allow that network install in your environment.
(2) The SENSEAUDIO_API_KEY you provide will be sent to the external SenseAudio API (default host api.senseaudio.cn); only supply a key you trust with this service, and consider scoping or rotating it.
(3) SKILL.md mentions SENSEAUDIO_API_BASE (optional) though it is not declared as required; if you need a custom endpoint, set that env var.
(4) Outputs are written to the workspace outputs/ directory, and local files (voice library, analysis JSON) are read and written; verify you are comfortable with that.
If you want extra assurance, review scripts/main.py fully in your environment (it is included) before running, and consider running in a sandbox or with a dedicated API key with limited permissions.

Capability Analysis

Type: OpenClaw Skill
Name: story-audio-adapter
Version: 1.0.0

The skill bundle is a legitimate tool designed to convert structured story text into multi-role audio using the SenseAudio TTS API. The Python script `scripts/main.py` handles role parsing, heuristic voice matching, and audio concatenation using standard libraries and `ffmpeg`. While the script includes an automatic dependency installer (`pip install requests`) and executes `ffmpeg` via subprocess, these actions are transparently documented and implemented safely without shell injection risks. There is no evidence of data exfiltration, credential theft, or malicious prompt injection in `SKILL.md` or the supporting markdown files.
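To show what running `ffmpeg` via subprocess "without shell injection risks" usually means, here is a hedged sketch that passes the command as an argument list, so no shell ever parses the file names. The listing does not show the skill's real `ffmpeg` invocation; the helper names `build_concat_command` and `run_concat`, and the choice of ffmpeg's concat demuxer, are assumptions.

```python
import subprocess
from typing import List


def build_concat_command(list_file: str, output_file: str) -> List[str]:
    """Build an ffmpeg concat-demuxer command as an argument list.

    Each file name is its own list element and is never interpolated
    into a shell string, so characters such as ';' or '$' have no
    special meaning.
    """
    return [
        "ffmpeg", "-y",        # overwrite the output without prompting
        "-f", "concat",        # use the concat demuxer
        "-safe", "0",          # allow arbitrary paths in the list file
        "-i", list_file,       # text file with one "file '<path>'" line per segment
        "-c", "copy",          # copy streams without re-encoding
        output_file,
    ]


def run_concat(list_file: str, output_file: str) -> None:
    """Run ffmpeg with shell=False (the default); raises on a non-zero exit."""
    subprocess.run(build_concat_command(list_file, output_file), check=True)
```

Because `subprocess.run` receives a list and the default `shell=False`, an output name such as `out; rm -rf x.wav` stays a single, literal argument.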

Capability Assessment

✓ Purpose & Capability

Name/description (multi-role story→audio) align with the actual code and declared requirement (SENSEAUDIO_API_KEY). The script implements voice-library reading, role analysis, per-segment TTS calls, and audio concatenation as advertised.
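The role-to-voice matching step could look roughly like the sketch below. The listing does not document the voice library format or the matching heuristic, so everything here is hypothetical: the `id` and `tags` fields, the trait-overlap score, and the rule of falling back to the first voice when nothing overlaps.

```python
from typing import Dict, List, Set


def match_voice(role_traits: Set[str], voice_library: List[Dict]) -> str:
    """Pick the voice whose tags overlap most with a role's traits.

    Hypothetical library entry: {"id": "...", "tags": ["calm", ...]}.
    """
    if not voice_library:
        # The listing mentions strict handling of an empty voice library.
        raise ValueError("voice library is empty")

    def overlap(voice: Dict) -> int:
        return len(role_traits & set(voice.get("tags", [])))

    # max() returns the first best entry, so with zero overlap everywhere
    # the first voice in the library acts as the default.
    return max(voice_library, key=overlap)["id"]
```

Usage with a two-entry library: `match_voice({"young", "energetic"}, library)` returns the `id` of whichever entry shares the most tags with those traits.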

✓ Instruction Scope

SKILL.md limits scope to reading the editable voice library, role analysis prompt, and the provided story text, then calling SenseAudio TTS and returning MEDIA:./...; the code follows that flow. It does not read unrelated system files or other environment variables beyond the API key/base.

ℹ Install Mechanism

There is no formal install spec (instruction-only + included Python script). The script will auto-install the 'requests' package at runtime using pip if missing (ensure_python_package), which performs a network package install. This is expected for a Python CLI but means the runtime will attempt to modify the local Python environment.
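A runtime auto-installer of the kind described here is commonly written as below. The function name `ensure_python_package` comes from the listing, but its real signature and body are not shown; the `installer` parameter, the `pip_install` helper, and the boolean return value are assumptions added to make the sketch testable without touching the network.

```python
import importlib.util
import subprocess
import sys
from typing import Callable


def pip_install(package: str) -> None:
    """Install a package with the pip of the running interpreter."""
    subprocess.check_call([sys.executable, "-m", "pip", "install", package])


def ensure_python_package(
    package: str,
    installer: Callable[[str], None] = pip_install,
) -> bool:
    """Install `package` only if it cannot be imported.

    Returns True if an install was attempted, False if the package
    was already available.
    """
    if importlib.util.find_spec(package) is not None:
        return False
    installer(package)  # network access and environment change happen here
    return True
```

Calling `ensure_python_package("requests")` at startup would reproduce the behavior the assessment flags: if the import lookup fails, the local Python environment is modified.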

ℹ Credentials

Only SENSEAUDIO_API_KEY is declared as required (primary credential) which is proportional for a cloud TTS service. SKILL.md also suggests exporting SENSEAUDIO_API_BASE, though that variable is not listed in requires.env — it's optional in code (defaults to https://api.senseaudio.cn). No other secrets or unrelated env vars are requested.
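The credential handling described above (one required key, one optional base URL with a default) can be sketched as follows. The environment variable names and the default host come from the listing; the `SenseAudioConfig` type, the `load_config` function, and the error message are illustrative assumptions.

```python
import os
from typing import Mapping, NamedTuple

DEFAULT_API_BASE = "https://api.senseaudio.cn"  # default stated in the listing


class SenseAudioConfig(NamedTuple):
    api_key: str
    api_base: str


def load_config(env: Mapping[str, str] = os.environ) -> SenseAudioConfig:
    """Read the required API key and the optional base URL from `env`."""
    api_key = env.get("SENSEAUDIO_API_KEY", "").strip()
    if not api_key:
        # Fail early instead of sending unauthenticated requests.
        raise RuntimeError("SENSEAUDIO_API_KEY is not set")
    api_base = env.get("SENSEAUDIO_API_BASE", "").strip() or DEFAULT_API_BASE
    return SenseAudioConfig(api_key=api_key, api_base=api_base.rstrip("/"))
```

Passing the environment as a parameter keeps the function easy to check with a plain dictionary, for example `load_config({"SENSEAUDIO_API_KEY": "k"})`.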

✓ Persistence & Privilege

The skill is not 'always: true' and uses normal agent invocation. It does not request system-wide persistent privileges, nor does it modify other skills' configs. It writes outputs beneath the workspace outputs/ directory as expected.

How to Use

Make sure OpenClaw is installed (local or Docker)
Run the install command in chat: /install story-audio-adapter
After installation, invoke the skill by name or use /story-audio-adapter
Provide required inputs per the skill's parameter spec and get structured output

Version History

v1.0.0

Initial release of story-audio-adapter.
- Converts structured `[角色]内容` (`[role]content`) story/novel scripts into multi-role audio using SenseAudio TTS.
- Reads an editable voice library for role-to-voice matching; default voices provided.
- Full workflow: config check, voice listing, role/voice analysis, segment synthesis, output `MEDIA:./...` for Feishu/OpenClaw.
- Strict fallback and error handling for missing API keys, input limits, or empty voice library.
- Outputs character analysis, voice mapping, and always returns a media reference for channel consumption.

Metadata

Slug story-audio-adapter

Version 1.0.0

License MIT-0

All-time Installs 0

Active Installs 0

Total Versions 1

Frequently Asked Questions

What is 有声读物生成助手?

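Use when: the user wants to turn a novel, script, or story dialogue marked up as `[角色]文本` (`[role]text`) into a multi-role audio production. It suits text in which narration, character dialogue, and role IDs are already clearly labeled. The skill reads the editable voice library, analyzes the number of roles and their personality traits, matches each role to the closest voice, calls SenseAudio TTS segment by segment, and finally concatenates the segments into one complete audio file and as `MEDIA... It is an AI Agent Skill for Claude Code / OpenClaw, with 180 downloads so far.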

How do I install 有声读物生成助手?

Run "/install story-audio-adapter" in the OpenClaw or Claude Code chat to install it in one step. To run the skill, you also need to provide a SENSEAUDIO_API_KEY.

Is 有声读物生成助手 free?

Yes, 有声读物生成助手 is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does 有声读物生成助手 support?

有声读物生成助手 is cross-platform and runs anywhere OpenClaw / Claude Code is available.

Who created 有声读物生成助手?

It is built and maintained by XinHao-96 (@xinhao-96); the current version is v1.0.0.
