← Back to Skills Marketplace
praanmichael

Step Audiobook

by praanmichael · GitHub ↗ · v0.1.4 · MIT-0
cross-platform ✓ Security Clean
131
Downloads
2
Stars
0
Active Installs
5
Versions
Install in OpenClaw
/install step-audiobook
Description
Use when your agent needs to build, maintain, or run the local `audiobook` skill for voice-library management, Step official voice sync, clone voice analysis...
Usage Guidance
This skill appears internally consistent with its stated purpose, but it performs network calls and writes local files — consider these before installing: 1) Use a test or limited STEP_API_KEY; the skill will call Step endpoints and may upload audio/text. 2) Paid cloning requires explicit confirmation (--confirm-paid-action); still double-check before running cloning commands. 3) Run initially in an isolated/sandbox workspace to verify behavior and review generated files under ~/.openclaw/workspace. 4) Inspect scripts if you have extra concerns about what is sent remotely (the repo documents endpoints and has a references/security.md). 5) Install ffmpeg/ffprobe from a trusted package source. 6) If you plan to change the LLM base_url, be aware that long-text reasoning calls will be sent to whatever base_url you configure (and will use STEP_API_KEY by default).
Capability Analysis
Type: OpenClaw Skill Name: step-audiobook Version: 0.1.4 The skill provides a comprehensive workflow for generating audiobooks using StepFun's APIs (LLM, TTS, and voice cloning). It includes scripts for voice library management, script structuring, casting, and audio synthesis. The code uses standard libraries for networking (urllib) and file handling, and utilizes system binaries like ffmpeg for audio processing. All behaviors, including API key resolution and external communication with api.stepfun.com, are well-documented and strictly aligned with the stated purpose of the skill. No evidence of malicious intent, data exfiltration, or unauthorized execution was found.
Capability Assessment
Purpose & Capability
Name/description target audio library, voice analysis, TTS and cloning workflows; required binaries (ffmpeg, ffprobe), required env var STEP_API_KEY, and included scripts (sync, analyze, build_tts_requests, clone helper, run_audiobook) all align with that purpose.
Instruction Scope
SKILL.md explicitly instructs the agent to read/write files under a local workspace (~/.openclaw/workspace), run the included Python scripts, and call Step endpoints (step_plan, step-audio-r1.1, stepaudio-2.5-tts). This behavior is expected for a local workflow tool, but it does mean user data (audio and text) and generated artifacts will be written to disk and (by default) sent to Step endpoints — the README warns about billing and requires explicit confirmation for paid clone actions.
Install Mechanism
Install spec only offers a brew formula to install ffmpeg/ffprobe (well-known package); no arbitrary downloads or extract/install of unknown binaries in the manifest.
Credentials
The skill requires a single credential STEP_API_KEY as primaryEnv and documents that this key is used for Step/LLM calls. No unrelated secrets or broad system credentials are requested. The code and templates reference only workspace paths under ~/.openclaw by default.
Persistence & Privilege
always:false and no indicators that the skill attempts to modify other skills or system-wide settings. It writes artifacts to its own workspace paths, which is normal for a workflow skill.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install step-audiobook
  3. After installation, invoke the skill by name or use /step-audiobook
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v0.1.4
Update audiobook workflow content and docs
v0.1.3
Update audiobook workflow content and docs
v0.1.2
Update audiobook workflow content and docs
v0.1.1
Add expanded English documentation and refresh skill docs.
v0.1.0
Initial publish: local audiobook workflow for voice library sync, structured script generation, casting, replayable TTS requests, segment synthesis, and final export.
Metadata
Slug step-audiobook
Version 0.1.4
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 5
Frequently Asked Questions

What is Step Audiobook?

Use when your agent needs to build, maintain, or run the local `audiobook` skill for voice-library management, Step official voice sync, clone voice analysis... It is an AI Agent Skill for Claude Code / OpenClaw, with 131 downloads so far.

How do I install Step Audiobook?

Run "/install step-audiobook" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Step Audiobook free?

Yes, Step Audiobook is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Step Audiobook support?

Step Audiobook is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Step Audiobook?

It is built and maintained by praanmichael (@praanmichael); the current version is v0.1.4.

💬 Comments