
Audio To Text Caption

Name: Audio To Text Caption
Author: leooooooow

by LeroyCreates · v1.0.0 · MIT-0

cross-platform · ✓ Security Clean

Downloads: 283

Install in OpenClaw

/install audio-to-text-caption

Description

Turn creator audio into clean text captions for ecommerce content and reuse. Use when teams need fast transcript-to-caption workflows.

README (SKILL.md)

Audio to Text Caption

Skill Card

Category: Image & Media Tools
Core problem: Teams lose time turning creator audio into usable captions and text drafts.
Best for: Short-video, live clip, and repurposing workflows.
Expected input: Audio source, language, output goal, style preference.
Expected output: Clean transcript + caption-ready text + review notes.

Workflow

Transcribe source audio.
Clean filler and formatting noise.
Format text for caption or script reuse.
Flag unclear segments for manual review.
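
The cleaning and flagging steps above can be sketched in Python. This is a minimal illustration, not the skill's implementation: the skill ships no code, so the Segment type, the FILLERS list, and the min_confidence threshold are all hypothetical, and it assumes the transcriber in use reports a per-segment confidence score.

```python
import re
from dataclasses import dataclass

# Hypothetical filler list; tune per language and speaker.
FILLERS = ("um", "uh", "er", "you know")


@dataclass
class Segment:
    start: float       # seconds
    end: float         # seconds
    text: str
    confidence: float  # 0.0-1.0, assumed to come from the transcriber


# Match a filler as a whole word, plus an optional trailing comma/period and spaces.
_FILLER_RE = re.compile(
    r"\b(?:" + "|".join(re.escape(f) for f in FILLERS) + r")\b[,.]?\s*",
    flags=re.IGNORECASE,
)


def clean_text(text: str) -> str:
    """Remove filler words and collapse leftover whitespace."""
    without_fillers = _FILLER_RE.sub("", text)
    return re.sub(r"\s+", " ", without_fillers).strip()


def process(segments, min_confidence=0.6):
    """Return (cleaned segments, segments flagged for manual review)."""
    cleaned, review = [], []
    for seg in segments:
        if seg.confidence < min_confidence:
            # Do not guess at unclear speech: keep the raw text and flag it.
            review.append(seg)
            cleaned.append(seg)
        else:
            cleaned.append(
                Segment(seg.start, seg.end, clean_text(seg.text), seg.confidence)
            )
    return cleaned, review
```

Low-confidence segments are passed through untouched rather than cleaned, so a reviewer sees exactly what the transcriber produced.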

Output format

Transcript summary
Clean transcript
Caption-ready version
Manual review list
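
As a rough illustration of the four-part output, the sketch below assembles the sections into one plain-text report. The function name render_report and the layout are assumptions; the bundle's own output-template.md is not shown here and may differ.

```python
def render_report(summary, transcript, caption, review_items):
    """Assemble the four output sections into one plain-text report."""
    # An empty review list is stated explicitly rather than left blank.
    review_body = "\n".join(f"- {item}" for item in review_items) or "- None"
    sections = [
        ("Transcript summary", summary),
        ("Clean transcript", transcript),
        ("Caption-ready version", caption),
        ("Manual review list", review_body),
    ]
    return "\n\n".join(f"{title}\n{body}" for title, body in sections)
```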

Quality and safety rules

Prioritize readability and reuse.
Do not invent unclear speech.
Adapt formatting to subtitle or script needs.
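
To make the subtitle-versus-script distinction concrete, here is a small sketch that renders the same timed cues either as SubRip (SRT) subtitles or as a flat script paragraph. The skill does not name SRT as a target format; it is used here only as a common example, and the cue tuples (start, end, text) are a hypothetical input shape.

```python
def srt_timestamp(seconds: float) -> str:
    """Format seconds as an SRT timestamp: HH:MM:SS,mmm."""
    millis = round(seconds * 1000)
    hours, millis = divmod(millis, 3_600_000)
    minutes, millis = divmod(millis, 60_000)
    secs, millis = divmod(millis, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{millis:03d}"


def to_srt(cues):
    """Render (start, end, text) cues as numbered SRT blocks."""
    blocks = []
    for index, (start, end, text) in enumerate(cues, start=1):
        blocks.append(
            f"{index}\n{srt_timestamp(start)} --> {srt_timestamp(end)}\n{text}"
        )
    return "\n\n".join(blocks) + "\n"


def to_script(cues):
    """Render the same cues as one flowing paragraph for script reuse."""
    return " ".join(text for _, _, text in cues)
```

Keeping timing and text together until the final rendering step lets one cleaned transcript feed both outputs.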

License

This skill is provided under CC BY-NC-SA 4.0 for non-commercial use. You may reuse and adapt it with attribution to Razestar, and share derivatives under the same license.

Commercial use requires a separate paid commercial license from Razestar. No trademark rights are granted.

Usage Guidance

This skill is internally consistent and appears to do what it says: transcribe and clean audio for captions. Because it is instruction-only, however, it does not specify how transcription is performed; the actual runtime behavior depends on the agent platform and whichever transcription tools or APIs are available there. Before installing or using it with sensitive content, confirm: (1) whether your agent or platform uses a local model or a third-party cloud ASR (and, if cloud, where the audio is sent), (2) the privacy and retention policies that apply to uploaded audio, and (3) the licensing terms (the SKILL.md states a CC BY-NC-SA 4.0 license, with commercial use requiring a paid license from Razestar, while the listing metadata shows MIT-0). If you need to avoid external uploads, test with non-sensitive audio first and verify the agent's configured toolchain, or ask the platform how transcription is performed.

Capability Analysis

Type: OpenClaw Skill
Name: audio-to-text-caption
Version: 1.0.0

The skill bundle contains only metadata and instructional markdown files (SKILL.md and output-template.md) for an AI agent to perform audio-to-text transcription and captioning. There is no executable code, no network activity, and no evidence of prompt injection or malicious intent.

Capability Assessment

✓ Purpose & Capability

Name, description, and SKILL.md all describe transcription, cleaning, and caption formatting; there are no unrelated required env vars, binaries, or install steps that would be disproportionate to that purpose.

✓ Instruction Scope

Runtime instructions are limited to transcribing audio, removing fillers, formatting for captions, and flagging unclear segments. The SKILL.md does not direct reading unrelated files, exfiltrating data, or contacting external endpoints. It does not specify the transcription implementation (local vs. external), which is an intentional omission rather than scope creep.

✓ Install Mechanism

No install spec and no code files — instruction-only skill. This is the lowest-risk install surface; nothing is written to disk by the skill itself.

✓ Credentials

The skill declares no required environment variables, credentials, or config paths. There is no apparent need for secrets or unrelated credentials.

✓ Persistence & Privilege

The skill sets always:false and has no install scripts or config writes. It does not request persistent system presence or elevated privileges.

How to Use

Make sure OpenClaw is installed (local or Docker)
Run the install command in chat: /install audio-to-text-caption
After installation, invoke the skill by name or use /audio-to-text-caption
Provide required inputs per the skill's parameter spec and get structured output

Version History

v1.0.0

Initial release

Metadata

Slug: audio-to-text-caption
Version: 1.0.0
License: MIT-0
All-time Installs: 0
Active Installs: 0
Total Versions: 1

Frequently Asked Questions

What is Audio To Text Caption?

Turn creator audio into clean text captions for ecommerce content and reuse. Use when teams need fast transcript-to-caption workflows. It is an AI Agent Skill for Claude Code / OpenClaw, with 283 downloads so far.

How do I install Audio To Text Caption?

Run "/install audio-to-text-caption" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Audio To Text Caption free?

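The listing shows Audio To Text Caption as free under MIT-0, so you can download, install, and use it at no cost. Note, however, that the bundled SKILL.md states CC BY-NC-SA 4.0 for non-commercial use and requires a separate paid license from Razestar for commercial use.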

Which platforms does Audio To Text Caption support?

Audio To Text Caption is cross-platform and runs anywhere OpenClaw / Claude Code is available.

Who created Audio To Text Caption?

It is built and maintained by LeroyCreates (@leooooooow); the current version is v1.0.0.
