leooooooow

Audio To Text Caption

by LeroyCreates · GitHub ↗ · v1.0.0 · MIT-0
cross-platform · ✓ Security Clean
283 Downloads · 0 Stars · 0 Active Installs · 1 Version
Install in OpenClaw
/install audio-to-text-caption
Description
Turn creator audio into clean text captions for ecommerce content and reuse. Use when teams need fast transcript-to-caption workflows.
README (SKILL.md)

Audio to Text Caption

Skill Card

  • Category: Image & Media Tools
  • Core problem: Teams lose time turning creator audio into usable captions and text drafts.
  • Best for: Short-video, live clip, and repurposing workflows.
  • Expected input: Audio source, language, output goal, style preference.
  • Expected output: Clean transcript + caption-ready text + review notes.

Workflow

  1. Transcribe source audio.
  2. Clean filler and formatting noise.
  3. Format text for caption or script reuse.
  4. Flag unclear segments for manual review.
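
Steps 2 and 4 of the workflow can be sketched roughly as follows. This is a minimal illustration, not the skill's actual implementation: the skill does not specify how transcription is performed, so the `Segment` shape, the `confidence` score, the filler list, and the `min_confidence` threshold are all assumptions made for the example.

```python
import re
from dataclasses import dataclass

# Hypothetical filler list; adjust per language and style preference.
FILLERS = re.compile(r"\b(?:um+|uh+|erm?)\b,?\s*", re.IGNORECASE)


@dataclass
class Segment:
    start: float       # seconds from the start of the audio
    end: float
    text: str
    confidence: float  # assumed 0.0-1.0 score from whatever ASR tool is in use


def clean_text(text: str) -> str:
    """Step 2: drop filler words and collapse leftover whitespace."""
    text = FILLERS.sub("", text)
    text = re.sub(r"\s+", " ", text).strip()
    # Re-capitalize in case a leading filler was removed.
    return text[:1].upper() + text[1:] if text else text


def process(segments, min_confidence=0.6):
    """Clean every segment and collect doubtful ones for manual review (step 4)."""
    cleaned, review = [], []
    for seg in segments:
        new = Segment(seg.start, seg.end, clean_text(seg.text), seg.confidence)
        cleaned.append(new)
        # Flag rather than guess: low confidence or nothing left after cleaning.
        if seg.confidence < min_confidence or not new.text:
            review.append(new)
    return cleaned, review
```

Flagged segments keep their timestamps so a reviewer can jump straight to the unclear audio instead of rereading the whole transcript.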

Output format

  1. Transcript summary
  2. Clean transcript
  3. Caption-ready version
  4. Manual review list
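
As an illustration of the four-part output, the sketch below assembles the sections into one plain-text report. The function name `build_report`, the indentation, and the "None" placeholder are assumptions for the example; the bundled output template may lay things out differently.

```python
def build_report(summary, transcript, captions, review_items):
    """Assemble the four output sections in the order listed above."""
    sections = [
        ("Transcript summary", [summary]),
        ("Clean transcript", transcript),
        ("Caption-ready version", captions),
        # Show an explicit placeholder when nothing needs review.
        ("Manual review list", review_items or ["None"]),
    ]
    lines = []
    for number, (title, body) in enumerate(sections, start=1):
        lines.append(f"{number}. {title}")
        lines.extend(f"   {item}" for item in body)
        lines.append("")  # blank line between sections
    return "\n".join(lines).rstrip() + "\n"


if __name__ == "__main__":
    print(build_report(
        "Creator introduces a new serum.",
        ["So this is the new serum."],
        ["So this is", "the new serum."],
        [],
    ))
```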

Quality and safety rules

  • Prioritize readability and reuse.
  • Do not invent unclear speech.
  • Adapt formatting to subtitle or script needs.
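
To make the last rule concrete, here is a hedged sketch of rendering the same timed text two ways: as SubRip (SRT) subtitle cues and as a plain script paragraph. The `(start, end, text)` tuple shape and the 42-character line width are assumptions chosen for the example, not requirements of the skill.

```python
import textwrap


def srt_timestamp(seconds: float) -> str:
    """Format seconds as the HH:MM:SS,mmm timestamp used by SRT files."""
    total_ms = round(seconds * 1000)
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def to_srt(cues, max_chars=42):
    """Subtitle form: numbered cues with timestamps and wrapped lines."""
    blocks = []
    for index, (start, end, text) in enumerate(cues, start=1):
        wrapped = "\n".join(textwrap.wrap(text, width=max_chars))
        blocks.append(
            f"{index}\n{srt_timestamp(start)} --> {srt_timestamp(end)}\n{wrapped}"
        )
    return "\n\n".join(blocks) + "\n"


def to_script(cues):
    """Script form: the same text as one paragraph, timestamps dropped."""
    return " ".join(text for _, _, text in cues)
```

Keeping the timed cues as the single source and deriving both forms from them avoids the caption and script versions drifting apart.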

License

Copyright (c) 2026 Razestar.

This skill is provided under CC BY-NC-SA 4.0 for non-commercial use. You may reuse and adapt it with attribution to Razestar, and share derivatives under the same license.

Commercial use requires a separate paid commercial license from Razestar. No trademark rights are granted.

Usage Guidance
This skill is internally consistent and appears to do what it says: transcribe and clean audio for captions. Because it is instruction-only, however, it does not specify how transcription is performed; the actual runtime behavior depends on the agent platform and whatever transcription tools or APIs are available there. Before installing it or using it with sensitive content, confirm: (1) whether your agent or platform uses a local model or a third-party cloud ASR service (and, if cloud, where the audio is sent), (2) the privacy and retention policies that apply to uploaded audio, and (3) the licensing terms (the SKILL.md states a CC BY-NC-SA license, with commercial use requiring a paid license from Razestar). If you need to avoid external uploads, test with non-sensitive audio first and verify the agent's configured toolchain, or ask the platform how transcription is performed.
Capability Analysis
Type: OpenClaw Skill
Name: audio-to-text-caption
Version: 1.0.0
The skill bundle contains only metadata and instructional markdown files (SKILL.md and output-template.md) for an AI agent to perform audio-to-text transcription and captioning. There is no executable code, no network activity, and no evidence of prompt injection or malicious intent.
Capability Assessment
Purpose & Capability
Name, description, and SKILL.md all describe transcription, cleaning, and caption formatting; there are no unrelated required env vars, binaries, or install steps that would be disproportionate to that purpose.
Instruction Scope
Runtime instructions are limited to transcribing audio, removing fillers, formatting for captions, and flagging unclear segments. The SKILL.md does not direct reading unrelated files, exfiltrating data, or contacting external endpoints. It does not specify the transcription implementation (local vs. external), which is an intentional omission rather than scope creep.
Install Mechanism
No install spec and no code files — instruction-only skill. This is the lowest-risk install surface; nothing is written to disk by the skill itself.
Credentials
The skill declares no required environment variables, credentials, or config paths. There is no apparent need for secrets or unrelated credentials.
Persistence & Privilege
always:false and no install scripts or config writes. The skill does not request persistent system presence or elevated privileges.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install audio-to-text-caption
  3. After installation, invoke the skill by name or use /audio-to-text-caption
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
Initial release
Metadata
Slug: audio-to-text-caption
Version: 1.0.0
License: MIT-0
All-time Installs: 0
Active Installs: 0
Total Versions: 1
Frequently Asked Questions

What is Audio To Text Caption?

Turn creator audio into clean text captions for ecommerce content and reuse. Use when teams need fast transcript-to-caption workflows. It is an AI Agent Skill for Claude Code / OpenClaw, with 283 downloads so far.

How do I install Audio To Text Caption?

Run "/install audio-to-text-caption" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Audio To Text Caption free?

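Yes, Audio To Text Caption is free to download, install, and use. The listing is tagged MIT-0, but the bundled SKILL.md states CC BY-NC-SA 4.0 and requires a separate paid license from Razestar for commercial use, so check the license terms before any commercial use.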

Which platforms does Audio To Text Caption support?

Audio To Text Caption is cross-platform and runs anywhere OpenClaw / Claude Code is available.

Who created Audio To Text Caption?

It is built and maintained by LeroyCreates (@leooooooow); the current version is v1.0.0.
