weiqingtangx

Transcribe audio/video to LRC, SRT, WebVTT, ASS, TTML and create karaoke videos

by weiqingtangx · GitHub ↗ · v1.0.1 · MIT-0
cross-platform ✓ Security Clean
Downloads: 66 · Stars: 0 · Active Installs: 0 · Versions: 2
Install in OpenClaw
/install karadeo
Description
Transcribe audio or video files to time-synced lyrics or subtitle formats like LRC, SRT, WebVTT, ASS, and TTML, and create karaoke videos.
README (SKILL.md)

Karadeo — Agent & AI Integration Guide

Karadeo is a karaoke and lyrics tooling platform. This document describes all available APIs, tools, and agent integration surfaces.

Authentication

All API endpoints require a Bearer API key from https://karadeo.com/dashboard.

Authorization: Bearer kd_<your-api-key>
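
As a minimal sketch, the header above could be built in Python from an environment variable so the key never appears in source code. The variable name KARADEO_API_KEY and the helper name auth_headers are assumptions made for this example; they are not part of the Karadeo documentation.

```python
import os


def auth_headers(env=os.environ):
    """Return the Authorization header for Karadeo requests.

    KARADEO_API_KEY is a hypothetical variable name chosen for this sketch.
    """
    key = env.get("KARADEO_API_KEY", "").strip()
    if not key:
        raise RuntimeError("KARADEO_API_KEY is not set")
    return {"Authorization": f"Bearer {key}"}
```

Passing a plain dict as env (for example auth_headers({"KARADEO_API_KEY": "kd_your_key"})) makes the helper easy to exercise without touching the real environment.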

MCP Server

Karadeo exposes a full MCP (Model Context Protocol) server for AI agent tool use.

Endpoint: POST https://karadeo.com/api/mcp
Transport: Streamable HTTP (JSON-RPC 2.0)
Discovery: https://karadeo.com/.well-known/mcp/server-card.json

Initialize

{"jsonrpc":"2.0","id":1,"method":"initialize","params":{}}

List Tools

{"jsonrpc":"2.0","id":2,"method":"tools/list","params":{}}

Call Tool

{
  "jsonrpc": "2.0",
  "id": 3,
  "method": "tools/call",
  "params": {
    "name": "transcribe",
    "arguments": {
      "audio_url": "https://example.com/song.mp3",
      "format": "lrc"
    }
  }
}
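
The three messages above share one JSON-RPC 2.0 envelope, so a client can generate them from a single helper. The sketch below builds the envelope and wraps it in an HTTP POST to the MCP endpoint without sending it. The helper names rpc and mcp_request are illustrative, and the Accept header reflects the usual expectation for Streamable HTTP clients rather than anything stated in this guide.

```python
import itertools
import json
import urllib.request

MCP_URL = "https://karadeo.com/api/mcp"
_ids = itertools.count(1)  # JSON-RPC ids only need to be unique per session


def rpc(method, params=None):
    """Build one JSON-RPC 2.0 request object with a fresh integer id."""
    return {"jsonrpc": "2.0", "id": next(_ids), "method": method, "params": params or {}}


def mcp_request(api_key, message):
    """Wrap a JSON-RPC message in an HTTP POST (built here, not sent)."""
    return urllib.request.Request(
        MCP_URL,
        data=json.dumps(message).encode("utf-8"),
        method="POST",
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
            # Assumption: a Streamable HTTP server may reply with JSON or an SSE stream.
            "Accept": "application/json, text/event-stream",
        },
    )


# Same tool name and argument names as the Call Tool example above.
call = rpc("tools/call", {
    "name": "transcribe",
    "arguments": {"audio_url": "https://example.com/song.mp3", "format": "lrc"},
})
req = mcp_request("kd_your_key", call)
```

Sending is then a matter of passing req to urllib.request.urlopen. The MCP specification defines further initialize parameters (such as protocolVersion and clientInfo); whether Karadeo requires them is not stated here, so the sketch keeps params empty as in the Initialize example.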

REST API

Transcribe

Convert audio or video to time-synced lyrics or subtitle files.

Endpoint: POST https://karadeo.com/api/transcribe
Docs: https://karadeo.com/resources/karadeo-lyrics-api
OpenAPI: https://karadeo.com/api/doc

Request body (JSON):

Field | Type | Required | Description
--- | --- | --- | ---
fileUrl | string | yes | Publicly accessible audio/video URL
format | string | no | Output format: lrc, srt, ass, webvtt, ttml, or txt (default: lrc)
transcriptText | string | no | Known lyrics to align instead of auto-transcribing
isWordLevel | boolean | no | Word-level timing (default: false)
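
The field list above translates directly into a small request-body builder. This is a sketch only: the function name build_transcribe_body and the client-side checks are assumptions for illustration, and the server remains the authority on what it accepts.

```python
ALLOWED_FORMATS = ("lrc", "srt", "ass", "webvtt", "ttml", "txt")


def build_transcribe_body(file_url, fmt="lrc", transcript_text=None, word_level=False):
    """Build the JSON body for POST /api/transcribe from the documented fields."""
    if not file_url.startswith(("http://", "https://")):
        raise ValueError("fileUrl must be a publicly accessible http(s) URL")
    if fmt not in ALLOWED_FORMATS:
        raise ValueError(f"format must be one of {', '.join(ALLOWED_FORMATS)}")
    body = {"fileUrl": file_url, "format": fmt}
    if transcript_text is not None:
        # Known lyrics are aligned to the audio instead of auto-transcribing.
        body["transcriptText"] = transcript_text
    if word_level:
        body["isWordLevel"] = True
    return body
```

Optional fields are omitted unless set, so the server-side defaults (lrc, line-level timing) apply when the caller passes only a URL.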

Example:

curl -X POST https://karadeo.com/api/transcribe \
  -H "Authorization: Bearer kd_your_key" \
  -H "Content-Type: application/json" \
  -d '{"fileUrl":"https://example.com/song.mp3","format":"lrc"}'

Response: Plain text subtitle file with appropriate Content-Type header.
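
Because the response is the subtitle file itself rather than a JSON envelope, a client mainly needs to decode the body and choose a file name. The sketch below assumes the response carries a standard Content-Type header with an optional charset; the names transcribe, output_name, and EXTENSIONS are illustrative, and the extension mapping follows common file-naming conventions rather than anything stated by Karadeo.

```python
import json
import urllib.request

API_URL = "https://karadeo.com/api/transcribe"
# Conventional file extensions per format (an assumption of this sketch).
EXTENSIONS = {"lrc": ".lrc", "srt": ".srt", "ass": ".ass",
              "webvtt": ".vtt", "ttml": ".ttml", "txt": ".txt"}


def transcribe(api_key, body, opener=urllib.request.urlopen):
    """POST the JSON body and return (text, content_type)."""
    req = urllib.request.Request(
        API_URL,
        data=json.dumps(body).encode("utf-8"),
        method="POST",
        headers={"Authorization": f"Bearer {api_key}",
                 "Content-Type": "application/json"},
    )
    with opener(req) as resp:
        charset = resp.headers.get_content_charset() or "utf-8"
        return resp.read().decode(charset), resp.headers.get_content_type()


def output_name(stem, fmt):
    """Pick a file name for the returned subtitle text."""
    return stem + EXTENSIONS[fmt]
```

The opener parameter exists so the function can be tried with a stub instead of the network. With the default opener, urllib raises urllib.error.HTTPError for 4xx and 5xx responses, which callers would typically catch.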


WebMCP (Browser)

Karadeo registers browser tools via navigator.modelContext.registerTool() on every page load.

Available tools:

  • list-karaoke-tools — list all Karadeo tools with URLs
  • navigate-to-tool — navigate to a specific tool page
  • list-karaoke-templates — list available karaoke video templates

Discovery Endpoints

Endpoint | Description
--- | ---
/.well-known/agent-skills/index.json | Agent Skills Discovery index (RFC v0.2.0)
/.well-known/mcp/server-card.json | MCP Server Card (SEP-1649)
/.well-known/api-catalog | API Catalog (RFC 9727)
/.well-known/oauth-authorization-server | OAuth discovery metadata (RFC 8414)
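
The paths above are relative to the site root. As a small sketch, an agent could resolve them to absolute URLs as follows; the dictionary keys and the function name discovery_urls are labels invented for this example.

```python
from urllib.parse import urljoin

BASE_URL = "https://karadeo.com"
DISCOVERY_PATHS = {
    "agent_skills": "/.well-known/agent-skills/index.json",
    "mcp_server_card": "/.well-known/mcp/server-card.json",
    "api_catalog": "/.well-known/api-catalog",
    "oauth_metadata": "/.well-known/oauth-authorization-server",
}


def discovery_urls(base=BASE_URL):
    """Resolve each well-known path against the site root."""
    return {name: urljoin(base, path) for name, path in DISCOVERY_PATHS.items()}
```

For the MCP entry this yields the same URL given as the Discovery address in the MCP Server section.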

Content Policy

Content-Signal: ai-train=no, search=yes, ai-input=no

AI agents may read this site for tool use. Training on site content is not permitted.
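
An agent that wants to honor this policy programmatically could parse the signal value into flags. The sketch below assumes the simple comma-separated name=yes/no form shown above and nothing more; parse_content_signal is a hypothetical helper name.

```python
def parse_content_signal(value):
    """Parse 'ai-train=no, search=yes, ai-input=no' into {name: bool}."""
    signals = {}
    for item in value.split(","):
        name, sep, flag = item.strip().partition("=")
        if not sep:
            continue  # skip fragments that are not name=value pairs
        signals[name.strip().lower()] = flag.strip().lower() == "yes"
    return signals


policy = parse_content_signal("ai-train=no, search=yes, ai-input=no")
```

Here policy["ai-train"] is False, so a crawler built on this helper would skip collecting the content for training.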

Usage Guidance
This skill appears coherent and purpose-aligned. Before installing or using it, confirm you trust karadeo.com, protect your API key, and avoid submitting private audio/video URLs unless you are comfortable sharing them with the service.
Capability Analysis
Type: OpenClaw Skill
Name: karadeo
Version: 1.0.1

The skill bundle provides documentation and API specifications for Karadeo, a karaoke and lyrics transcription service. It defines standard Model Context Protocol (MCP) and REST API endpoints hosted on karadeo.com. There are no signs of malicious code, data exfiltration, or prompt injection attempts; the content is strictly informational for agent integration.
Capability Tags
requires-oauth-token, requires-sensitive-credentials
Capability Assessment
Purpose & Capability
The documented REST API and MCP example are aligned with transcribing audio/video into subtitle or lyric formats. No unrelated local access or destructive behavior is shown.
Instruction Scope
The skill documents remote REST, MCP, and browser WebMCP tool surfaces. These are purpose-aligned, but users should understand that an agent may interact with Karadeo-hosted tools.
Install Mechanism
There is no install spec, no code files, no required binaries, and no static scan findings.
Credentials
The workflow depends on sending publicly accessible audio/video URLs, and optionally transcript text, to Karadeo. This is expected for transcription but matters for private media.
Persistence & Privilege
Karadeo API use requires a bearer API key. No local persistence, background process, or privileged system access is described.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install karadeo
  3. After installation, invoke the skill by name or use /karadeo
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.1
- No user-visible changes; documentation and skill functionality remain unchanged in this version.
v1.0.0
Karadeo-transcribe 1.0.0: Initial release
- Introduces a detailed integration guide for agents and AI, including authentication instructions.
- Documents the full MCP (Model Context Protocol) server endpoints and usage examples.
- Provides a REST API reference for audio/video transcription to time-synced lyrics or subtitles in multiple formats.
- Outlines browser tool registration and capabilities via WebMCP.
- Lists discovery endpoints for skill, API, and OAuth metadata.
- States the content policy restricting AI training and input usage.
Metadata
Slug: karadeo
Version: 1.0.1
License: MIT-0
All-time Installs: 0
Active Installs: 0
Total Versions: 2
Frequently Asked Questions

What is Transcribe audio/video to LRC, SRT, WebVTT, ASS, TTML and create karaoke videos?

Transcribe audio or video files to time-synced lyrics or subtitle formats like LRC, SRT, WebVTT, ASS, and TTML, and create karaoke videos. It is an AI Agent Skill for Claude Code / OpenClaw, with 66 downloads so far.

How do I install Transcribe audio/video to LRC, SRT, WebVTT, ASS, TTML and create karaoke videos?

Run "/install karadeo" in the OpenClaw or Claude Code chat to install it in one step. Calling the Karadeo API additionally requires a Bearer API key from https://karadeo.com/dashboard.

Is Transcribe audio/video to LRC, SRT, WebVTT, ASS, TTML and create karaoke videos free?

Yes. The skill is licensed under MIT-0, so you can download, install, and use it at no cost.

Which platforms does Transcribe audio/video to LRC, SRT, WebVTT, ASS, TTML and create karaoke videos support?

The skill is cross-platform and runs anywhere OpenClaw / Claude Code is available.

Who created Transcribe audio/video to LRC, SRT, WebVTT, ASS, TTML and create karaoke videos?

It is built and maintained by weiqingtangx (@weiqingtangx); the current version is v1.0.1.
