← 返回 Skills 市场
peand-rover

Ai Video Subtitle Editor

作者 peandrover adam · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ✓ 安全检测通过
179
总下载
0
收藏
1
当前安装
1
版本数
在 OpenClaw 中安装
/install ai-video-subtitle-editor
功能描述
Create, edit, and style subtitles for any video with AI — auto-transcribe speech to text, translate subtitles to 50+ languages, style with custom fonts and c...
使用说明 (SKILL.md)

AI Video Subtitle Editor — Transcribe, Style, Translate, and Perfect Every Word on Screen

Subtitles are no longer optional. 85% of Facebook videos are watched without sound. TikTok and Instagram Reels autoplay muted by default. LinkedIn video starts silent in the feed. YouTube data shows that subtitled videos get 7-10% more watch time because viewers who might otherwise scroll past will stop and read. Accessibility regulations increasingly require captions for professional and educational content. Subtitles have evolved from an accessibility accommodation to a primary content consumption mode. The challenge is not whether to add subtitles — it is how to add them well. Auto-generated captions from most platforms are serviceable but ugly: small white text, poor timing, frequent errors, no style. Professional subtitles require: accurate transcription, precise word-level timing, readable fonts and colors, strategic positioning that does not block important visual elements, and stylistic choices that match the content's brand and energy. NemoVideo handles the entire subtitle workflow. AI transcription with 98%+ accuracy, word-level timing synchronization, 50+ language translation, and a library of subtitle styles from minimal cinema to animated TikTok. Edit any word, adjust any timing, change any style — then export with subtitles rendered cleanly into the video.

Use Cases

  1. Auto-Transcribe and Style — Complete Subtitle Workflow (any length) — A 15-minute YouTube video needs professional subtitles. NemoVideo: transcribes the entire audio track with 98%+ accuracy, aligns each word to its exact spoken moment (word-level sync, not sentence-level), applies the creator's chosen style (font, size, color, background, position, animation), handles multiple speakers (different colors per speaker), and renders subtitles directly into the video. From raw video to professionally subtitled content in one step.

  2. TikTok Animated Captions — Viral Subtitle Style (15-60s) — Short-form content needs the animated caption style that dominates TikTok: large bold text, word-by-word highlight animation (each word pops as it is spoken), bright colors with dark outlines, centered on screen. NemoVideo: applies the exact TikTok caption aesthetic — word-by-word animation synced to speech, bold sans-serif font, customizable highlight color (yellow, green, pink), positioned in the center-upper third of the frame. The subtitle style that is proven to increase watch time on short-form platforms.

  3. Multi-Language Translation — One Video, Global Audience (any length) — A course creator's English video needs subtitles in Spanish, Portuguese, Japanese, Korean, and Arabic. NemoVideo: transcribes the English audio, translates to all 5 languages with context-aware AI (not word-by-word dictionary translation), adjusts subtitle timing for each language (some languages use more/fewer words for the same meaning), handles right-to-left text for Arabic (proper RTL rendering), and exports 5 subtitle versions. One production, five markets.

  4. Subtitle Editing — Fix and Refine Existing Captions (any length) — An auto-generated transcript has errors: proper nouns misspelled, technical terms wrong, timing slightly off. NemoVideo: imports the existing subtitle file, highlights low-confidence words (likely errors), provides an editing interface for text and timing corrections, and re-renders with the corrected subtitles. Fixing a 90%-accurate auto-transcript to 100% instead of transcribing from scratch.

  5. Karaoke Style — Lyrics Display for Music Content (any length) — A music video or lyric video needs karaoke-style subtitles: each word or syllable highlights in sequence as the music plays, creating a follow-along reading experience. NemoVideo: aligns lyrics to the audio with syllable-level timing (not just word-level), applies karaoke highlight animation (color change sweeps across each word as it is sung), styles with the music's aesthetic (genre-appropriate fonts, colors matching the album art), and renders the sing-along visual. Music content that invites audience participation.

How It Works

Step 1 — Upload Video

Any video with speech, music, or both. NemoVideo auto-detects language and speaker count.

Step 2 — Choose Subtitle Style and Language

Select: style preset (minimal, TikTok animated, Netflix, karaoke, custom), languages, positioning, and font options.

Step 3 — Generate

curl -X POST https://mega-api-prod.nemovideo.ai/api/v1/generate \
  -H "Authorization: Bearer $NEMO_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{
    "skill": "ai-video-subtitle-editor",
    "prompt": "Add animated TikTok-style captions to a 45-second motivational speaking clip. Word-by-word highlight animation — each word pops in bright yellow as it is spoken. Bold sans-serif font, black outline, centered in upper-third of frame. Also generate Spanish and Portuguese subtitle versions with the same style. Export all three at 9:16 for Reels and TikTok.",
    "transcribe": true,
    "style": "tiktok-animated",
    "highlight_color": "#FFD700",
    "font": "bold-sans-serif",
    "outline": "black",
    "position": "upper-third-center",
    "languages": ["en", "es", "pt"],
    "format": "9:16"
  }'

Step 4 — Review and Edit

Preview subtitles over video. Edit any word, adjust any timing, change any style parameter. Re-render with changes.

Parameters

Parameter Type Required Description
prompt string Subtitle requirements
style string "minimal", "tiktok-animated", "netflix", "karaoke", "custom"
font string Font family or preset
highlight_color string Color for word-by-word animation
outline string Outline color and width
position string "bottom", "upper-third", "center", "custom"
languages array ["en", "es", "ja", "ko", "ar", ...]
transcribe boolean Auto-transcribe audio
import_srt string URL to existing subtitle file
speakers object {differentiate: true, colors: {"Speaker 1": "#fff"}}
format string "16:9", "9:16", "1:1"

Output Example

{
  "job_id": "avsub-20260328-001",
  "status": "completed",
  "transcription": {
    "language": "en",
    "confidence": 0.98,
    "words": 312,
    "speakers_detected": 1
  },
  "translations": ["es", "pt"],
  "style": "tiktok-animated",
  "outputs": {
    "en": {"file": "video-subtitled-en-9x16.mp4"},
    "es": {"file": "video-subtitled-es-9x16.mp4"},
    "pt": {"file": "video-subtitled-pt-9x16.mp4"}
  }
}

Tips

  1. Word-level sync is what separates professional from amateur subtitles — Sentence-level timing (entire sentence appears at once) feels disconnected from speech. Word-level timing (each word appears as spoken) creates a reading experience that matches the audio precisely. Always use word-level sync.
  2. TikTok animated style increases short-form watch time measurably — The large, bold, animated captions are not just aesthetic — they hold attention by giving the viewer an active reading task synchronized with audio. Dual-channel engagement (reading + listening) reduces scroll-away.
  3. Translation timing must adjust, not just translate — German uses 30% more words than English for the same meaning. Japanese uses fewer characters. Subtitle timing must expand or contract for each language, not just swap text at the same timestamps.
  4. Position subtitles in platform safe zones — TikTok's bottom 15% is covered by UI. YouTube's bottom area has progress bar and controls. Subtitles at the very bottom of the frame are often partially or fully hidden on the platforms where they matter most.
  5. Speaker differentiation prevents confusion in multi-person content — Color-coding speakers (Speaker A: white, Speaker B: yellow) instantly communicates who is speaking without needing "Speaker A:" labels that waste screen space and reading time.

Output Formats

Format Resolution Use Case
MP4 16:9 1080p / 4K YouTube / website
MP4 9:16 1080x1920 TikTok / Reels / Shorts
MP4 1:1 1080x1080 Instagram / LinkedIn

Related Skills

安全使用建议
This skill appears coherent for adding subtitles via an external NemoVideo API and only needs a single service token (NEMO_TOKEN). Before installing/use: (1) Confirm you trust nemovideo.ai (unknown homepage/source here) — the skill will upload video/audio to that external endpoint. (2) Only provide a token scoped for this service; avoid reusing high-privilege or long-lived tokens. (3) Check privacy/billing: uploaded content may be stored or billed by the service. (4) Note the metadata mismatch: SKILL.md references a config directory (~/.config/nemovideo/) — verify whether the platform will read local config files. (5) If you will upload sensitive videos, verify the vendor's data retention and security policies and consider using temporary/limited-scope tokens or rotating the token after use.
功能分析
Type: OpenClaw Skill Name: ai-video-subtitle-editor Version: 1.0.0 The skill bundle consists of metadata and documentation (SKILL.md) for an AI-powered video subtitle editor. It defines an interface for an external API (mega-api-prod.nemovideo.ai) and specifies the use of an environment variable (NEMO_TOKEN) and a configuration directory (~/.config/nemovideo/). The instructions are consistent with the stated purpose of transcribing and styling video subtitles, with no evidence of malicious code, data exfiltration, or prompt injection attacks.
能力评估
Purpose & Capability
The skill claims to transcribe, style, translate, and render subtitles and its runtime example calls an external NemoVideo API endpoint and references a bearer token (NEMO_TOKEN). Requiring a service token for an external subtitle/transcription API is appropriate for the stated purpose.
Instruction Scope
The SKILL.md instructs the agent to upload videos and call the mega-api-prod.nemovideo.ai API (via curl with Authorization: Bearer $NEMO_TOKEN). Uploading/local file access is expected for a video-subtitling skill, but it means the agent will transmit potentially sensitive video/audio to an external service. The instructions do not request unrelated system files or other environment variables.
Install Mechanism
No install spec or code files are included (instruction-only). This lowers risk because nothing is written to disk or executed locally by an install step.
Credentials
The primary credential is NEMO_TOKEN which is proportional to calling an external NemoVideo API. However, metadata in SKILL.md includes a configPaths entry (~/.config/nemovideo/) while the registry metadata indicated no required config paths; this metadata mismatch should be clarified. The skill does not request unrelated secrets.
Persistence & Privilege
always is false and there is no install step or requests to modify other skills or system-wide configs. The skill can be invoked autonomously by the agent (platform default), which is expected behavior for a service-integration skill.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install ai-video-subtitle-editor
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /ai-video-subtitle-editor 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.0
- Initial release of AI Video Subtitle Editor (v1.0.0). - Auto-transcribe speech to text with word-level timing accuracy. - Translate subtitles to 50+ languages, supporting right-to-left scripts. - Style subtitles with custom fonts, colors, positioning, and choose from multiple presets (TikTok animated, Netflix, YouTube, karaoke). - Edit text, adjust timing, and review subtitles before exporting. - Export videos with subtitles rendered directly in the video, supporting various aspect ratios.
元数据
Slug ai-video-subtitle-editor
版本 1.0.0
许可证 MIT-0
累计安装 1
当前安装数 1
历史版本数 1
常见问题

Ai Video Subtitle Editor 是什么?

Create, edit, and style subtitles for any video with AI — auto-transcribe speech to text, translate subtitles to 50+ languages, style with custom fonts and c... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 179 次。

如何安装 Ai Video Subtitle Editor?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install ai-video-subtitle-editor」即可一键安装,无需额外配置。

Ai Video Subtitle Editor 是免费的吗?

是的,Ai Video Subtitle Editor 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。

Ai Video Subtitle Editor 支持哪些平台?

Ai Video Subtitle Editor 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 Ai Video Subtitle Editor?

由 peandrover adam(@peand-rover)开发并维护,当前版本 v1.0.0。

💬 留言讨论