susan4731-wilfordf

Ai Subtitle Generator Tiktok

by susan4731-wilfordf · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ⚠ suspicious
70 Downloads · 0 Stars · 0 Active Installs · 1 Version
Install in OpenClaw
/install ai-subtitle-generator-tiktok
Description
Turn a 30-second vertical TikTok video into 1080p captioned TikTok videos just by typing what you need. Whether it's adding subtitles to TikTok videos automa...
README (SKILL.md)

Getting Started

Ready when you are. Drop your TikTok video clips here or describe what you want to make.

Try saying:

  • "turn a 30-second vertical TikTok video into a 1080p MP4"
  • "add auto-generated captions in English with bold white text"
  • "add subtitles to my TikTok videos automatically"

Automatic Setup

On first interaction, connect to the processing API before doing anything else. Show a brief status like "Setting things up...".

Token: If the NEMO_TOKEN environment variable is already set, use it and skip to Session below.

Free token: Generate a UUID as the client identifier, then POST to https://mega-api-prod.nemovideo.ai/api/auth/anonymous-token with the header X-Client-Id: <uuid>. The response field data.token becomes your NEMO_TOKEN (100 credits, 7-day expiry).

Session: POST to https://mega-api-prod.nemovideo.ai/api/tasks/me/with-session/nemo_agent with Bearer auth and body {"task_name":"project"}. Save session_id from the response.

Confirm to the user you're connected and ready. Don't print tokens or raw JSON.
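
The setup steps above can be sketched as two request builders plus a token resolver. This is a minimal sketch using only the Python standard library. The function names and the plain-dict request description are hypothetical, and `fetch_json` stands in for whatever HTTP client the agent uses. Only `data.token` and `session_id` are documented response fields; the attribution headers described under Cloud Render Pipeline Details are left out here for brevity.

```python
import os
import uuid

API_BASE = "https://mega-api-prod.nemovideo.ai"


def build_anonymous_token_request(client_id=None):
    """Describe (not send) the POST that requests a free anonymous token."""
    client_id = client_id or str(uuid.uuid4())
    return {
        "method": "POST",
        "url": f"{API_BASE}/api/auth/anonymous-token",
        "headers": {"X-Client-Id": client_id},
    }


def build_session_request(token):
    """Describe (not send) the POST that creates a session."""
    return {
        "method": "POST",
        "url": f"{API_BASE}/api/tasks/me/with-session/nemo_agent",
        "headers": {"Authorization": f"Bearer {token}"},
        "json": {"task_name": "project"},
    }


def resolve_token(fetch_json):
    """Prefer NEMO_TOKEN from the environment, else request an anonymous token.

    `fetch_json` is a caller-supplied function (hypothetical) that sends the
    described request and returns the decoded JSON response.
    """
    token = os.environ.get("NEMO_TOKEN")
    if token:
        return token
    response = fetch_json(build_anonymous_token_request())
    return response["data"]["token"]
```

Keeping the builders free of network calls makes it easy to check the request shape before anything is sent, and the token never needs to be printed.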

AI Subtitle Generator TikTok — Auto-Caption and Export TikTok Videos

Drop your TikTok video clips in the chat and tell me what you need. I'll handle the AI subtitle generation on cloud GPUs — you don't need anything installed locally.

Here's a typical use: you send a 30-second vertical TikTok video, ask to "add auto-generated captions in English with bold white text", and about 20-40 seconds later you have an MP4 file ready to download. Output is rendered at 1080p by default.

One thing worth knowing — vertical 9:16 video is fully supported for TikTok-ready output.

Matching Input to Actions

User prompts that mention AI subtitle generation for TikTok, aspect ratio, text overlays, or audio tracks are routed to the corresponding action by keyword and intent classification.

| User says... | Action | Skip SSE? |
|---|---|---|
| "export" / "导出" / "download" / "send me the video" | §3.5 Export | Yes |
| "credits" / "积分" / "balance" / "余额" | §3.3 Credits | Yes |
| "status" / "状态" / "show tracks" | §3.4 State | Yes |
| "upload" / "上传" / user sends file | §3.2 Upload | Yes |
| Everything else (generate, edit, add BGM…) | §3.1 SSE | No |
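
The routing table can be approximated with a keyword lookup. This is a minimal sketch: it covers only the keyword half of "keyword and intent classification", uses plain substring matching, and the action names and the `route` function are hypothetical.

```python
# Keyword groups copied from the routing table; the first matching group wins.
ROUTES = [
    (("export", "导出", "download", "send me the video"), "export"),
    (("credits", "积分", "balance", "余额"), "credits"),
    (("status", "状态", "show tracks"), "state"),
    (("upload", "上传"), "upload"),
]


def route(message, has_attachment=False):
    """Return the action for a user message, defaulting to the SSE path."""
    if has_attachment:
        return "upload"
    text = message.lower()
    for keywords, action in ROUTES:
        if any(keyword in text for keyword in keywords):
            return action
    return "sse"
```

Anything that matches no keyword group, such as a caption or BGM request, falls through to `"sse"`.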

Cloud Render Pipeline Details

Each export job queues on a cloud GPU node that composites video layers, applies platform-spec compression (H.264, up to 1080x1920), and returns a download URL within 30-90 seconds. The session token carries render job IDs, so closing the tab before completion orphans the job.

Three attribution headers are required on every request and must match this file's frontmatter:

| Header | Value |
|---|---|
| X-Skill-Source | ai-subtitle-generator-tiktok |
| X-Skill-Version | frontmatter version |
| X-Skill-Platform | auto-detect: clawhub / cursor / unknown from install path |

All requests must include: Authorization: Bearer <NEMO_TOKEN>, X-Skill-Source, X-Skill-Version, X-Skill-Platform. Missing attribution headers will cause export to fail with 402.
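
One way to assemble these headers is sketched below. The substring check on the install path is an assumption, since the source only says the platform is auto-detected from that path, and both function names are hypothetical.

```python
def detect_platform(install_path):
    """Guess the platform from the install path (substring match is an assumption)."""
    lowered = install_path.lower()
    for name in ("clawhub", "cursor"):
        if name in lowered:
            return name
    return "unknown"


def build_headers(token, skill_version, install_path=""):
    """Return the auth and attribution headers required on every request."""
    return {
        "Authorization": f"Bearer {token}",
        "X-Skill-Source": "ai-subtitle-generator-tiktok",
        "X-Skill-Version": skill_version,
        "X-Skill-Platform": detect_platform(install_path),
    }
```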

API base: https://mega-api-prod.nemovideo.ai

Create session: POST /api/tasks/me/with-session/nemo_agent with body {"task_name":"project","language":"<lang>"}; returns task_id, session_id.

Send message (SSE): POST /run_sse with Accept: text/event-stream and body {"app_name":"nemo_agent","user_id":"me","session_id":"<sid>","new_message":{"parts":[{"text":"<msg>"}]}}. Max timeout: 15 minutes.
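
The message body can be built as below. This is a sketch only: `build_sse_request` and the dict layout (including the `timeout_seconds` key) are hypothetical conveniences, while the JSON fields themselves come from the line above.

```python
def build_sse_request(session_id, message):
    """Describe the POST /run_sse call for one user message."""
    return {
        "method": "POST",
        "path": "/run_sse",
        "headers": {"Accept": "text/event-stream"},
        "timeout_seconds": 15 * 60,  # documented maximum of 15 minutes
        "json": {
            "app_name": "nemo_agent",
            "user_id": "me",
            "session_id": session_id,
            "new_message": {"parts": [{"text": message}]},
        },
    }
```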

Upload: POST /api/upload-video/nemo_agent/me/<sid>. Send a file as multipart (-F "files=@/path") or a URL as JSON: {"urls":["<url>"],"source_type":"url"}
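
For the URL variant, the request might be described like this. It is a sketch with a hypothetical helper name; the multipart file variant is not shown.

```python
def build_url_upload(session_id, urls):
    """Describe the JSON upload call for one or more remote file URLs."""
    if not urls:
        raise ValueError("at least one URL is required")
    return {
        "method": "POST",
        "path": f"/api/upload-video/nemo_agent/me/{session_id}",
        "json": {"urls": list(urls), "source_type": "url"},
    }
```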

Credits: GET /api/credits/balance/simple — returns available, frozen, total

Session state: GET /api/state/nemo_agent/me/<sid>/latest. Key fields: data.state.draft, data.state.video_infos, data.state.generated_media
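
Pulling the three key fields out of a state response could look like this. It is a defensive sketch: the helper name is hypothetical, and anything in the response beyond the three documented paths is unknown.

```python
def extract_state(response):
    """Return the documented key fields, using None for any that are missing."""
    state = (response.get("data") or {}).get("state") or {}
    return {
        "draft": state.get("draft"),
        "video_infos": state.get("video_infos"),
        "generated_media": state.get("generated_media"),
    }
```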

Export (free, no credits): POST /api/render/proxy/lambda with body {"id":"render_<ts>","sessionId":"<sid>","draft":<json>,"output":{"format":"mp4","quality":"high"}}. Poll GET /api/render/proxy/lambda/<id> every 30s until status = completed. The download URL is at output.url.
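
The export-and-poll cycle can be sketched as follows. Several things here are assumptions rather than documented behavior: that `<ts>` is a Unix timestamp in seconds, that `status` and `output.url` sit at the top level of the poll response, and the `max_polls` cap. `get_status` is a caller-supplied function (hypothetical) that performs the GET.

```python
import time


def build_export_body(session_id, draft, now=None):
    """Build the render request body; the id uses a seconds timestamp (assumed)."""
    timestamp = int(now if now is not None else time.time())
    return {
        "id": f"render_{timestamp}",
        "sessionId": session_id,
        "draft": draft,
        "output": {"format": "mp4", "quality": "high"},
    }


def poll_render(get_status, render_id, interval=30, max_polls=30, sleep=time.sleep):
    """Poll until status is "completed" and return the download URL."""
    for attempt in range(max_polls):
        result = get_status(render_id)
        if result.get("status") == "completed":
            return result["output"]["url"]
        if attempt < max_polls - 1:
            sleep(interval)
    raise TimeoutError(f"render {render_id} did not complete after {max_polls} polls")
```

Injecting `sleep` keeps the 30-second interval out of tests and lets the agent swap in its own scheduler.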

Supported formats: mp4, mov, avi, webm, mkv, jpg, png, gif, webp, mp3, wav, m4a, aac.

SSE Event Handling

| Event | Action |
|---|---|
| Text response | Apply GUI translation (§4), present to user |
| Tool call/result | Process internally, don't forward |
| heartbeat / empty data: | Keep waiting. Every 2 min: "⏳ Still working..." |
| Stream closes | Process final response |

~30% of editing operations return no text in the SSE stream. When this happens: poll session state to verify the edit was applied, then summarize changes to the user.
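
The event handling above can be sketched as a small stream consumer. The event JSON shape is an assumption: the source does not document it, so this sketch guesses that text arrives as `content.parts[].text`, mirroring the request body. `clock` and `notify` are injected (hypothetical) so the two-minute reminder can be exercised without waiting.

```python
import json


def _parse_data_line(raw):
    """Return the decoded JSON object from a `data:` line, else None."""
    line = raw.strip()
    if not line.startswith("data:"):
        return None
    payload = line[len("data:"):].strip()
    if not payload:
        return None
    try:
        event = json.loads(payload)
    except json.JSONDecodeError:
        return None
    return event if isinstance(event, dict) else None


def consume_sse(lines, clock, notify, heartbeat_every=120):
    """Collect text parts; return (text, needs_state_check)."""
    texts = []
    last_notice = clock()
    for raw in lines:
        event = _parse_data_line(raw)
        if event is None:
            # Heartbeat or empty data: keep waiting, reassure the user periodically.
            if clock() - last_notice >= heartbeat_every:
                notify("⏳ Still working...")
                last_notice = clock()
            continue
        parts = (event.get("content") or {}).get("parts") or []
        for part in parts:
            # Only text is forwarded; tool calls and results are skipped here.
            if isinstance(part, dict) and "text" in part:
                texts.append(part["text"])
    text = "".join(texts)
    return text, text == ""
```

When the second return value is true, the stream closed without text, which is the cue to poll session state and summarize the changes.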

Backend Response Translation

The backend assumes a GUI exists. Translate these into API actions:

| Backend says | You do |
|---|---|
| "click [button]" / "点击" | Execute via API |
| "open [panel]" / "打开" | Query session state |
| "drag/drop" / "拖拽" | Send edit via SSE |
| "preview in timeline" | Show track summary |
| "Export button" / "导出" | Execute export workflow |

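The translation table can be approximated with ordered pattern rules. This is a sketch: the regular expressions, the action names, and the choice to check the more specific phrases first are all assumptions layered on the table.

```python
import re

# More specific phrases come first so "click the Export button" maps to export.
GUI_RULES = [
    (re.compile(r"export button|导出", re.IGNORECASE), "run_export"),
    (re.compile(r"preview in timeline", re.IGNORECASE), "show_track_summary"),
    (re.compile(r"\bdrag\b|\bdrop\b|拖拽", re.IGNORECASE), "send_edit_via_sse"),
    (re.compile(r"\bopen\b|打开", re.IGNORECASE), "query_session_state"),
    (re.compile(r"\bclick\b|点击", re.IGNORECASE), "execute_via_api"),
]


def translate_backend_text(text):
    """Return the API-side action for a GUI-style instruction, or None."""
    for pattern, action in GUI_RULES:
        if pattern.search(text):
            return action
    return None
```
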
Draft field mapping: t=tracks, tt=track type (0=video, 1=audio, 7=text), sg=segments, d=duration(ms), m=metadata.

Timeline (3 tracks):
  1. Video: city timelapse (0-10s)
  2. BGM: Lo-fi (0-10s, 35%)
  3. Title: "Urban Dreams" (0-3s)
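
A track summary like this one could be produced from a draft using the field mapping. The draft layout is an assumption: the sketch supposes `t` is a list of tracks, each with a `tt` type code and an `sg` list of segments that each carry `d` in milliseconds, and it reports only counts and total durations because the location of names and volumes inside `m` is not documented.

```python
TRACK_TYPES = {0: "Video", 1: "Audio", 7: "Text"}


def summarize_draft(draft):
    """Render a short, human-readable summary of the draft's tracks."""
    tracks = draft.get("t", [])
    lines = [f"Timeline ({len(tracks)} tracks):"]
    for index, track in enumerate(tracks, start=1):
        kind = TRACK_TYPES.get(track.get("tt"), "Unknown")
        segments = track.get("sg", [])
        total_ms = sum(segment.get("d", 0) for segment in segments)
        lines.append(f"{index}. {kind}: {len(segments)} segment(s), {total_ms / 1000:g}s")
    return "\n".join(lines)
```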

Error Handling

| Code | Meaning | Action |
|---|---|---|
| 0 | Success | Continue |
| 1001 | Bad/expired token | Re-auth via anonymous-token (tokens expire after 7 days) |
| 1002 | Session not found | New session §3.0 |
| 2001 | No credits | Anonymous: show registration URL with ?bind=<id> (get <id> from create-session or state response when needed). Registered: "Top up credits in your account" |
| 4001 | Unsupported file | Show supported formats |
| 4002 | File too large | Suggest compress/trim |
| 400 | Missing X-Client-Id | Generate Client-Id and retry (see §1) |
| 402 | Free plan export blocked | Subscription tier issue, NOT credits. "Register or upgrade your plan to unlock export." |
| 429 | Rate limit (1 token/client/7 days) | Retry in 30s once |
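
The error table maps naturally onto a lookup plus a single-retry helper. This is a sketch: the action names are hypothetical labels, and it assumes each response is a dict exposing the code under a `code` key, which the source does not confirm.

```python
import time

ERROR_ACTIONS = {
    0: "continue",
    1001: "reauthenticate",
    1002: "create_session",
    2001: "prompt_for_credits",
    4001: "show_supported_formats",
    4002: "suggest_compress_or_trim",
    400: "generate_client_id_and_retry",
    402: "prompt_register_or_upgrade",
    429: "retry_once_after_30s",
}


def action_for(code):
    """Look up the handling action for a response code."""
    return ERROR_ACTIONS.get(code, "report_unknown_error")


def send_with_rate_limit_retry(send, sleep=time.sleep, wait_seconds=30):
    """Call send(); on a 429, wait and retry exactly once."""
    response = send()
    if response.get("code") == 429:
        sleep(wait_seconds)
        response = send()
    return response
```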

Common Workflows

Quick edit: Upload → "add auto-generated captions in English with bold white text" → Download MP4. Takes 20-40 seconds for a 30-second clip.

Batch style: Upload multiple files in one session. Process them one by one with different instructions. Each gets its own render.

Iterative: Start with a rough cut, preview the result, then refine. The session keeps your timeline state so you can keep tweaking.

Tips and Tricks

The backend processes faster when you're specific. Instead of "make it look better", try "add auto-generated captions in English with bold white text" — concrete instructions get better results.

Max file size is 500MB. Stick to MP4, MOV, WebM, AVI for the smoothest experience.
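
A pre-upload check along these lines can catch oversized or unsupported files before any bytes are sent. It is a sketch: the helper name and return labels are hypothetical, and treating 500MB as 500,000,000 bytes is an assumption since the source does not say whether the limit is decimal or binary.

```python
import os

MAX_BYTES = 500 * 1000 * 1000  # assumes decimal megabytes
SUPPORTED = {
    "mp4", "mov", "avi", "webm", "mkv",
    "jpg", "png", "gif", "webp",
    "mp3", "wav", "m4a", "aac",
}


def check_upload(filename, size_bytes):
    """Return "ok", "unsupported_format", or "too_large" for a candidate file."""
    extension = os.path.splitext(filename)[1].lstrip(".").lower()
    if extension not in SUPPORTED:
        return "unsupported_format"  # corresponds to error 4001
    if size_bytes > MAX_BYTES:
        return "too_large"  # corresponds to error 4002
    return "ok"
```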

Export as MP4 for direct upload to TikTok without re-encoding.

Usage Guidance
This skill sends your video files and session data to https://mega-api-prod.nemovideo.ai and requires a NEMO_TOKEN (or it will create a short-term anonymous token). Before installing, verify you trust that external service and its privacy/data retention policies. Ask the publisher to clarify the apparent metadata inconsistency about ~/.config/nemovideo/, and whether the agent will inspect its install path to populate X-Skill-Platform (this may reveal environment details). If you prefer control, provide a scoped NEMO_TOKEN (not a broader credential), avoid sending sensitive footage, and confirm where rendered files are stored and how long tokens/sessions persist.
Capability Analysis
Type: OpenClaw Skill
Name: ai-subtitle-generator-tiktok
Version: 1.0.0

The skill instructs the agent to interface with a remote API (mega-api-prod.nemovideo.ai) for video processing and includes automated authentication via anonymous tokens. A significant security risk is identified in the SSE (Server-Sent Events) handling instructions in SKILL.md, which direct the agent to 'internally process' tool calls received from the remote server. This design creates a Remote Code Execution (RCE) vector where the third-party service could potentially execute any of the agent's local tools without user intervention. While this architecture supports the stated goal of cloud-based video editing, the remote control capability is a high-risk design pattern.
Capability Assessment
Purpose & Capability
Name/description (auto-caption + TikTok exports) aligns with required assets and calls — the skill uses a nemo video API, needs a NEMO_TOKEN, and performs uploads/exports appropriate for the stated purpose.
Instruction Scope
Runtime instructions focus on creating a session, uploading video files (multipart or URL), sending SSE messages, polling renders, and returning download URLs — all within the service domain. Two minor issues: (1) the SKILL.md frontmatter references a config path (~/.config/nemovideo/) while the registry metadata lists no required config paths (inconsistency); (2) headers require an 'X-Skill-Platform' value auto-detected from an install path which implies the agent may inspect its environment to determine platform. Uploading user videos to the external API is expected for this skill but is a privacy consideration.
Install Mechanism
Instruction-only skill with no install spec or downloaded artifacts — lowest-risk install footprint.
Credentials
Only a single credential (NEMO_TOKEN) is required and is directly used for Bearer auth to the nemo API. The skill also documents a flow to generate an anonymous token from the service if no token is provided (no extra credentials needed). The frontmatter's mention of a config path is inconsistent with registry metadata and should be clarified.
Persistence & Privilege
always is false, no installation hooks or modifications to other skills, and no persistent agent-wide privileges requested.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install ai-subtitle-generator-tiktok
  3. After installation, invoke the skill by name or use /ai-subtitle-generator-tiktok
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
AI Subtitle Generator TikTok: initial release
  • Instantly add auto-generated English subtitles to 30-second vertical TikTok videos, with export to 1080p MP4.
  • Drag and drop videos or type editing requests; cloud rendering completes jobs in 20–40 seconds.
  • Automatic account setup with free 7-day token and 100 credits for new users.
  • Full pipeline includes easy upload, edit-by-description, export, and support for session drafts.
  • Handles vertical (9:16) video for TikTok-ready exports; supports multiple formats (mp4, mov, webm, avi, etc).
  • Job status, credits balance, and error feedback built in.
Metadata
Slug: ai-subtitle-generator-tiktok
Version: 1.0.0
License: MIT-0
All-time Installs: 0
Active Installs: 0
Total Versions: 1
Frequently Asked Questions

What is Ai Subtitle Generator Tiktok?

Turn a 30-second vertical TikTok video into 1080p captioned TikTok videos just by typing what you need. Whether it's adding subtitles to TikTok videos automa... It is an AI Agent Skill for Claude Code / OpenClaw, with 70 downloads so far.

How do I install Ai Subtitle Generator Tiktok?

Run "/install ai-subtitle-generator-tiktok" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Ai Subtitle Generator Tiktok free?

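Yes, the skill itself is free and licensed under MIT-0, so you can download, install, and use it at no cost. The nemovideo.ai backend it calls meters usage with credits (anonymous tokens start with 100), and export can be blocked on the free plan.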

Which platforms does Ai Subtitle Generator Tiktok support?

Ai Subtitle Generator Tiktok is cross-platform and runs anywhere OpenClaw / Claude Code is available.

Who created Ai Subtitle Generator Tiktok?

It is built and maintained by susan4731-wilfordf (@susan4731-wilfordf); the current version is v1.0.0.
