
Caption Generator Hindi

by linmillsd7 · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ⚠ suspicious
61 Downloads · 0 Stars · 0 Active Installs · 1 Version
Install in OpenClaw
/install caption-generator-hindi
Description
Get Hindi captioned videos ready to post, without touching a single slider. Upload your video files (MP4, MOV, AVI, WebM, up to 500MB), say something like "a...
README (SKILL.md)

Getting Started

Ready when you are. Drop your video files here or describe what you want to make.

Try saying:

  • "turn a 3-minute Hindi YouTube video into a captioned 1080p MP4"
  • "add captions in Hindi to my video"
  • "add Hindi subtitles to my videos for Hindi content creators"

Getting Connected

Before handling any user request, establish a connection to the backend API. Show a brief status like "Connecting...".

If NEMO_TOKEN is in the environment, use it directly and create a session. Otherwise, acquire a free starter token:

  • Generate a UUID as client identifier
  • POST to https://mega-api-prod.nemovideo.ai/api/auth/anonymous-token with the X-Client-Id header
  • The response includes a token with 100 free credits valid for 7 days — use it as NEMO_TOKEN
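The token step above can be sketched as a small decision function. This is a minimal illustration, not the skill's actual code: the function name `plan_token_step` and the returned dictionary layout are hypothetical, and the request is only described, not sent, so any HTTP client could be used to perform it.

```python
import os
import uuid

API_BASE = "https://mega-api-prod.nemovideo.ai"


def plan_token_step(env=None):
    """Reuse NEMO_TOKEN if present, else describe the anonymous-token request.

    The returned dict layout is a hypothetical convention for this sketch.
    """
    env = os.environ if env is None else env
    token = env.get("NEMO_TOKEN")
    if token:
        return {"action": "use_existing", "token": token}
    return {
        "action": "request_anonymous",
        "method": "POST",
        "url": f"{API_BASE}/api/auth/anonymous-token",
        # A fresh UUID serves as the client identifier.
        "headers": {"X-Client-Id": str(uuid.uuid4())},
    }
```

The name of the token field in the response is not documented here, so the sketch stops before parsing it.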

Then create a session by POSTing to https://mega-api-prod.nemovideo.ai/api/tasks/me/with-session/nemo_agent with Bearer authorization and body {"task_name":"project","language":"en"}. The session_id in the response is needed for all following requests.
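As a rough sketch, the session request could be assembled like this. The helper name `build_session_request` and the `Content-Type` header are assumptions made for illustration; the URL, the Bearer scheme, and the body come from the text above. The attribution headers required on every request are left out to keep the example short.

```python
import json

API_BASE = "https://mega-api-prod.nemovideo.ai"


def build_session_request(token, language="en"):
    """Describe the session-creation POST without sending it."""
    body = {"task_name": "project", "language": language}
    return {
        "method": "POST",
        "url": f"{API_BASE}/api/tasks/me/with-session/nemo_agent",
        "headers": {
            "Authorization": f"Bearer {token}",
            # Assumption: the API expects a JSON content type.
            "Content-Type": "application/json",
        },
        "body": json.dumps(body),
    }
```

The caller would read session_id from the JSON response and keep it for the requests that follow.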

Tell the user you're ready. Keep the technical details out of the chat.

Caption Generator Hindi — Generate Hindi Captions for Videos

Drop your video files in the chat and tell me what you need. I'll handle the Hindi caption generation on cloud GPUs — you don't need anything installed locally.

Here's a typical use: you send a 3-minute Hindi YouTube video, ask to "add captions in Hindi to my video", and about 30-60 seconds later you've got an MP4 file ready to download. The whole thing runs at 1080p by default.

One thing worth knowing — shorter clips under 5 minutes generate captions faster and more accurately.

Matching Input to Actions

User prompts referencing caption generator hindi, aspect ratio, text overlays, or audio tracks get routed to the corresponding action via keyword and intent classification.

User says... Action Skip SSE?
"export" / "导出" / "download" / "send me the video" → §3.5 Export
"credits" / "积分" / "balance" / "余额" → §3.3 Credits
"status" / "状态" / "show tracks" → §3.4 State
"upload" / "上传" / user sends file → §3.2 Upload
Everything else (generate, edit, add BGM…) → §3.1 SSE
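The keyword half of this routing can be sketched as a lookup over the phrases in the table. This is only an illustration under simple assumptions: plain substring matching stands in for the intent classification the skill mentions, and the action names (`export`, `credits`, `state`, `upload`, `sse`) are hypothetical labels for the sections listed above.

```python
# Keywords copied from the routing table; order decides ties.
ROUTES = [
    ("export", ("export", "导出", "download", "send me the video")),
    ("credits", ("credits", "积分", "balance", "余额")),
    ("state", ("status", "状态", "show tracks")),
    ("upload", ("upload", "上传")),
]


def route(message, has_file=False):
    """Pick an action for a user message; fall back to the SSE path."""
    if has_file:  # "user sends file" always means upload
        return "upload"
    text = message.lower()
    for action, keywords in ROUTES:
        if any(keyword in text for keyword in keywords):
            return action
    return "sse"
```

A message such as "add captions in Hindi to my video" matches no keyword and therefore goes to the SSE path.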

Cloud Render Pipeline Details

Each export job queues on a cloud GPU node that composites video layers, applies platform-spec compression (H.264, up to 1080x1920), and returns a download URL within 30-90 seconds. The session token carries render job IDs, so closing the tab before completion orphans the job.

Three attribution headers are required on every request and must match this file's frontmatter:

  • X-Skill-Source: caption-generator-hindi
  • X-Skill-Version: the version from the frontmatter
  • X-Skill-Platform: auto-detected from the install path (clawhub, cursor, or unknown)

Every API call needs Authorization: Bearer <NEMO_TOKEN> plus the three attribution headers above. If any header is missing, exports return 402.
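One way to assemble these headers is sketched below. How the platform is actually detected is not specified, so the substring check on the install path is an assumption, and both function names are hypothetical.

```python
def detect_platform(install_path):
    """Guess the platform from the install path (assumed substring match)."""
    path = install_path.lower()
    for name in ("clawhub", "cursor"):
        if name in path:
            return name
    return "unknown"


def build_headers(token, version, install_path):
    """Return the auth header plus the three attribution headers."""
    return {
        "Authorization": f"Bearer {token}",
        "X-Skill-Source": "caption-generator-hindi",
        "X-Skill-Version": version,
        "X-Skill-Platform": detect_platform(install_path),
    }
```

Building the headers in one place makes it harder to forget one of them on a single call.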

API base: https://mega-api-prod.nemovideo.ai

Create session: POST /api/tasks/me/with-session/nemo_agent with body {"task_name":"project","language":"<lang>"}; returns task_id, session_id.

Send message (SSE): POST /run_sse with body {"app_name":"nemo_agent","user_id":"me","session_id":"<sid>","new_message":{"parts":[{"text":"<msg>"}]}} and Accept: text/event-stream. Max timeout: 15 minutes.
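A minimal sketch of the message request follows. The helper name `build_run_sse_request` and the `timeout_seconds` key are hypothetical; the payload fields and the Accept header are taken from the line above. The request is described rather than sent.

```python
import json

API_BASE = "https://mega-api-prod.nemovideo.ai"


def build_run_sse_request(session_id, message):
    """Describe the /run_sse POST for one user message."""
    payload = {
        "app_name": "nemo_agent",
        "user_id": "me",
        "session_id": session_id,
        "new_message": {"parts": [{"text": message}]},
    }
    return {
        "method": "POST",
        "url": f"{API_BASE}/run_sse",
        "headers": {"Accept": "text/event-stream"},
        # ensure_ascii=False keeps Hindi text readable in the body.
        "body": json.dumps(payload, ensure_ascii=False),
        "timeout_seconds": 15 * 60,
    }
```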

Upload: POST /api/upload-video/nemo_agent/me/<sid>. For a file, send multipart (-F "files=@/path"); for a URL, send {"urls":["<url>"],"source_type":"url"}.

Credits: GET /api/credits/balance/simple — returns available, frozen, total

Session state: GET /api/state/nemo_agent/me/<sid>/latest; key fields: data.state.draft, data.state.video_infos, data.state.generated_media

Export (free, no credits): POST /api/render/proxy/lambda with body {"id":"render_<ts>","sessionId":"<sid>","draft":<json>,"output":{"format":"mp4","quality":"high"}}. Poll GET /api/render/proxy/lambda/<id> every 30s until status = completed. Download URL at output.url.
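The export-and-poll flow can be sketched as below. Several details are assumptions: the `<ts>` placeholder is treated as a Unix timestamp in seconds, `status` and `output.url` are read from the top level of the poll response, and `max_polls` is a hypothetical safety limit. The HTTP call is passed in as `fetch_status` so the loop can be exercised without a network.

```python
import time


def build_export_request(session_id, draft, now=None):
    """Build the export body; render id uses an assumed Unix-seconds stamp."""
    ts = int(time.time() if now is None else now)
    return {
        "id": f"render_{ts}",
        "sessionId": session_id,
        "draft": draft,
        "output": {"format": "mp4", "quality": "high"},
    }


def poll_render(fetch_status, render_id, interval=30, max_polls=40,
                sleep=time.sleep):
    """Call fetch_status(render_id) until the job reports completion."""
    for _ in range(max_polls):
        job = fetch_status(render_id)
        if job.get("status") == "completed":
            return job["output"]["url"]
        sleep(interval)  # wait before the next poll
    raise TimeoutError(f"render {render_id} did not complete")
```

Failure statuses are not documented here, so the sketch only distinguishes completed from not yet completed.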

Supported formats: mp4, mov, avi, webm, mkv, jpg, png, gif, webp, mp3, wav, m4a, aac.

Reading the SSE Stream

Text events go straight to the user (after GUI translation). Tool calls stay internal. Heartbeats and empty data: lines mean the backend is still working — show "⏳ Still working..." every 2 minutes.
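A rough sketch of the line handling is shown here. The heartbeat format is not specified, so treating SSE comment lines (those starting with a colon) as heartbeats is an assumption, as are the function names and the label strings they return.

```python
def classify_sse_line(line):
    """Label one raw SSE line as separator, heartbeat, data, or other."""
    line = line.rstrip("\r\n")
    if line == "":
        return ("separator", None)  # blank line ends an SSE event
    if line.startswith(":"):
        return ("heartbeat", None)  # assumed: comment lines act as heartbeats
    if line.startswith("data:"):
        payload = line[len("data:"):].strip()
        # An empty data: line means the backend is still working.
        return ("data", payload) if payload else ("heartbeat", None)
    return ("other", line)


def needs_progress_notice(last_notice_at, now, every=120):
    """True when at least `every` seconds passed since the last notice."""
    return now - last_notice_at >= every
```

The caller would show "⏳ Still working..." whenever `needs_progress_notice` returns true while only heartbeats arrive.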

About 30% of edit operations close the stream without any text. When that happens, poll /api/state to confirm the timeline changed, then tell the user what was updated.
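Confirming that the timeline changed can be sketched as a comparison of the draft before and after the edit. The function names are hypothetical, and the sketch assumes the state response is a JSON object with the draft at data.state.draft, as listed in the session state endpoint description.

```python
def extract_draft(state_response):
    """Return data.state.draft from a state response, or None if absent."""
    return state_response.get("data", {}).get("state", {}).get("draft")


def timeline_changed(before, after):
    """Compare the drafts of two state responses."""
    return extract_draft(before) != extract_draft(after)
```

Fetching the state once before sending the edit gives the baseline for this comparison.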

Backend Response Translation

The backend assumes a GUI exists. Translate these into API actions:

Backend says... You do
"click [button]" / "点击" → Execute via API
"open [panel]" / "打开" → Query session state
"drag/drop" / "拖拽" → Send edit via SSE
"preview in timeline" → Show track summary
"Export button" / "导出" → Execute export workflow

Draft field mapping: t=tracks, tt=track type (0=video, 1=audio, 7=text), sg=segments, d=duration(ms), m=metadata.

Timeline (3 tracks):
  1. Video: city timelapse (0-10s)
  2. BGM: Lo-fi (0-10s, 35%)
  3. Title: "Urban Dreams" (0-3s)
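A track summary along these lines could be produced from the short draft keys. The exact draft layout is not documented, so this sketch assumes a hypothetical shape: `t` is a list of tracks, each track has `tt` and a list `sg`, and each segment carries its duration `d` in milliseconds. It reports segment counts and total durations rather than clip names.

```python
TRACK_TYPES = {0: "Video", 1: "Audio", 7: "Text"}


def summarize_draft(draft):
    """Render a plain-text summary of the tracks in a draft."""
    tracks = draft.get("t", [])
    lines = [f"Timeline ({len(tracks)} tracks):"]
    for index, track in enumerate(tracks, start=1):
        kind = TRACK_TYPES.get(track.get("tt"), "Unknown")
        segments = track.get("sg", [])
        total_ms = sum(segment.get("d", 0) for segment in segments)
        lines.append(
            f"{index}. {kind}: {len(segments)} segment(s), {total_ms / 1000:.1f}s"
        )
    return "\n".join(lines)
```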

Error Codes

  • 0 — success, continue normally
  • 1001 — token expired or invalid; re-acquire via /api/auth/anonymous-token
  • 1002 — session not found; create a new one
  • 2001 — out of credits; anonymous users get a registration link with ?bind=<id>, registered users top up
  • 4001 — unsupported file type; show accepted formats
  • 4002 — file too large; suggest compressing or trimming
  • 400 — missing X-Client-Id; generate one and retry
  • 402 — export blocked on the free plan; this is a subscription-tier restriction, not a credit issue
  • 429 — rate limited; wait 30s and retry once
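The list above maps naturally onto a lookup table. In this sketch the action labels and the `action_for` helper are hypothetical names; only the codes and their meanings come from the list.

```python
# Codes from the list above mapped to hypothetical action labels.
ERROR_ACTIONS = {
    0: "continue",
    1001: "reacquire_token",
    1002: "create_session",
    2001: "out_of_credits",
    4001: "show_accepted_formats",
    4002: "suggest_compress_or_trim",
    400: "generate_client_id_and_retry",
    402: "explain_subscription_required",
    429: "wait_30s_and_retry_once",
}


def action_for(code):
    """Return the follow-up action for a code, or a generic fallback."""
    return ERROR_ACTIONS.get(code, "unknown_error")
```

Keeping the mapping in one table makes it easy to see which codes are retried and which are reported to the user.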

Common Workflows

Quick edit: Upload → "add captions in Hindi to my video" → Download MP4. Takes 30-60 seconds for a 30-second clip.

Batch style: Upload multiple files in one session. Process them one by one with different instructions. Each gets its own render.

Iterative: Start with a rough cut, preview the result, then refine. The session keeps your timeline state so you can keep tweaking.

Tips and Tricks

The backend processes faster when you're specific. Instead of "make it look better", try "add captions in Hindi to my video" — concrete instructions get better results.

Max file size is 500MB. Stick to MP4, MOV, AVI, WebM for the smoothest experience.
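A pre-upload check for these limits might look like the sketch below. It assumes 500MB means 500 × 1024 × 1024 bytes, and the function name and result labels are hypothetical. Formats outside the recommended four are flagged rather than rejected, since the API lists more supported types.

```python
import os

MAX_BYTES = 500 * 1024 * 1024  # assumption: MB counted as 1024 * 1024 bytes
PREFERRED_EXTENSIONS = {".mp4", ".mov", ".avi", ".webm"}


def check_upload(filename, size_bytes):
    """Return 'ok', 'too_large', or 'not_preferred_format' for a file."""
    if size_bytes > MAX_BYTES:
        return "too_large"
    extension = os.path.splitext(filename)[1].lower()
    if extension not in PREFERRED_EXTENSIONS:
        return "not_preferred_format"
    return "ok"
```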

Export as MP4 for widest compatibility across social platforms.

Usage Guidance
This skill appears to do what it says (upload your video to nemovideo.ai, generate Hindi captions, return a downloadable MP4), but check a few things before installing:
  1. Privacy: videos are uploaded to a third-party API (mega-api-prod.nemovideo.ai). Don't upload confidential content unless you trust that service.
  2. Token usage: the skill will use any NEMO_TOKEN in the agent environment; if none exists it will request an anonymous token from the service. Consider using a scoped/ephemeral token rather than a global secret.
  3. Config-path mismatch: the SKILL.md frontmatter mentions ~/.config/nemovideo/ (possible local config access) even though the registry metadata did not; ask the publisher why and what is read/written there.
  4. Attribution headers and platform auto-detection require the agent to inspect install paths; confirm you're comfortable with that.
  5. Verify the API domain and the publisher independently if you need higher assurance.
If these points are acceptable and you expect uploads to a cloud service, the skill's behavior is plausible; if not, treat it as risky and avoid installing.
Capability Analysis
Type: OpenClaw Skill Name: caption-generator-hindi Version: 1.0.0 The skill is designed to generate Hindi captions for videos by interfacing with the nemovideo.ai API. It follows a standard workflow for cloud-based video processing, including session management, file uploads, and polling for render status. While it requires an API token and communicates with an external domain (mega-api-prod.nemovideo.ai), these actions are consistent with its stated purpose, and there is no evidence of data exfiltration, malicious code execution, or harmful instructions to the agent.
Capability Assessment
Purpose & Capability
Name/description (Hindi captioning) aligns with the runtime instructions (upload video, create session, render/export). The single required env var NEMO_TOKEN is appropriate for a third-party API. However, the SKILL.md frontmatter declares a config path (~/.config/nemovideo/) not listed in the registry's top-level 'Required config paths', a mismatch worth questioning.
Instruction Scope
Instructions stay within the service's domain (mega-api-prod.nemovideo.ai) and describe session creation, SSE, upload, export, and polling. This requires uploading user video files to the external API and streaming SSE events — expected for a cloud render workflow but privacy-sensitive. The skill also tells the agent to auto-acquire an anonymous token if NEMO_TOKEN is absent and to 'auto-detect' a platform value from the install path, which implies the agent may read environment/paths beyond just the declared token.
Install Mechanism
Instruction-only skill with no install steps or code to download — lowest install risk. Nothing is written to disk by an installer because no install spec is provided.
Credentials
Only NEMO_TOKEN is declared as required (reasonable). But SKILL.md references a config directory (~/.config/nemovideo/) and asks to use NEMO_TOKEN if present; if not present it will obtain an anonymous token from the API. The extra config path in the frontmatter wasn't recorded in the registry metadata, creating an unexplained discrepancy. Requiring a single token is proportional, but the skill will access network endpoints and potentially local paths for 'auto-detect' behavior.
Persistence & Privilege
always is false and the skill is user-invocable; it does not request permanent 'always' presence or system-wide configuration changes. Autonomous invocation is allowed (default) but not combined with other high-risk flags here.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install caption-generator-hindi
  3. After installation, invoke the skill by name or use /caption-generator-hindi
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
Caption Generator Hindi: initial release
  • Generate accurate Hindi captions for uploaded video files (MP4, MOV, AVI, WebM, up to 500MB)
  • Automatic backend session setup using either an environment token or a free starter token
  • Fast 1080p MP4 export with cloud-based captioning, typically within 30–60 seconds
  • Supports credit management, session state, and export/download workflows
  • User-friendly: just upload, describe your goal, and receive ready-to-post videos in Hindi
Metadata
Slug caption-generator-hindi
Version 1.0.0
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 1
Frequently Asked Questions

What is Caption Generator Hindi?

Get Hindi captioned videos ready to post, without touching a single slider. Upload your video files (MP4, MOV, AVI, WebM, up to 500MB), say something like "a... It is an AI Agent Skill for Claude Code / OpenClaw, with 61 downloads so far.

How do I install Caption Generator Hindi?

Run "/install caption-generator-hindi" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Caption Generator Hindi free?

Yes, Caption Generator Hindi is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Caption Generator Hindi support?

Caption Generator Hindi is cross-platform and runs anywhere OpenClaw / Claude Code is available.

Who created Caption Generator Hindi?

It is built and maintained by linmillsd7 (@linmillsd7); the current version is v1.0.0.
