Text To Video Local Model

by whitejohnk-26 · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ⚠ suspicious
Downloads 67
Stars 0
Active Installs 0
Versions 1
Install in OpenClaw
/install text-to-video-local-model
Description
Turn text prompts into AI-generated videos with this skill. Works with TXT, JSON, CSV, and MD files up to 500MB. Developers and AI enthusiasts use it for gen...
Usage Guidance
This skill is labeled as a 'local model' but actually runs your prompts and uploaded files through a cloud service (mega-api-prod.nemovideo.ai). Before installing or using it, consider: (1) Do you accept sending potentially sensitive text and files to an external service? (2) The skill will use or automatically obtain a NEMO_TOKEN and include it in every request; tokens grant access to your account and credits. (3) The skill has no listed source or homepage, and the registry metadata is inconsistent with the frontmatter, which weakens its provenance. If you still want to try it, test only with non-sensitive sample data, verify the remote domain is legitimate, and prefer a skill with a published source, privacy policy, and clear owner. If you expected strictly local-only processing, do not install or use this skill.
Capability Analysis
Type: OpenClaw Skill
Name: text-to-video-local-model
Version: 1.0.0
The skill exhibits deceptive behavior by branding itself as a 'Local Model' in its name and display name, while the internal instructions (SKILL.md) explicitly mandate a cloud-based processing pipeline via 'mega-api-prod.nemovideo.ai'. It instructs the agent to hide tokens and raw API outputs from the user and performs environment fingerprinting to detect the host platform (e.g., Cursor vs. OpenClaw). While it does not explicitly exfiltrate system secrets like SSH keys, the contradiction between its 'local' privacy claim and its cloud-reliant implementation is a significant indicator of deceptive intent.
Capability Assessment
Purpose & Capability
The name and description advertise a "local model" and local generation, but the SKILL.md instructs the agent to use a cloud rendering pipeline at https://mega-api-prod.nemovideo.ai for session creation, SSE, uploads and exports. That is a substantive mismatch between claimed purpose and actual behavior. The frontmatter also mentions a config path (~/.config/nemovideo/) that is not declared in the registry metadata, an internal inconsistency.
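The mismatch described above can be checked mechanically before installing a skill that claims to be local. The following is a minimal sketch, not part of the skill or of any marketplace tooling: it scans instruction text for URLs and lists every host that is not on a local allowlist. The function name `remote_hosts`, the allowlist, and the sample text (including the localhost preview URL) are assumptions made for illustration; only the nemovideo host comes from the review.

```python
import re
from urllib.parse import urlparse

# Hosts treated as "local"; this allowlist is an assumption for illustration.
LOCAL_HOSTS = {"localhost", "127.0.0.1", "::1"}

# Rough URL matcher: stops at whitespace, quotes, angle brackets and ")".
URL_PATTERN = re.compile(r"https?://[^\s'\"<>)]+")


def remote_hosts(skill_text: str) -> list[str]:
    """Return the sorted, de-duplicated non-local hosts referenced in the text."""
    hosts = set()
    for url in URL_PATTERN.findall(skill_text):
        try:
            host = urlparse(url).hostname
        except ValueError:
            # Malformed URL (for example an unbalanced IPv6 bracket); skip it.
            continue
        if host and host not in LOCAL_HOSTS:
            hosts.add(host)
    return sorted(hosts)


# Hypothetical excerpt standing in for a skill's instruction file.
sample = (
    "Create a session at https://mega-api-prod.nemovideo.ai before rendering. "
    "Preview is served from http://localhost:8080/preview."
)
print(remote_hosts(sample))  # ['mega-api-prod.nemovideo.ai']
```

An empty result does not prove that a skill is local-only, since hosts can be assembled at runtime, but a non-empty result for a skill advertised as local is a reason to read the instructions closely.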
Instruction Scope
Runtime instructions direct the agent to POST files and prompts to external endpoints, stream SSE, upload multipart files, poll render status and manage session tokens. They also instruct the agent to read the skill's YAML frontmatter and detect the install path in order to populate attribution headers. These behaviors go beyond 'local' processing and involve sending potentially large user files to a third-party service.
Install Mechanism
There is no install specification and no code files — the skill is instruction-only, which minimizes on-disk install risk. No external downloads or package installs are requested.
Credentials
The only declared required credential is NEMO_TOKEN, which aligns with the described API usage. However, the skill will auto-acquire an anonymous token by POSTing to the service when NEMO_TOKEN is absent; this behavior is functionally reasonable but worth flagging because it means the skill will create and store/use tokens on the user's behalf and send them with every request.
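To see which of the two token paths would apply on a given machine, a user can inspect the environment before invoking the skill. This is a hedged sketch that makes no network calls and does not reproduce the skill's own logic: `token_plan` is a hypothetical helper, and treating a whitespace-only value as absent is an assumption.

```python
import os
from typing import Mapping, Optional


def token_plan(env: Optional[Mapping[str, str]] = None) -> str:
    """Describe which token path applies, without contacting any service."""
    env = os.environ if env is None else env
    # Assumption: an empty or whitespace-only value counts as "absent".
    token = env.get("NEMO_TOKEN", "").strip()
    if token:
        return "existing token: NEMO_TOKEN is set and would be sent with each request"
    return "anonymous token: NEMO_TOKEN is absent, so the skill would request one"


print(token_plan({"NEMO_TOKEN": "example-value"}))
print(token_plan({}))
```

Calling `token_plan()` with no argument reads the real process environment, which is the check to run before a first test with non-sensitive data.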
Persistence & Privilege
The skill is not always-enabled and does not request elevated platform privileges. It keeps session_id state for operations but does not declare any actions that modify other skills or global agent configuration.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install text-to-video-local-model
  3. After installation, invoke the skill by name or use /text-to-video-local-model
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
Initial release of text-to-video-local-model.
- Generate short AI-driven videos (up to 1080p MP4) from text prompts using a local model via a cloud backend.
- Supports TXT, JSON, CSV, and MD files up to 500MB.
- Handles uploads, prompt-based edits, exports, state checks, and credit balance through clear user commands.
- Automated setup, including anonymous token generation and session management.
- Cloud GPU processing delivers videos in 1–3 minutes per prompt.
- Comprehensive error handling and workflow tips for a smooth editing experience.
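The stated input limits (TXT, JSON, CSV, and MD files up to 500MB) can be checked locally before anything is uploaded. The sketch below is illustrative only: `check_input` is a hypothetical helper, and reading "500MB" as 500 × 1024 × 1024 bytes is an assumption, since the listing does not define the unit.

```python
from pathlib import PurePath

# Formats named in the listing.
SUPPORTED_SUFFIXES = {".txt", ".json", ".csv", ".md"}

# Assumption: "500MB" means 500 * 1024 * 1024 bytes; the listing does not say.
MAX_BYTES = 500 * 1024 * 1024


def check_input(filename: str, size_bytes: int) -> list[str]:
    """Return a list of problems; an empty list means the file fits the stated limits."""
    problems = []
    suffix = PurePath(filename).suffix.lower()
    if suffix not in SUPPORTED_SUFFIXES:
        problems.append(f"unsupported file type: {suffix or '(none)'}")
    if size_bytes > MAX_BYTES:
        problems.append(f"file is {size_bytes} bytes, over the {MAX_BYTES}-byte limit")
    return problems


print(check_input("storyboard.md", 12_000))  # []
print(check_input("clip.mov", 600 * 1024 * 1024))  # two problems reported
```

A local check like this only filters by name and size; it says nothing about whether the file's contents are safe to send to a third-party service.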
Metadata
Slug text-to-video-local-model
Version 1.0.0
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 1
Frequently Asked Questions

What is Text To Video Local Model?

Turn text prompts into AI-generated videos with this skill. Works with TXT, JSON, CSV, and MD files up to 500MB. Developers and AI enthusiasts use it for gen... It is an AI Agent Skill for Claude Code / OpenClaw, with 67 downloads so far.

How do I install Text To Video Local Model?

Run "/install text-to-video-local-model" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Text To Video Local Model free?

Yes, Text To Video Local Model is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Text To Video Local Model support?

Text To Video Local Model is cross-platform and runs anywhere OpenClaw / Claude Code is available.

Who created Text To Video Local Model?

It is built and maintained by whitejohnk-26 (@whitejohnk-26); the current version is v1.0.0.
