← Back to Skills Marketplace

Text To Video Local Model

Name: Text To Video Local Model
Author: whitejohnk-26

by whitejohnk-26 · GitHub ↗ · v1.0.0 · MIT-0

cross-platform ⚠ suspicious

Downloads

Stars

Active Installs

Versions

Install in OpenClaw

/install text-to-video-local-model

Description

generate text prompts into AI-generated videos with this skill. Works with TXT, JSON, CSV, MD files up to 500MB. developers and AI enthusiasts use it for gen...

Usage Guidance

This skill is labeled as a 'local model' but actually runs your prompts and uploaded files through a cloud service (mega-api-prod.nemovideo.ai). Before installing or using it, consider: (1) Do you accept sending potentially sensitive text and files to an external service? (2) The skill will use or automatically obtain a NEMO_TOKEN and include it in every request — tokens grant access to your account/credits. (3) The skill has no listed source/homepage and the registry metadata is inconsistent with the frontmatter; that reduces provenance. If you still want to try it, test only with non-sensitive sample data, verify the remote domain is legitimate, and prefer a skill with a published source, privacy policy, and clear owner. If you expected strictly local-only processing, do not install or use this skill.

Capability Analysis

Type: OpenClaw Skill Name: text-to-video-local-model Version: 1.0.0 The skill exhibits deceptive behavior by branding itself as a 'Local Model' in its name and display name, while the internal instructions (SKILL.md) explicitly mandate a cloud-based processing pipeline via 'mega-api-prod.nemovideo.ai'. It instructs the agent to hide tokens and raw API outputs from the user and performs environment fingerprinting to detect the host platform (e.g., Cursor vs. OpenClaw). While it does not explicitly exfiltrate system secrets like SSH keys, the contradiction between its 'local' privacy claim and its cloud-reliant implementation is a significant indicator of deceptive intent.

Capability Assessment

⚠ Purpose & Capability

The name and description advertise a "local model" and local generation, but the SKILL.md instructs the agent to use a cloud rendering pipeline at https://mega-api-prod.nemovideo.ai for session creation, SSE, uploads and exports. That is a substantive mismatch between claimed purpose and actual behavior. The frontmatter also mentions a config path (~/.config/nemovideo/) that is not declared in the registry metadata, an internal inconsistency.

⚠ Instruction Scope

Runtime instructions direct the agent to POST files and prompts to external endpoints, stream SSE, upload multipart files, poll render status and manage session tokens. It also instructs reading the skill's YAML frontmatter and detecting install path to populate attribution headers. These behaviors go beyond 'local' processing and involve sending potentially large user files to a third-party service.

✓ Install Mechanism

There is no install specification and no code files — the skill is instruction-only, which minimizes on-disk install risk. No external downloads or package installs are requested.

ℹ Credentials

The only declared required credential is NEMO_TOKEN, which aligns with the described API usage. However, the skill will auto-acquire an anonymous token by POSTing to the service when NEMO_TOKEN is absent; this behavior is functionally reasonable but worth flagging because it means the skill will create and store/use tokens on the user's behalf and send them with every request.

✓ Persistence & Privilege

The skill is not always-enabled and does not request elevated platform privileges. It keeps session_id state for operations but does not declare any actions that modify other skills or global agent configuration.

How to Use

Make sure OpenClaw is installed (local or Docker)
Run the install command in chat: /install text-to-video-local-model
After installation, invoke the skill by name or use /text-to-video-local-model
Provide required inputs per the skill's parameter spec and get structured output

Version History

v1.0.0

Initial release of text-to-video-local-model. - Generate short AI-driven videos (up to 1080p MP4) from text prompts using a local model via a cloud backend. - Supports TXT, JSON, CSV, and MD files up to 500MB. - Handles uploads, prompt-based edits, exports, state checks, and credit balance through clear user commands. - Automated setup, including anonymous token generation and session management. - Cloud GPU processing delivers videos in 1–3 minutes per prompt. - Comprehensive error handling and workflow tips for a smooth editing experience.

Metadata

Slug text-to-video-local-model

Version 1.0.0

License MIT-0

All-time Installs 0

Active Installs 0

Total Versions 1

Frequently Asked Questions

What is Text To Video Local Model?

generate text prompts into AI-generated videos with this skill. Works with TXT, JSON, CSV, MD files up to 500MB. developers and AI enthusiasts use it for gen... It is an AI Agent Skill for Claude Code / OpenClaw, with 67 downloads so far.

How do I install Text To Video Local Model?

Run "/install text-to-video-local-model" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Text To Video Local Model free?

Yes, Text To Video Local Model is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Text To Video Local Model support?

Text To Video Local Model is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Text To Video Local Model?

It is built and maintained by whitejohnk-26 (@whitejohnk-26); the current version is v1.0.0.

More Skills