ximasadila

Novita AI Multimodal

by bbear · GitHub ↗ · v0.2.0 · MIT-0
cross-platform ⚠ suspicious
Downloads: 270 · Stars: 1 · Active Installs: 1 · Versions: 1
Install in OpenClaw
/install novita-multimodal
Description
Execute multimodal tasks using Novita AI: text-to-image, image-to-image, text-to-video, image-to-video, TTS, STT. Use for: generating images, generating vide...
Usage Guidance
This skill appears to be a straightforward Novita AI integration, but the registry metadata omits the fact that it needs an API key. Before installing or using it:
  1. Do not paste your private API keys into chat messages. Prefer setting NOVITA_API_KEY in your environment or creating the ~/.novita/config.json file.
  2. Verify that you trust the skill source (homepage/source unknown), because the skill will send your API key to https://api.novita.ai to generate media.
  3. Be aware that the SKILL.md asks the agent to look for keys inside user messages (tokens starting with 'sk_'). Avoid sending keys in free text to prevent accidental leakage.
  4. Expect minor ambiguity in variable names ($API_KEY vs NOVITA_API_KEY/~/.novita/config.json). Ensure the agent or your environment maps them correctly.
If you need higher assurance, ask the author to update the registry metadata to declare the required credential and clarify the env/config variable names.
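The key lookup recommended above (environment variable first, then the config file) can be sketched as follows. This is a minimal illustration, not the skill's actual logic: the lookup order and the `api_key` field name inside ~/.novita/config.json are assumptions, since the listing does not show the file's schema, and `resolve_novita_key` is a hypothetical helper name.

```python
import json
import os
from pathlib import Path
from typing import Mapping, Optional


def resolve_novita_key(
    env: Optional[Mapping[str, str]] = None,
    config_path: Optional[Path] = None,
) -> Optional[str]:
    """Return the Novita API key from the environment or config file, else None."""
    env = os.environ if env is None else env
    if config_path is None:
        config_path = Path.home() / ".novita" / "config.json"

    # 1. Environment variable first (assumed precedence).
    key = env.get("NOVITA_API_KEY", "").strip()
    if key:
        return key

    # 2. Fall back to the config file; a missing or invalid file yields None.
    try:
        data = json.loads(config_path.read_text(encoding="utf-8"))
    except (OSError, json.JSONDecodeError):
        return None

    # "api_key" is a hypothetical field name, not confirmed by the listing.
    value = data.get("api_key") if isinstance(data, dict) else None
    if isinstance(value, str) and value.strip():
        return value.strip()
    return None
```

A single resolver like this is also one way to handle the naming ambiguity in item 4: whatever it returns can be passed on as the `$API_KEY` value that the SKILL.md examples expect, without the key ever appearing in a chat message.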
Capability Analysis
Type: OpenClaw Skill
Name: novita-multimodal
Version: 0.2.0
The skill provides a legitimate interface for Novita AI's multimodal services, including text-to-image, video generation, and speech processing. It follows standard configuration patterns by using environment variables or a local config file (~/.novita/config.json) and interacts exclusively with official endpoints (api.novita.ai). The instructions and execution flow are transparent and aligned with the stated purpose of the skill.
Capability Assessment
Purpose & Capability
The skill claims no required env vars/config paths/primary credential in registry metadata, yet the SKILL.md clearly expects an API key (via ~/.novita/config.json, NOVITA_API_KEY, or in-message 'sk_' token). The need for an API key is reasonable for a multimodal API integration, but the metadata omission is an inconsistency that could mislead users about what secrets the skill uses.
Instruction Scope
Runtime instructions explicitly direct the agent to: parse user messages for API keys (tokens starting with 'sk_'), read ~/.novita/config.json, or read NOVITA_API_KEY. Asking the agent to extract API keys from free-form user messages grants broad discretion and increases risk of accidental key exposure. Aside from that, the API endpoints and curl examples stay within the stated multimodal purpose.
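One way to limit the exposure described above is to redact anything that looks like a key before a message is logged or forwarded. The sketch below is illustrative only: the listing says keys start with 'sk_' but gives no further format, so the pattern (at least eight letters, digits, underscores, or hyphens after the prefix) is an assumption, and `redact_keys` is a hypothetical helper.

```python
import re

# Assumed key shape: "sk_" plus 8 or more letters, digits, "_" or "-".
# Only the "sk_" prefix comes from the review; the rest is a guess.
SK_TOKEN = re.compile(r"\bsk_[A-Za-z0-9_-]{8,}")


def redact_keys(text: str, placeholder: str = "[REDACTED_KEY]") -> str:
    """Replace every substring that looks like an sk_ key with a placeholder."""
    return SK_TOKEN.sub(placeholder, text)
```

The leading word boundary keeps ordinary words that merely contain "sk_" (for example "task_...") from being redacted, at the cost of possibly missing keys in unusual formats.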
Install Mechanism
Instruction-only skill with no install spec and no code files; nothing is written to disk by an installer. This is the lowest-risk install mechanism.
Credentials
The skill legitimately requires a single Novita API key to call external endpoints, which is proportional. However, the registry declares no primary credential or required env/config path, while the SKILL.md requires ~/.novita/config.json, NOVITA_API_KEY, or an in-message key. This metadata mismatch is problematic and reduces transparency. The SKILL.md also uses $API_KEY in examples while recommending NOVITA_API_KEY in setup, which is ambiguous.
Persistence & Privilege
The skill does not request always: true and does not declare system-wide modifications. It can be invoked autonomously (platform default), which is normal; no elevated persistence or cross-skill config changes are requested.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install novita-multimodal
  3. After installation, invoke the skill by name or use /novita-multimodal
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v0.2.0
- Major update: Expanded documentation and clarified configuration and execution flow for multimodal Novita AI tasks.
- Added detailed step-by-step guides for setup via config file, environment variable, or direct parameter.
- Introduced clear API key reading logic and user-facing configuration guidance.
- Specified required progress prompts and response templates for each task type (image, video, TTS, STT).
- Included sample API requests for all supported endpoints and detailed polling instructions for video generation.
- Enhanced error handling documentation and added links to official pricing and key management pages.
Metadata
Slug novita-multimodal
Version 0.2.0
License MIT-0
All-time Installs 1
Active Installs 1
Total Versions 1
Frequently Asked Questions

What is Novita AI Multimodal?

Execute multimodal tasks using Novita AI: text-to-image, image-to-image, text-to-video, image-to-video, TTS, STT. Use for: generating images, generating vide... It is an AI Agent Skill for Claude Code / OpenClaw, with 270 downloads so far.

How do I install Novita AI Multimodal?

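Run "/install novita-multimodal" in the OpenClaw or Claude Code chat to install it in one step. Installation itself needs no extra setup, but using the skill requires a Novita API key, supplied via NOVITA_API_KEY or ~/.novita/config.json.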

Is Novita AI Multimodal free?

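Yes, the skill itself is free and licensed under MIT-0, so you can download, install, and use it at no cost. It calls the Novita AI API with your own API key, and that API usage may be billed separately; see Novita's official pricing page.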

Which platforms does Novita AI Multimodal support?

Novita AI Multimodal is cross-platform and runs anywhere OpenClaw / Claude Code is available.

Who created Novita AI Multimodal?

It is built and maintained by bbear (@ximasadila); the current version is v0.2.0.
