← Back to Skills Marketplace

Novita AI Multimodal

Name: Novita AI Multimodal
Author: ximasadila

by bbear · GitHub ↗ · v0.2.0 · MIT-0

cross-platform ⚠ suspicious

270

Downloads

Stars

Active Installs

Versions

Install in OpenClaw

/install novita-multimodal

Description

Execute multimodal tasks using Novita AI: text-to-image, image-to-image, text-to-video, image-to-video, TTS, STT. Use for: generating images, generating vide...

Usage Guidance

This skill appears to be a straightforward Novita AI integration, but the registry metadata omits the fact that it needs an API key. Before installing or using it: (1) do not paste your private API keys into chat messages—prefer setting NOVITA_API_KEY in your environment or creating the ~/.novita/config.json file; (2) verify you trust the skill source (homepage/source unknown) because the skill will send your API key to https://api.novita.ai to generate media; (3) be aware the SKILL.md asks the agent to look for keys inside user messages (tokens starting with 'sk_')—avoid sending keys in free text to prevent accidental leakage; and (4) expect minor ambiguity in variable names ($API_KEY vs NOVITA_API_KEY/~/.novita/config.json) — ensure the agent or your environment maps them correctly. If you need higher assurance, ask the author to update the registry metadata to declare the required credential and clarify the env/config variable names.

Capability Analysis

Type: OpenClaw Skill Name: novita-multimodal Version: 0.2.0 The skill provides a legitimate interface for Novita AI's multimodal services, including text-to-image, video generation, and speech processing. It follows standard configuration patterns by using environment variables or a local config file (~/.novita/config.json) and interacts exclusively with official endpoints (api.novita.ai). The instructions and execution flow are transparent and aligned with the stated purpose of the skill.

Capability Assessment

⚠ Purpose & Capability

The skill claims no required env vars/config paths/primary credential in registry metadata, yet the SKILL.md clearly expects an API key (via ~/.novita/config.json, NOVITA_API_KEY, or in-message 'sk_' token). The need for an API key is reasonable for a multimodal API integration, but the metadata omission is an inconsistency that could mislead users about what secrets the skill uses.

⚠ Instruction Scope

Runtime instructions explicitly direct the agent to: parse user messages for API keys (tokens starting with 'sk_'), read ~/.novita/config.json, or read NOVITA_API_KEY. Asking the agent to extract API keys from free-form user messages grants broad discretion and increases risk of accidental key exposure. Aside from that, the API endpoints and curl examples stay within the stated multimodal purpose.

✓ Install Mechanism

Instruction-only skill with no install spec and no code files; nothing is written to disk by an installer. This is the lowest-risk install mechanism.

⚠ Credentials

The skill legitimately requires a single Novita API key to call external endpoints, which is proportional. However the registry declares no primary credential or required env/config path while the SKILL.md requires ~/.novita/config.json or NOVITA_API_KEY or an in-message key—this metadata mismatch is problematic and reduces transparency. Also the SKILL.md uses $API_KEY in examples while recommending NOVITA_API_KEY in setup, which is ambiguous.

✓ Persistence & Privilege

The skill does not request always: true and does not declare system-wide modifications. It can be invoked autonomously (platform default), which is normal; no elevated persistence or cross-skill config changes are requested.

How to Use

Make sure OpenClaw is installed (local or Docker)
Run the install command in chat: /install novita-multimodal
After installation, invoke the skill by name or use /novita-multimodal
Provide required inputs per the skill's parameter spec and get structured output

Version History

v0.2.0

- Major update: Expanded documentation and clarified configuration and execution flow for multimodal Novita AI tasks. - Added detailed step-by-step guides for setup via config file, environment variable, or direct parameter. - Introduced clear API key reading logic and user-facing configuration guidance. - Specified required progress prompts and response templates for each task type (image, video, TTS, STT). - Included sample API requests for all supported endpoints and detailed polling instructions for video generation. - Enhanced error handling documentation and added links to official pricing and key management pages.

Metadata

Slug novita-multimodal

Version 0.2.0

License MIT-0

All-time Installs 1

Active Installs 1

Total Versions 1

Frequently Asked Questions

What is Novita AI Multimodal?

Execute multimodal tasks using Novita AI: text-to-image, image-to-image, text-to-video, image-to-video, TTS, STT. Use for: generating images, generating vide... It is an AI Agent Skill for Claude Code / OpenClaw, with 270 downloads so far.

How do I install Novita AI Multimodal?

Run "/install novita-multimodal" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Novita AI Multimodal free?

Yes, Novita AI Multimodal is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Novita AI Multimodal support?

Novita AI Multimodal is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Novita AI Multimodal?

It is built and maintained by bbear (@ximasadila); the current version is v0.2.0.

More Skills