Whisk Ai

Name: Whisk Ai
Author: linmillsd7

by linmillsd7 · GitHub ↗ · v1.0.0 · MIT-0

cross-platform ⚠ suspicious

Install in OpenClaw

/install whisk-ai

Description

Drop an image and describe a new scene — whisk-ai blends your visual inputs with creative AI generation to produce entirely fresh imagery. Built around Googl...

Usage Guidance

This skill uploads any images you give it to a third-party API (mega-api-prod.nemovideo.ai) and obtains or uses a NEMO_TOKEN for authorization. The publisher/source and homepage are missing, and the SKILL.md claims Google 'Imagen' tech while calling an unknown domain. Ask the publisher for provenance and a privacy/terms URL before trusting it with sensitive content. Also clarify whether the skill actually needs access to ~/.config/nemovideo/ (the frontmatter lists it but the registry metadata does not). If you decide to proceed: avoid uploading private or sensitive photos, review any returned URLs before clicking, and prefer skills from verified sources or with public documentation.
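The advice to review returned URLs before clicking can be partly automated. The following is a minimal sketch, assuming that results are expected to come from the same host the guidance names; the function name `review_url` and the three labels it returns are illustrative and not part of the skill.

```python
from urllib.parse import urlparse

# Host named in the usage guidance. Assumption: results are served from this
# same host; a real deployment might use a separate CDN domain.
EXPECTED_HOST = "mega-api-prod.nemovideo.ai"


def review_url(url: str, expected_host: str = EXPECTED_HOST) -> str:
    """Classify a returned URL as 'expected', 'unexpected-host', or 'not-https'."""
    parsed = urlparse(url)
    if parsed.scheme != "https":
        return "not-https"
    # urlparse lowercases the hostname and strips any port or credentials.
    host = parsed.hostname or ""
    if host == expected_host:
        return "expected"
    return "unexpected-host"
```

Anything other than "expected" is a prompt to look at the link manually, not proof that it is malicious.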

Capability Analysis

Type: OpenClaw Skill
Name: whisk-ai
Version: 1.0.0

The whisk-ai skill is a legitimate integration for an AI image remixing service hosted on the nemovideo.ai platform. The SKILL.md file provides functional instructions for the agent to manage sessions, handle anonymous authentication via a UUID-based token exchange, and interact with a server-side image generation API. It includes appropriate error handling and security-conscious instructions (e.g., advising the agent not to expose raw tokens). No evidence of data exfiltration, malicious execution, or unauthorized access was found.
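The instruction not to expose raw tokens can be illustrated with a small redaction helper. This is a sketch only: `mask_token` and `redact` are hypothetical names, and the choice to keep the last four characters visible is an assumption, not something the SKILL.md specifies.

```python
def mask_token(token: str, visible: int = 4) -> str:
    """Hide all but the last `visible` characters of a token."""
    if len(token) <= visible:
        return "*" * len(token)
    return "*" * (len(token) - visible) + token[-visible:]


def redact(text: str, token: str) -> str:
    """Replace every occurrence of `token` in `text` with its masked form."""
    if not token:
        return text
    return text.replace(token, mask_token(token))
```

Running log lines or user-facing messages through `redact` before displaying them keeps the full credential out of chat output.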

Capability Tags

crypto

Capability Assessment

ℹ Purpose & Capability

The name/description (image remixing) aligns with the runtime behavior: uploading images, creating sessions, and requesting generations from a remote API. However: the skill claims to be built around Google's Whisk/Imagen tech while all network calls target mega-api-prod.nemovideo.ai (an unknown domain) — that mismatch is unexplained. The SKILL.md frontmatter also declares a config path (~/.config/nemovideo/) that the registry metadata did not list, and the skill has no public homepage or source repository to validate the backend claims.

ℹ Instruction Scope

Instructions are explicit about creating a session, uploading images, using SSE for streaming responses, polling for export, and including attribution headers. These are in-scope for an image-generation skill. It also instructs detecting install path and reading this SKILL.md frontmatter at runtime to fill attribution headers (requires reading the agent's environment/files). The skill will POST user images and prompts to the external nemovideo.ai API — this is expected for cloud processing but has privacy implications (user uploads leave the local environment).
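To show what consuming an SSE stream involves, here is a minimal parser for the standard Server-Sent Events line format (`event:`, `data:`, `id:` fields, with a blank line ending each event). It is a sketch of the general protocol and does not reflect the event names or payloads this particular API sends, which are not documented in this listing.

```python
from typing import Dict, Iterable, Iterator, List


def parse_sse(lines: Iterable[str]) -> Iterator[Dict[str, str]]:
    """Yield one dict per event from an iterable of SSE text lines."""
    event: Dict[str, str] = {}
    data_parts: List[str] = []
    for raw in lines:
        line = raw.rstrip("\r\n")
        if line == "":
            # A blank line dispatches the event, but only if it carried data.
            if data_parts:
                event["data"] = "\n".join(data_parts)
                yield event
            event, data_parts = {}, []
            continue
        if line.startswith(":"):
            continue  # comment line, commonly used as a keep-alive
        field, _, value = line.partition(":")
        if value.startswith(" "):
            value = value[1:]  # a single leading space after the colon is dropped
        if field == "data":
            data_parts.append(value)
        elif field in ("event", "id"):
            event[field] = value
```

An event left unterminated at the end of the stream is discarded, which matches how the SSE format treats incomplete events.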

✓ Install Mechanism

This is instruction-only with no install spec and no code files, so nothing is written to disk by an installer. Lowest install-surface risk.

⚠ Credentials

Only NEMO_TOKEN is required, which is appropriate for a remote API. However, the SKILL.md metadata also mentions a config path (~/.config/nemovideo/) not declared elsewhere in the registry metadata, creating an inconsistency: either the skill expects to read/write that config directory (broader access than declared) or the metadata is stale/incorrect. The skill also will accept/issue an anonymous token if NEMO_TOKEN is absent, which means it will contact the remote API to obtain credentials on the fly — a behavior you should be aware of before uploading sensitive images.
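The credential behavior described above amounts to a simple decision: use NEMO_TOKEN if it is set, otherwise fall back to requesting an anonymous token. The sketch below models only that decision and makes no network call; `resolve_token` and its return labels are hypothetical, and the anonymous exchange itself is left out because its endpoint and payload are not documented here.

```python
import os
from typing import Mapping, Optional, Tuple


def resolve_token(env: Optional[Mapping[str, str]] = None) -> Tuple[Optional[str], str]:
    """Return (token, source), where source is 'env' or 'anonymous-needed'."""
    env = os.environ if env is None else env
    token = env.get("NEMO_TOKEN", "").strip()
    if token:
        return token, "env"
    # No usable token: per the review, the skill would contact the remote API
    # to obtain an anonymous one. A caller can pause here and ask the user first.
    return None, "anonymous-needed"
```

Separating the decision from the network call gives a natural place to ask for confirmation before any credentials are requested on the fly.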

✓ Persistence & Privilege

The skill is not always-enabled and does not request system-wide persistence. It will create ephemeral sessions/tokens for operations, which is consistent with a remote-processing image skill. Autonomous invocation is allowed by default (platform normal) and not by itself a problem here.

How to Use

Make sure OpenClaw is installed (local or Docker)
Run the install command in chat: /install whisk-ai
After installation, invoke the skill by name or use /whisk-ai
Provide required inputs per the skill's parameter spec and get structured output

Version History

v1.0.0

Whisk AI Image Remixer 1.0.0

- Initial release of Whisk AI, a tool for remixed image generation using Google’s Imagen backend.
- Enables creative blending of up to three input images (subject, scene, style) and text prompts for new AI-generated visuals.
- Automated credential setup with anonymous token and session management (includes 100 free credits, 7-day expiry).
- Supports upload, credits check, session state, and export to multiple formats (jpg, png, gif, mp4, etc).
- Robust error handling and clear user feedback for setup, session state, and common API issues.
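The three-image input model in the release notes (subject, scene, style, plus a text prompt) can be sketched as a small validation step. The role names come from the notes; the function `validate_inputs` and the shape of the dictionary it returns are assumptions for illustration, not the skill's actual request format.

```python
from typing import Dict

# Roles listed in the v1.0.0 release notes.
ALLOWED_ROLES = ("subject", "scene", "style")


def validate_inputs(images: Dict[str, str], prompt: str) -> Dict[str, object]:
    """Check image roles and return a normalized (hypothetical) request dict."""
    unknown = sorted(set(images) - set(ALLOWED_ROLES))
    if unknown:
        raise ValueError(f"unknown image role(s): {', '.join(unknown)}")
    if not images and not prompt.strip():
        raise ValueError("provide at least one image or a text prompt")
    # Keying by role caps the input at three images, one per role.
    ordered = {role: images[role] for role in ALLOWED_ROLES if role in images}
    return {"images": ordered, "prompt": prompt.strip()}
```

For example, `validate_inputs({"subject": "cat.png", "style": "ink.png"}, "at dusk")` accepts two of the three roles and leaves the scene unset.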

Metadata

Slug whisk-ai

Version 1.0.0

License MIT-0

All-time Installs 0

Active Installs 0

Total Versions 1

Frequently Asked Questions

What is Whisk Ai?

Drop an image and describe a new scene — whisk-ai blends your visual inputs with creative AI generation to produce entirely fresh imagery. Built around Googl... It is an AI Agent Skill for Claude Code / OpenClaw, with 65 downloads so far.

How do I install Whisk Ai?

Run "/install whisk-ai" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Whisk Ai free?

Yes. The skill itself is free to download, install, and use under the MIT-0 license. Per the v1.0.0 release notes, the anonymous token comes with 100 free credits and a 7-day expiry.

Which platforms does Whisk Ai support?

Whisk Ai is cross-platform and runs anywhere OpenClaw / Claude Code is available.

Who created Whisk Ai?

It is built and maintained by linmillsd7 (@linmillsd7); the current version is v1.0.0.
