← Back to Skills Marketplace
Whisk Ai
by
linmillsd7
· GitHub ↗
· v1.0.0
· MIT-0
65
Downloads
0
Stars
0
Active Installs
1
Versions
Install in OpenClaw
/install whisk-ai
Description
Drop an image and describe a new scene — whisk-ai blends your visual inputs with creative AI generation to produce entirely fresh imagery. Built around Googl...
Usage Guidance
This skill will upload any images you give it to a third-party API (mega-api-prod.nemovideo.ai) and will obtain or use a NEMO_TOKEN for authorization. The publisher/source and homepage are missing, and the SKILL.md claims Google 'Imagen' tech while calling an unknown domain — ask the publisher for provenance and a privacy/terms URL before trusting sensitive content. Also clarify whether the skill actually needs access to ~/.config/nemovideo/ (the frontmatter lists it but the registry metadata did not). If you decide to proceed: avoid uploading private or sensitive photos, review any returned URLs before clicking, and prefer skills from verified sources or with public documentation.
Capability Analysis
Type: OpenClaw Skill
Name: whisk-ai
Version: 1.0.0
The whisk-ai skill is a legitimate integration for an AI image remixing service hosted on the nemovideo.ai platform. The SKILL.md file provides functional instructions for the agent to manage sessions, handle anonymous authentication via a UUID-based token exchange, and interact with a server-side image generation API. It includes appropriate error handling and security-conscious instructions (e.g., advising the agent not to expose raw tokens). No evidence of data exfiltration, malicious execution, or unauthorized access was found.
Capability Tags
Capability Assessment
Purpose & Capability
The name/description (image remixing) aligns with the runtime behavior: uploading images, creating sessions, and requesting generations from a remote API. However: the skill claims to be built around Google's Whisk/Imagen tech while all network calls target mega-api-prod.nemovideo.ai (an unknown domain) — that mismatch is unexplained. The SKILL.md frontmatter also declares a config path (~/.config/nemovideo/) that the registry metadata did not list, and the skill has no public homepage or source repository to validate the backend claims.
Instruction Scope
Instructions are explicit about creating a session, uploading images, using SSE for streaming responses, polling for export, and including attribution headers. These are in-scope for an image-generation skill. It also instructs detecting install path and reading this SKILL.md frontmatter at runtime to fill attribution headers (requires reading the agent's environment/files). The skill will POST user images and prompts to the external nemovideo.ai API — this is expected for cloud processing but has privacy implications (user uploads leave the local environment).
Install Mechanism
This is instruction-only with no install spec and no code files, so nothing is written to disk by an installer. Lowest install-surface risk.
Credentials
Only NEMO_TOKEN is required, which is appropriate for a remote API. However, the SKILL.md metadata also mentions a config path (~/.config/nemovideo/) not declared elsewhere in the registry metadata, creating an inconsistency: either the skill expects to read/write that config directory (broader access than declared) or the metadata is stale/incorrect. The skill also will accept/issue an anonymous token if NEMO_TOKEN is absent, which means it will contact the remote API to obtain credentials on the fly — a behavior you should be aware of before uploading sensitive images.
Persistence & Privilege
The skill is not always-enabled and does not request system-wide persistence. It will create ephemeral sessions/tokens for operations, which is consistent with a remote-processing image skill. Autonomous invocation is allowed by default (platform normal) and not by itself a problem here.
How to Use
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat:
/install whisk-ai - After installation, invoke the skill by name or use
/whisk-ai - Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
Whisk AI Image Remixer 1.0.0
- Initial release of Whisk AI, a tool for remixed image generation using Google’s Imagen backend.
- Enables creative blending of up to three input images (subject, scene, style) and text prompts for new AI-generated visuals.
- Automated credential setup with anonymous token and session management (includes 100 free credits, 7-day expiry).
- Supports upload, credits check, session state, and export to multiple formats (jpg, png, gif, mp4, etc).
- Robust error handling and clear user feedback for setup, session state, and common API issues.
Metadata
Frequently Asked Questions
What is Whisk Ai?
Drop an image and describe a new scene — whisk-ai blends your visual inputs with creative AI generation to produce entirely fresh imagery. Built around Googl... It is an AI Agent Skill for Claude Code / OpenClaw, with 65 downloads so far.
How do I install Whisk Ai?
Run "/install whisk-ai" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.
Is Whisk Ai free?
Yes, Whisk Ai is completely free, licensed under MIT-0. You can download, install and use it at no cost.
Which platforms does Whisk Ai support?
Whisk Ai is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).
Who created Whisk Ai?
It is built and maintained by linmillsd7 (@linmillsd7); the current version is v1.0.0.
More Skills