rdeangel

Gemini Image Remix

by Rocco De Angelis · GitHub ↗ · v1.0.0
cross-platform · ✓ Security Clean
Downloads: 1593 · Stars: 2 · Active Installs: 15 · Versions: 1
Install in OpenClaw
/install gemini-image-remix
Description
Generate or remix images using Gemini models with text prompts and multiple input images, supporting various styles, resolutions, and advanced model options.
README (SKILL.md)

Gemini Image Remix

A versatile tool for text-to-image generation and complex image-to-image remixing. By default, it uses Gemini 2.5 Flash Image (gemini-2.5-flash-image) for fast, high-quality results. It also supports the flagship Gemini 3 Pro Image model (Nano Banana Pro, gemini-3-pro-image-preview) for advanced artistic tasks.

Generate Image

Create stunning visuals from a text prompt.

uv run {baseDir}/scripts/remix.py --prompt "a cybernetic owl in a neon forest" --filename "owl.png"

Remix/Modify Image

Use one or more reference images to guide the generation. Perfect for style transfers, background changes, or character modifications.

uv run {baseDir}/scripts/remix.py --prompt "change the art style to a pencil sketch" --filename "sketch.png" -i "original.png"

Multi-image Composition

Combine elements from up to 14 different images into a single cohesive scene.

uv run {baseDir}/scripts/remix.py --prompt "place the character from image 1 into the environment of image 2" --filename "result.png" -i "character.png" -i "env.png"
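When the command is assembled from code rather than typed by hand, each reference image gets its own -i flag. The following is a minimal sketch of that pattern. The helper name `build_remix_command` and the script path are hypothetical and not part of the skill; the 14-image cap is the limit stated in this README.

```python
from typing import List, Sequence

MAX_INPUT_IMAGES = 14  # limit stated in this README


def build_remix_command(script: str, prompt: str, filename: str,
                        images: Sequence[str] = ()) -> List[str]:
    """Build the argv list for `uv run <script> ...` with one -i per image.

    Hypothetical helper for illustration; it is not shipped with the skill.
    """
    if len(images) > MAX_INPUT_IMAGES:
        raise ValueError(f"at most {MAX_INPUT_IMAGES} input images are supported")
    cmd = ["uv", "run", script, "--prompt", prompt, "--filename", filename]
    for path in images:
        cmd += ["-i", path]  # the flag is repeated once per image
    return cmd


if __name__ == "__main__":
    print(build_remix_command(
        "scripts/remix.py",  # placeholder path; substitute your {baseDir}
        "place the character from image 1 into the environment of image 2",
        "result.png",
        ["character.png", "env.png"],
    ))
```

Passing the resulting list to `subprocess.run` (without `shell=True`) avoids quoting problems with prompts that contain spaces or quotes.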

Advanced Model Selection

Switch to advanced models like Nano Banana Pro for high-fidelity work.

uv run {baseDir}/scripts/remix.py --model "gemini-3-pro-image-preview" --prompt "highly detailed oil painting of a dragon" --filename "dragon.png"

Options

  • --prompt, -p: Image description or specific edit instructions.
  • --filename, -f: The output path for the generated PNG.
  • --input-image, -i: Path to an input image (repeatable up to 14 times).
  • --resolution, -r: 1K (default), 2K, or 4K.
  • --aspect-ratio, -a: Output aspect ratio (e.g., 1:1, 16:9, 9:16, 4:3, 3:4).
  • --model, -m: Model to use (defaults to gemini-2.5-flash-image). Supported: gemini-2.5-flash-image, gemini-3-pro-image-preview.
  • --api-key, -k: Gemini API key (defaults to GEMINI_API_KEY env var).
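The options above map naturally onto an argparse parser. The sketch below is illustrative only and is not the actual remix.py: marking --prompt and --filename as required, the `input_images` attribute name, and restricting --model to the two listed IDs are assumptions made for the example.

```python
import argparse

# Model IDs listed under --model in this README.
SUPPORTED_MODELS = ["gemini-2.5-flash-image", "gemini-3-pro-image-preview"]


def build_parser() -> argparse.ArgumentParser:
    """Return a parser mirroring the documented flags (illustrative sketch)."""
    parser = argparse.ArgumentParser(description="Generate or remix images with Gemini")
    # Assumption: prompt and output filename are mandatory.
    parser.add_argument("--prompt", "-p", required=True,
                        help="Image description or edit instructions")
    parser.add_argument("--filename", "-f", required=True,
                        help="Output path for the generated PNG")
    # action="append" collects every -i occurrence into one list.
    parser.add_argument("--input-image", "-i", action="append", default=[],
                        dest="input_images", help="Input image path (repeatable)")
    parser.add_argument("--resolution", "-r", choices=["1K", "2K", "4K"], default="1K")
    parser.add_argument("--aspect-ratio", "-a", default=None,
                        help="Output aspect ratio, e.g. 16:9")
    parser.add_argument("--model", "-m", choices=SUPPORTED_MODELS,
                        default=SUPPORTED_MODELS[0])
    parser.add_argument("--api-key", "-k", default=None,
                        help="Gemini API key (falls back to GEMINI_API_KEY)")
    return parser


if __name__ == "__main__":
    args = build_parser().parse_args(
        ["-p", "pencil sketch", "-f", "sketch.png", "-i", "original.png"]
    )
    print(args.model, args.resolution, args.input_images)
```

Using `choices` for --resolution and --model lets argparse reject unsupported values before any API request is made.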
Usage Guidance
This skill will read any local image files you pass to it and send those images plus your text prompt to the Gemini API using the GEMINI_API_KEY. Before installing or running it:
  1. Confirm you trust the Gemini API endpoint/service for handling the images and prompts (private or sensitive images may be exposed).
  2. Provide a scoped or replaceable API key (rotate or revoke if needed).
  3. Install the Python dependencies (google-genai, pillow) from official PyPI packages in a controlled environment.
  4. Verify the 'uv' brew formula source on your system if you do not already have 'uv'.
If you need the script to avoid uploading certain files, do not pass those files as input. Overall the package is consistent with its stated purpose; the main risks are privacy of uploaded images and managing the API key.
Capability Analysis
Type: OpenClaw Skill
Name: gemini-image-remix
Version: 1.0.0
The skill bundle is designed for image generation and remixing using the Gemini API. All operations, including dependency installation via `uv` and `brew`, API key handling, and image processing, are standard and align with the stated purpose. The `SKILL.md` provides clear instructions without any prompt injection attempts against the agent. The `remix.py` script uses `argparse` for robust argument handling, preventing shell injection from user input, and interacts only with the Google GenAI library for its core functionality. There is no evidence of data exfiltration, unauthorized network activity, persistence mechanisms, or other malicious intent. While `Pillow` (used for image processing) could theoretically have vulnerabilities exploitable by malicious input images, this is a general software vulnerability and not indicative of malicious intent within this skill.
Capability Assessment
Purpose & Capability
Name/description match the code and runtime instructions. Requiring GEMINI_API_KEY and the 'uv' runner is appropriate for a script that calls the Gemini API via google-genai and is executed with 'uv run'.
Instruction Scope
SKILL.md and the script instruct running the local Python script and passing local image paths and a prompt. The script reads only the input image files you explicitly provide, the prompt, and the GEMINI_API_KEY; it sends those to the Gemini API. It does not attempt to read unrelated system files or other environment variables. Note: transmitting local images and prompts to the Gemini service is expected behavior but may have privacy implications.
Install Mechanism
Install spec uses a Homebrew formula 'uv' (traceable and low risk). The Python dependencies (google-genai, pillow) are declared in a header comment in the script but not installed by the SKILL.md install step — you'll need to install them in your Python environment (pip). This is not malicious but is a potential operational gap to be aware of.
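Given that gap, a pre-flight check can confirm the two packages are importable before the script is run. This is a minimal sketch: the helper name `missing_modules` is hypothetical, and `google.genai` and `PIL` are the import names that the google-genai and pillow distributions normally expose.

```python
import importlib.util
from typing import Iterable, List


def missing_modules(names: Iterable[str]) -> List[str]:
    """Return the module names from `names` that cannot be located."""
    missing = []
    for name in names:
        try:
            found = importlib.util.find_spec(name) is not None
        except (ImportError, ValueError):
            # Raised when a parent package (e.g. "google") is absent
            # or a module has no usable import spec.
            found = False
        if not found:
            missing.append(name)
    return missing


if __name__ == "__main__":
    # Import names for the google-genai and pillow packages.
    absent = missing_modules(["google.genai", "PIL"])
    if absent:
        print("Missing Python modules:", ", ".join(absent))
    else:
        print("All required modules are importable.")
```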
Credentials
Only GEMINI_API_KEY is required and designated as the primary credential, which is proportional to the skill's purpose. No other secrets or unrelated environment/config paths are requested.
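The documented precedence (an explicit --api-key value first, otherwise the GEMINI_API_KEY environment variable) can be sketched as follows. The function name `resolve_api_key` and the error message are hypothetical; the actual script may handle a missing key differently.

```python
import os
from typing import Mapping, Optional


def resolve_api_key(cli_value: Optional[str] = None,
                    env: Optional[Mapping[str, str]] = None) -> str:
    """Pick the API key: the CLI value wins, else GEMINI_API_KEY from env."""
    env = os.environ if env is None else env
    key = cli_value or env.get("GEMINI_API_KEY")
    if not key:
        # Fail early instead of sending an unauthenticated request.
        raise ValueError("No API key: pass --api-key or set GEMINI_API_KEY")
    return key


if __name__ == "__main__":
    # Demo with an explicit mapping so no real key is read or printed.
    print(resolve_api_key(None, {"GEMINI_API_KEY": "example-key"}))
```

Keeping the key in the environment rather than on the command line keeps it out of shell history, which makes rotation or revocation simpler.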
Persistence & Privilege
The skill is not force-included (always: false) and does not request elevated or persistent system privileges. It does not modify other skills or system-wide agent settings.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install gemini-image-remix
  3. After installation, invoke the skill by name or use /gemini-image-remix
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
Gemini Image Remix 1.0.0 initial release:
  • Powerful text-to-image and image-to-image generation with Gemini models.
  • Default support for Gemini 2.5 Flash Image; option to use Gemini 3 Pro Image (Nano Banana Pro).
  • Allows remixing and modification of up to 14 images in a single composition.
  • Versatile options for resolution, aspect ratio, and advanced controls.
  • Simple command-line usage with detailed prompts and model selection.
Metadata
Slug gemini-image-remix
Version 1.0.0
License
All-time Installs 16
Active Installs 15
Total Versions 1
Frequently Asked Questions

What is Gemini Image Remix?

Generate or remix images using Gemini models with text prompts and multiple input images, supporting various styles, resolutions, and advanced model options. It is an AI Agent Skill for Claude Code / OpenClaw, with 1593 downloads so far.

How do I install Gemini Image Remix?

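Run "/install gemini-image-remix" in the OpenClaw or Claude Code chat to install it. Before first use, set the GEMINI_API_KEY environment variable and make sure uv and the Python dependencies (google-genai, pillow) are available.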

Is Gemini Image Remix free?

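Yes, Gemini Image Remix itself is free and open-source: you can download, install, and use it at no cost. Requests it sends to the Gemini API use your own GEMINI_API_KEY and may be subject to Google's pricing and quotas.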

Which platforms does Gemini Image Remix support?

Gemini Image Remix is cross-platform and runs anywhere OpenClaw / Claude Code is available.

Who created Gemini Image Remix?

It is built and maintained by Rocco De Angelis (@rdeangel); the current version is v1.0.0.
