/install gemini-image-proxy
Gemini Image Simple
Generate and edit images using Gemini 3 Pro Image via the OpenAI Python SDK and an OpenAI-compatible API endpoint.
Why This Skill
| Feature | This Skill | Others (nano-banana-pro, etc.) |
|---|---|---|
| Dependencies | openai (SDK) | google-genai, pillow, etc. |
| Requires pip/uv | ✅ Yes | ✅ Yes |
| Works on Fly.io free | ✅ Yes (with pip) | ❌ Fails |
| Works in containers | ✅ Yes (with pip) | ❌ Often fails |
| Image generation | ✅ Full | ✅ Full |
| Image editing | ✅ Yes | ✅ Yes |
| Setup complexity | Install SDK + set API key | Install packages first |
Bottom line: This skill uses the OpenAI SDK, so you must install openai once with pip.
Install
python3 -m pip install openai
Quick Start
# Set env
export GOOGLE_PROXY_API_KEY="your_api_key"
export GOOGLE_PROXY_BASE_URL="https://example.com/v1"
# Generate
python3 /data/clawd/skills/gemini-image-simple/scripts/generate.py "A cat wearing a tiny hat" cat.png
# Edit existing image
python3 /data/clawd/skills/gemini-image-simple/scripts/generate.py "Make it sunset lighting" edited.png --input original.png
Usage
Generate new image
python3 {baseDir}/scripts/generate.py "your prompt" output.png
Edit existing image
python3 {baseDir}/scripts/generate.py "edit instructions" output.png --input source.png
Supported input formats: PNG, JPG, JPEG, GIF, WEBP
Environment
Set these environment variables:
GOOGLE_PROXY_API_KEY(your API key)GOOGLE_PROXY_BASE_URL(OpenAI-compatible base URL, e.g. https://example.com/v1)
How It Works
Uses Gemini 3 Pro Image (gemini-3-pro-image) via the OpenAI Python SDK:
client.images.generate(...)for new imagesclient.images.edits(...)for edits- Requires the
openaipackage
That's it. Works on any Python 3.10+ installation with openai installed.
Model
Currently using: gemini-3-pro-image
Other available models (can be changed in generate.py if needed):
gemini-3-pro-image-preview- Preview variantimagen-4.0-ultra-generate-001- Imagen 4.0 Ultraimagen-4.0-generate-001- Imagen 4.0gemini-2.5-flash-image- Gemini 2.5 Flash with image gen
Examples
# Landscape
python3 {baseDir}/scripts/generate.py "Misty mountains at sunrise, photorealistic" mountains.png
# Product shot
python3 {baseDir}/scripts/generate.py "Minimalist product photo of a coffee cup, white background" coffee.png
# Edit: change style
python3 {baseDir}/scripts/generate.py "Convert to watercolor painting style" watercolor.png --input photo.jpg
# Edit: add element
python3 {baseDir}/scripts/generate.py "Add a rainbow in the sky" rainbow.png --input landscape.png
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat:
/install gemini-image-proxy - After installation, invoke the skill by name or use
/gemini-image-proxy - Provide required inputs per the skill's parameter spec and get structured output
What is Gemini Image Proxy?
Generate and edit images with Gemini API using the OpenAI Python SDK. It is an AI Agent Skill for Claude Code / OpenClaw, with 1727 downloads so far.
How do I install Gemini Image Proxy?
Run "/install gemini-image-proxy" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.
Is Gemini Image Proxy free?
Yes, Gemini Image Proxy is completely free (open-source). You can download, install and use it at no cost.
Which platforms does Gemini Image Proxy support?
Gemini Image Proxy is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).
Who created Gemini Image Proxy?
It is built and maintained by YspCoder (@yspcoder); the current version is v1.0.0.