← Back to Skills Marketplace
wangyan

Gemini Image Generator

by WangYan · GitHub ↗ · v1.0.2 · MIT-0
cross-platform ⚠ suspicious
321
Downloads
1
Stars
1
Active Installs
3
Versions
Install in OpenClaw
/install wangyan-gemini-image-gen
Description
Generate, edit, and compose images using Gemini models. Activate when user asks to generate images, draw, create logos/posters/icons/banners, edit/modify pho...
Usage Guidance
This skill likely does what it says — it sends image-generation requests to a configured Gemini endpoint — but there are three practical risks to consider before installing: (1) SKILL.md and the script require GEMINI_API_KEY and GEMINI_BASE_URL even though the registry lists none; plan to provide only a dedicated API key, not a general-purpose credential. (2) The script loads .env files from /workspace/.env and your home ~/.openclaw/.env and will import every KEY=VALUE into the environment; check those files first (or avoid placing other secrets there) because the skill will read them. (3) the README recommends running a remote curl | sh to install 'uv' — avoid running untrusted install scripts; prefer installing uv from a known package source or reviewing the installer first. If you decide to proceed, run the skill in an isolated environment (or container), supply a dedicated Gemini API key, and review the full script source for any additional network calls or logging that might leak sensitive data.
Capability Analysis
Type: OpenClaw Skill Name: wangyan-gemini-image-gen Version: 1.0.2 The skill provides image generation and editing capabilities using Gemini models via OpenAI-compatible or Google-native APIs. The Python script `scripts/generate_image.py` implements robust API interaction, model fallback logic, and image processing using standard libraries like `httpx` and `PIL`. While the documentation in `SKILL.md` suggests `curl|bash` installation methods for dependencies (uv and skillhub), these are presented as standard utility installation patterns and do not show evidence of malicious intent. The code logic is transparent, focuses on the stated purpose, and follows OpenClaw conventions for secret management and media handling.
Capability Tags
requires-sensitive-credentials
Capability Assessment
Purpose & Capability
The skill's name/description (Gemini image generation) matches the included Python script and dependencies (httpx, pillow, google-genai). However the registry metadata declares no required env vars or primary credential while the SKILL.md and the script require GEMINI_API_KEY and GEMINI_BASE_URL at runtime — this mismatch is a meaningful coherence problem that could confuse permission reviews or automated gating.
Instruction Scope
SKILL.md and the script instruct loading .env files from three locations (/workspace/.env, ~/.openclaw/.env, and the skill directory). Reading the global ~/.openclaw/.env (user-wide) and /workspace/.env (sandbox) is outside the narrow scope of image generation and risks exposing secrets that belong to other skills or the user. The script auto-populates os.environ from these files; while it only appears to use GEMINI-specific keys, the act of loading global .env files is broad and unnecessary for many installs.
Install Mechanism
The skill has no formal install spec (instruction-only), which is lower-risk, but SKILL.md recommends installing 'uv' via a remote curl | sh command (https://astral.sh/uv/install.sh). Advising users to run an arbitrary remote install script increases supply-chain risk. The binary requirement 'uv' itself is uncommon for small scripts but explained; still the remote installer instruction is a red flag.
Credentials
The script and SKILL.md require GEMINI_API_KEY and GEMINI_BASE_URL (and optionally GEMINI_MODEL, etc.), but the registry declares no required env vars — that's an inconsistency. More importantly, the script loads multiple .env sources (including ~/.openclaw/.env) and imports all KEY=VALUE pairs into the environment, which is disproportionate: a simple image-generator should not read user-global env files that may contain unrelated secrets or credentials.
Persistence & Privilege
The skill does not request 'always: true' and does not modify other skills or system-wide settings. It requires messaging permission to send images (documented). There is no evidence it persists beyond normal skill files.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install wangyan-gemini-image-gen
  3. After installation, invoke the skill by name or use /wangyan-gemini-image-gen
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.2
更新日志详见:[CHANGELOG.md](https://github.com/wangyan/wangyan-skills/blob/main/CHANGELOG.md)
v1.0.1
- 添加了关于 skills CLI、Clawhub CLI 和 skillhub 的详细安装说明,使技能安装更加便捷。 - 优化了文档结构,清晰区分了安装、环境配置和使用等章节。 - 阐明了触发关键词和非触发场景,以实现更精准的激活。 - 更新了环境变量和配置优先级的说明。 - 进行了细微更新和修正,以确保整个文档的一致性和清晰度。
v1.0.0
wangyan-gemini-image-gen 1.0.0 – 初始版本 通过 Gemini 模型实现图片生成、编辑与多图合成。 **核心功能:** - 文生图:根据文字描述生成图片 - 图片编辑:基于已有图片进行修改 - 多图合成:最多支持 14 张图片合成 **主要特性:** - 支持 OpenAI 兼容格式和 Google 原生格式两种 API - 内置模型自动轮询机制,提升可用性 - 可自定义 API 端点和密钥
Metadata
Slug wangyan-gemini-image-gen
Version 1.0.2
License MIT-0
All-time Installs 1
Active Installs 1
Total Versions 3
Frequently Asked Questions

What is Gemini Image Generator?

Generate, edit, and compose images using Gemini models. Activate when user asks to generate images, draw, create logos/posters/icons/banners, edit/modify pho... It is an AI Agent Skill for Claude Code / OpenClaw, with 321 downloads so far.

How do I install Gemini Image Generator?

Run "/install wangyan-gemini-image-gen" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Gemini Image Generator free?

Yes, Gemini Image Generator is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Gemini Image Generator support?

Gemini Image Generator is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Gemini Image Generator?

It is built and maintained by WangYan (@wangyan); the current version is v1.0.2.

💬 Comments