← 返回 Skills 市场
321
总下载
1
收藏
1
当前安装
3
版本数
在 OpenClaw 中安装
/install wangyan-gemini-image-gen
功能描述
Generate, edit, and compose images using Gemini models. Activate when user asks to generate images, draw, create logos/posters/icons/banners, edit/modify pho...
安全使用建议
This skill likely does what it says — it sends image-generation requests to a configured Gemini endpoint — but there are three practical risks to consider before installing: (1) SKILL.md and the script require GEMINI_API_KEY and GEMINI_BASE_URL even though the registry lists none; plan to provide only a dedicated API key, not a general-purpose credential. (2) The script loads .env files from /workspace/.env and your home ~/.openclaw/.env and will import every KEY=VALUE into the environment; check those files first (or avoid placing other secrets there) because the skill will read them. (3) the README recommends running a remote curl | sh to install 'uv' — avoid running untrusted install scripts; prefer installing uv from a known package source or reviewing the installer first. If you decide to proceed, run the skill in an isolated environment (or container), supply a dedicated Gemini API key, and review the full script source for any additional network calls or logging that might leak sensitive data.
功能分析
Type: OpenClaw Skill
Name: wangyan-gemini-image-gen
Version: 1.0.2
The skill provides image generation and editing capabilities using Gemini models via OpenAI-compatible or Google-native APIs. The Python script `scripts/generate_image.py` implements robust API interaction, model fallback logic, and image processing using standard libraries like `httpx` and `PIL`. While the documentation in `SKILL.md` suggests `curl|bash` installation methods for dependencies (uv and skillhub), these are presented as standard utility installation patterns and do not show evidence of malicious intent. The code logic is transparent, focuses on the stated purpose, and follows OpenClaw conventions for secret management and media handling.
能力标签
能力评估
Purpose & Capability
The skill's name/description (Gemini image generation) matches the included Python script and dependencies (httpx, pillow, google-genai). However the registry metadata declares no required env vars or primary credential while the SKILL.md and the script require GEMINI_API_KEY and GEMINI_BASE_URL at runtime — this mismatch is a meaningful coherence problem that could confuse permission reviews or automated gating.
Instruction Scope
SKILL.md and the script instruct loading .env files from three locations (/workspace/.env, ~/.openclaw/.env, and the skill directory). Reading the global ~/.openclaw/.env (user-wide) and /workspace/.env (sandbox) is outside the narrow scope of image generation and risks exposing secrets that belong to other skills or the user. The script auto-populates os.environ from these files; while it only appears to use GEMINI-specific keys, the act of loading global .env files is broad and unnecessary for many installs.
Install Mechanism
The skill has no formal install spec (instruction-only), which is lower-risk, but SKILL.md recommends installing 'uv' via a remote curl | sh command (https://astral.sh/uv/install.sh). Advising users to run an arbitrary remote install script increases supply-chain risk. The binary requirement 'uv' itself is uncommon for small scripts but explained; still the remote installer instruction is a red flag.
Credentials
The script and SKILL.md require GEMINI_API_KEY and GEMINI_BASE_URL (and optionally GEMINI_MODEL, etc.), but the registry declares no required env vars — that's an inconsistency. More importantly, the script loads multiple .env sources (including ~/.openclaw/.env) and imports all KEY=VALUE pairs into the environment, which is disproportionate: a simple image-generator should not read user-global env files that may contain unrelated secrets or credentials.
Persistence & Privilege
The skill does not request 'always: true' and does not modify other skills or system-wide settings. It requires messaging permission to send images (documented). There is no evidence it persists beyond normal skill files.
如何使用
- 确保已安装 OpenClaw(本地或 Docker 部署)
- 在对话框中输入安装命令:
/install wangyan-gemini-image-gen - 安装完成后,直接呼叫该 Skill 的名称或使用
/wangyan-gemini-image-gen触发 - 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.2
更新日志详见:[CHANGELOG.md](https://github.com/wangyan/wangyan-skills/blob/main/CHANGELOG.md)
v1.0.1
- 添加了关于 skills CLI、Clawhub CLI 和 skillhub 的详细安装说明,使技能安装更加便捷。
- 优化了文档结构,清晰区分了安装、环境配置和使用等章节。
- 阐明了触发关键词和非触发场景,以实现更精准的激活。
- 更新了环境变量和配置优先级的说明。
- 进行了细微更新和修正,以确保整个文档的一致性和清晰度。
v1.0.0
wangyan-gemini-image-gen 1.0.0 – 初始版本
通过 Gemini 模型实现图片生成、编辑与多图合成。
**核心功能:**
- 文生图:根据文字描述生成图片
- 图片编辑:基于已有图片进行修改
- 多图合成:最多支持 14 张图片合成
**主要特性:**
- 支持 OpenAI 兼容格式和 Google 原生格式两种 API
- 内置模型自动轮询机制,提升可用性
- 可自定义 API 端点和密钥
元数据
常见问题
Gemini Image Generator 是什么?
Generate, edit, and compose images using Gemini models. Activate when user asks to generate images, draw, create logos/posters/icons/banners, edit/modify pho... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 321 次。
如何安装 Gemini Image Generator?
在 OpenClaw 或 Claude Code 对话框中运行命令「/install wangyan-gemini-image-gen」即可一键安装,无需额外配置。
Gemini Image Generator 是免费的吗?
是的,Gemini Image Generator 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。
Gemini Image Generator 支持哪些平台?
Gemini Image Generator 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。
谁开发了 Gemini Image Generator?
由 WangYan(@wangyan)开发并维护,当前版本 v1.0.2。
推荐 Skills