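Description

Nano-banana image generation and editing Skill. It supports text-to-image and image-to-image: it generates or edits images with AI from a text description or a reference image, automatically compresses large images, saves them to the workspace/img directory, and displays them directly in Feishu. Trigger scenarios: the user asks to generate a picture, draw something, create AI artwork, create an image, modify an image, generate from an existing image, or run image-to-image.

Usage Guide (SKILL.md)

Nano-banana Image Generation and Editing Skill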

Name: Image Gen
Author: zweien

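Uses the Nano-banana-3.1-Flash API for image generation and editing. It supports:

Text-to-image: generates an image from a text description
Image-to-image: generates a new image based on a reference image

Images larger than 1MB are compressed automatically, saved to the workspace, and displayed directly in Feishu.

Configuration

Create a .env file in the workspace directory: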

# .env
IMAGE_API_BASE_URL=https://api.imyaigc.top
IMAGE_API_KEY=your-api-key-here
IMAGE_MODEL=gemini-3.1-flash-image-preview
IMAGE_SIZE=2K
IMAGE_ASPECT_RATIO=1:1
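
As a rough illustration of how these settings could be read, here is a minimal sketch using only the Python standard library. The function names `parse_env` and `load_config` are hypothetical and not part of the skill; the skill's own image-gen.sh sources the file from the shell instead. The two required keys follow the security notes further down this page.

```python
from pathlib import Path

# Keys the security notes on this page describe as required.
REQUIRED_KEYS = ("IMAGE_API_BASE_URL", "IMAGE_API_KEY")


def parse_env(text: str) -> dict[str, str]:
    """Parse KEY=VALUE lines, skipping blank lines and '#' comments."""
    values: dict[str, str] = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def load_config(env_path: Path) -> dict[str, str]:
    """Read a .env file and fail early if a required key is missing."""
    config = parse_env(env_path.read_text(encoding="utf-8"))
    missing = [key for key in REQUIRED_KEYS if not config.get(key)]
    if missing:
        raise KeyError(f"missing required settings: {', '.join(missing)}")
    return config
```

Failing early on a missing key keeps an empty API key from being sent to the endpoint by accident.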

Image Storage Location

All generated images are saved in:

~/.openclaw/workspace/img/

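File name format: image_YYYYMMDD_HHMMSS.png

Compression rules:

The original image is saved as image_YYYYMMDD_HHMMSS_original.png
If the original exceeds 1MB, it is compressed automatically and saved as image_YYYYMMDD_HHMMSS.png; otherwise it is copied to that name unchanged
The image_YYYYMMDD_HHMMSS.png version is the one sent to Feishu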
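
The naming scheme above can be sketched in Python as follows. The helper `output_paths` is a hypothetical name used only for illustration; the directory and file name patterns are taken from this page.

```python
from datetime import datetime
from pathlib import Path

# Default output directory described on this page.
OUTPUT_DIR = Path.home() / ".openclaw" / "workspace" / "img"


def output_paths(now: datetime, output_dir: Path = OUTPUT_DIR) -> tuple[Path, Path]:
    """Return (original, final) paths that share one timestamp."""
    stamp = now.strftime("%Y%m%d_%H%M%S")
    original = output_dir / f"image_{stamp}_original.png"
    final = output_dir / f"image_{stamp}.png"
    return original, final
```

Computing both names from a single timestamp keeps the original and the version that gets sent paired, even if the download takes several seconds.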

Usage

1. Text-to-Image

Just tell me what image to generate:

Examples:

"Generate a picture of a beach at sunset"
"Draw a cute cat"
"Create a cyberpunk-style lobster, 16:9 aspect ratio"

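2. Image-to-Image

Provide a reference image and describe the change:

Examples:

"Based on this image, change it to a watercolor style"
"Turn this image into a night scene"
"Using this image as a reference, generate a building in a similar style"
"Turn this photo into an oil painting"

Workflow:

The user provides a reference image (by sending it)
The image is downloaded and converted to base64
The API is called with the image parameter
A new image based on the reference image is returned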

Workflow

Text-to-image flow

Parse parameters
- Extract the prompt
- Identify an aspect ratio parameter (e.g. "16:9", "1:1")
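
One possible way to carry out the parsing step is sketched below. This is an assumption about how it could be done, not the skill's actual logic: `parse_request` is a hypothetical helper, and the prompt is passed through unchanged rather than having the ratio removed from it. The set of ratios is the one listed under the supported parameters on this page.

```python
import re

# Aspect ratios listed on this page.
SUPPORTED_RATIOS = frozenset({
    "1:1", "4:3", "3:4", "16:9", "9:16", "2:3", "3:2",
    "4:5", "5:4", "21:9", "1:4", "4:1", "8:1", "1:8",
})

# Matches tokens such as "16:9"; also accepts the full-width colon.
RATIO_PATTERN = re.compile(r"(?<!\d)(\d{1,2})\s*[:：]\s*(\d{1,2})(?!\d)")


def parse_request(text: str, default_ratio: str = "1:1") -> tuple[str, str]:
    """Return (prompt, aspect_ratio); fall back to the default ratio."""
    for match in RATIO_PATTERN.finditer(text):
        ratio = f"{match.group(1)}:{match.group(2)}"
        if ratio in SUPPORTED_RATIOS:
            return text.strip(), ratio
    return text.strip(), default_ratio
```

Checking candidates against the supported set avoids treating something like a clock time ("10:30") as an aspect ratio.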

Call the API

curl -X POST "${IMAGE_API_BASE_URL}/v1/images/generations" \
    -H "Authorization: Bearer ${IMAGE_API_KEY}" \
    -H "Content-Type: application/json" \
    -d '{
        "model": "'"${IMAGE_MODEL}"'",
        "prompt": "description text",
        "response_format": "url",
        "aspect_ratio": "16:9"
    }'

Image-to-image flow

Get the reference image
- The user sends an image
- Download the image and convert it to base64

Call the API

curl -X POST "${IMAGE_API_BASE_URL}/v1/images/generations" \
    -H "Authorization: Bearer ${IMAGE_API_KEY}" \
    -H "Content-Type: application/json" \
    -d '{
        "model": "'"${IMAGE_MODEL}"'",
        "prompt": "edit description (e.g. change to watercolor style)",
        "response_format": "url",
        "aspect_ratio": "16:9",
        "image": ["data:image/png;base64,iVBORw0KGgo..."]
    }'
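
The value passed in `image` is a data URI. A minimal sketch of producing one from downloaded bytes is shown below; `detect_mime` and `to_data_uri` are hypothetical helpers, and recognising only PNG and JPEG is an assumption made to keep the example short. Whether the API accepts types other than the PNG form shown above is not stated on this page.

```python
import base64

PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"
JPEG_SIGNATURE = b"\xff\xd8\xff"


def detect_mime(data: bytes) -> str:
    """Guess the MIME type from the file's leading magic bytes."""
    if data.startswith(PNG_SIGNATURE):
        return "image/png"
    if data.startswith(JPEG_SIGNATURE):
        return "image/jpeg"
    raise ValueError("unsupported image type (this sketch only knows PNG and JPEG)")


def to_data_uri(data: bytes) -> str:
    """Encode raw image bytes as a data:<mime>;base64,<payload> string."""
    encoded = base64.b64encode(data).decode("ascii")
    return f"data:{detect_mime(data)};base64,{encoded}"
```

The `iVBORw0KGgo` prefix in the curl example is simply the base64 encoding of the PNG signature bytes.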

Get the image URL
- Extract data[0].url from the response
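
A small sketch of that extraction in Python follows. It assumes the response body is JSON with a `data` array whose first element carries a `url` field, as described above; `extract_image_url` is a hypothetical name, and the shape of error responses is not documented on this page.

```python
import json


def extract_image_url(response_text: str) -> str:
    """Return data[0].url from the API response, or raise ValueError."""
    body = json.loads(response_text)
    try:
        url = body["data"][0]["url"]
    except (KeyError, IndexError, TypeError) as exc:
        raise ValueError("response does not contain data[0].url") from exc
    if not isinstance(url, str) or not url:
        raise ValueError("data[0].url is empty or not a string")
    return url
```

Raising a clear error here makes a failed generation visible before the download step tries to fetch an empty URL.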

Download and save the original image

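# Output directory and timestamped file names
OUTPUT_DIR=~/.openclaw/workspace/img
TIMESTAMP=$(date +%Y%m%d_%H%M%S)
ORIGINAL_FILE="${OUTPUT_DIR}/image_${TIMESTAMP}_original.png"
FINAL_FILE="${OUTPUT_DIR}/image_${TIMESTAMP}.png"

# Create the directory if it does not exist yet
mkdir -p "$OUTPUT_DIR"
# -f: fail on HTTP errors instead of saving the error page; -L: follow redirects
curl -fsSL -o "$ORIGINAL_FILE" "$IMAGE_URL"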

Check the size and compress if needed

# File size in bytes: try BSD/macOS stat first, then fall back to GNU stat
FILE_SIZE=$(stat -f%z "$ORIGINAL_FILE" 2>/dev/null || stat -c%s "$ORIGINAL_FILE")
MAX_SIZE=$((1024 * 1024))  # 1MB

if [ "$FILE_SIZE" -gt "$MAX_SIZE" ]; then
    echo "Image exceeds 1MB, compressing..."
    python3 compress_image.py "$ORIGINAL_FILE" "$FINAL_FILE" 1
else
    cp "$ORIGINAL_FILE" "$FINAL_FILE"
fi
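
For comparison, the same size check can be expressed in Python, which sidesteps the difference between BSD and GNU `stat`. This is a sketch only: `needs_compression` and `finalize` are hypothetical names, and the `compress` argument stands in for whatever compress_image.py does.

```python
import shutil
from pathlib import Path
from typing import Callable

MAX_SIZE = 1024 * 1024  # 1MB, the threshold used on this page


def needs_compression(path: Path, max_size: int = MAX_SIZE) -> bool:
    """True when the file is strictly larger than the limit."""
    return path.stat().st_size > max_size


def finalize(original: Path, final: Path,
             compress: Callable[[Path, Path], object]) -> Path:
    """Compress large files; copy small ones unchanged. The original is kept."""
    if needs_compression(original):
        compress(original, final)
    else:
        shutil.copyfile(original, final)
    return final
```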

Send to Feishu. When running in a Feishu conversation, send the image as follows:

message(
  action="send",
  channel="feishu",
  accountId="second",
  target="user or group ID",
  media="~/.openclaw/workspace/img/image_xxx.png"
)

Compression Tool

Use compress_image.py to compress images:

# Usage
python3 compress_image.py <input> <output> [max_size_mb]

# Example
python3 compress_image.py original.png compressed.png 1

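Compression strategy:

If the image is ≤ 1MB, it is not compressed
If the image is > 1MB:
- Lower the JPEG quality step by step (85 → 10)
- If it still exceeds the limit, scale the dimensions down (90% → 30%)
The original file is kept; the compressed result is saved separately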
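
The two-phase strategy above can be sketched as a search loop. This is not the code of compress_image.py: the step sizes (quality in steps of 5, scale in steps of 10%) and the choice to keep the lowest quality while scaling are assumptions, and `encode` is a hypothetical callback that would wrap an image library such as Pillow.

```python
from typing import Callable, Optional

QUALITIES = tuple(range(85, 5, -5))          # 85, 80, ..., 10 (step is an assumption)
SCALES = (0.9, 0.8, 0.7, 0.6, 0.5, 0.4, 0.3)  # 90% down to 30% (step is an assumption)


def compress_to_limit(encode: Callable[[int, float], bytes],
                      max_bytes: int) -> Optional[bytes]:
    """Try progressively stronger settings until the output fits.

    encode(quality, scale) must return the encoded image bytes.
    Returns None if even the strongest setting is still too large.
    """
    # Phase 1: keep the full dimensions and lower the JPEG quality.
    for quality in QUALITIES:
        data = encode(quality, 1.0)
        if len(data) <= max_bytes:
            return data
    # Phase 2: keep the lowest quality (assumption) and shrink the dimensions.
    for scale in SCALES:
        data = encode(QUALITIES[-1], scale)
        if len(data) <= max_bytes:
            return data
    return None
```

Separating the search order from the encoder makes the strategy easy to test with a fake encoder before wiring in a real one.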

Supported Parameters

Aspect ratio (aspect_ratio)

1:1 - square (default)
4:3, 3:4 - standard
16:9, 9:16 - widescreen / portrait
2:3, 3:2 - photo
4:5, 5:4 - near-square
21:9 - ultrawide
1:4, 4:1 - strip
8:1, 1:8 - extra-long strip

Image size (image_size)

1K - suited to quick previews
2K - recommended (default)
4K - high-resolution
512px - small image

Reference image (image)

Type: array of strings
Format: URL or base64-encoded data (the data:image/png;base64,... form is recommended)
Supports: one or more reference images

API Information

Endpoint: POST /v1/images/generations
Authentication: Authorization: Bearer {{YOUR_API_KEY}}
Model: ${IMAGE_MODEL} (default: gemini-3.1-flash-image-preview)
Base URL: https://api.imyaigc.top

Full Request Parameters

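Parameter	Type	Required	Description
model	string	✅	Model name
prompt	string	✅	Description of the image to generate
response_format	string	❌	`url` or `b64_json`
aspect_ratio	string	❌	Aspect ratio
image	array	❌	Array of reference images (URL or base64)
image_size	string	❌	Image size (supported by nano-banana-2 only)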

Directory Structure

~/.openclaw/workspace/
├── img/                          # directory for generated images
│   ├── image_20260302_221900_original.png  # original image
│   ├── image_20260302_221900.png           # compressed (used for sending)
│   └── ...
└── skills/
    └── image-gen/
        ├── SKILL.md              # this guide
        ├── image-gen.sh          # generation script
        ├── compress_image.py     # compression tool
        ├── .env                  # API configuration
        └── .env.example          # example configuration

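Notes

Images must be saved in the ~/.openclaw/workspace/img/ directory
Images over 1MB are compressed automatically before being sent to Feishu
The original image is kept for later use
File names use a timestamp to avoid collisions
Prompts are automatically translated into English for better generation results
An API call can take 30-60 seconds; tell the user to wait
For image-to-image: the reference image is converted to base64 and passed in the API's image parameter

Security Recommendations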

Before installing or running this skill:
- Be aware it will send prompts and any reference images to a third-party endpoint (https://api.imyaigc.top). Only provide images/prompts you are comfortable sharing with that service.
- The registry metadata omits required env vars; you must create a .env containing IMAGE_API_BASE_URL and IMAGE_API_KEY. Treat that API key like a secret.
- Verify and trust the external API host before providing credentials. Consider contacting the skill author or using a well-known provider instead.
- The scripts rely on external tools (curl, jq, file, base64) and Python Pillow (PIL) but don't declare or install them; install and verify these dependencies in a sandbox first.
- Note the .env path discrepancy: SKILL.md suggests a workspace .env, while image-gen.sh sources .env from the skill directory. Confirm which file the runtime will actually load to avoid leaking a key to the wrong place.
- If you cannot verify the endpoint or the author, run the skill in an isolated environment or prefer an alternative skill that declares its credentials and dependencies explicitly.

Functional Analysis

Type: OpenClaw Skill
Name: image-gen-skill
Version: 0.1.0

The skill bundle provides legitimate image generation and editing capabilities via the Nano-banana API (api.imyaigc.top). It includes a shell script (image-gen.sh) for API orchestration and a Python script (compress_image.py) for local image processing. While the shell script contains a minor JSON injection vulnerability in the text-to-image heredoc construction, there is no evidence of intentional malice, data exfiltration, or unauthorized system access.

Capability Assessment

⚠ Purpose & Capability

The skill's stated purpose (text→image and image→image) matches the included scripts and compressor. However, the registry metadata lists no required environment variables or credentials, while the SKILL.md and scripts clearly require IMAGE_API_BASE_URL and IMAGE_API_KEY (and optionally IMAGE_MODEL, IMAGE_SIZE, IMAGE_ASPECT_RATIO). This is an inconsistency: these variables should have been declared as required credentials.

ℹ Instruction Scope

Runtime instructions convert user-provided reference images to base64 and send them (and prompts) to an external API (https://api.imyaigc.top). They save originals under ~/.openclaw/workspace/img and compress >1MB images. That behavior is expected for this skill but has privacy implications (user images and prompts are transmitted to a third party). Also SKILL.md says create .env in the workspace, while image-gen.sh sources .env from the script directory—this path mismatch may cause confusion.

⚠ Install Mechanism

There is no install spec (instruction-only), but code files are included. The scripts depend on external tools/libraries (curl, jq, file, base64, Python with Pillow) but the skill does not declare these dependencies or provide install steps. Lack of dependency declaration increases risk of runtime errors and hidden behaviors when you try to run it.

⚠ Credentials

The skill requires a single API key and base URL to call an external image-generation service—this is proportionate to its function. However the metadata does not declare these required env vars or mark IMAGE_API_KEY as the primary credential. The external base URL (api.imyaigc.top) is not a well-known provider; sending images and prompts there may be a privacy/data-exfiltration risk if you don't trust the service.

✓ Persistence & Privilege

The skill does not request always:true or system-wide privileges, and does not attempt to modify other skills or system configuration. It only writes image files to ~/.openclaw/workspace/img/ (as described).

Version History

v0.1.0

image-gen-skill v0.1.0 changelog
- Initial release of the Nano-banana image generation and editing Skill.
- Supports text-to-image and image-to-image generation/editing via Gemini-3.1-Flash API.
- Automatically compresses images larger than 1MB before sending, preserving both original and compressed versions in workspace/img.
- Offers flexible aspect ratio and size settings; saves images with timestamp-based filenames.
- Includes clear usage instructions and sample API calls for both modes.
- Provides built-in compression utility and detailed directory/configuration guidance.

Metadata

Slug image-gen-skill

Version 0.1.0

License MIT-0

Total installs 5

Current installs 5

Number of past versions 1

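FAQ

What is Image Gen?

Nano-banana image generation and editing Skill. It supports text-to-image and image-to-image: it generates or edits images with AI from a text description or a reference image, automatically compresses large images, saves them to the workspace/img directory, and displays them directly in Feishu. Trigger scenarios: the user asks to generate a picture, draw something, create AI artwork, create an image, modify an image, generate from an existing image, or run image-to-image. It is an AI Agent Skill plugin for Claude Code / OpenClaw and has been downloaded 1474 times in total so far.

How do I install Image Gen?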

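Run the command "/install image-gen-skill" in the OpenClaw or Claude Code chat box to install it in one step. The skill still needs a .env file containing IMAGE_API_BASE_URL and IMAGE_API_KEY before it can call the API.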

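Is Image Gen free?

Yes. Image Gen is completely free and released under the MIT-0 license; it can be downloaded, installed, and used freely.

Which platforms does Image Gen support?

Image Gen is cross-platform and can be used in any environment where OpenClaw / Claude Code is deployed.

Who developed Image Gen?

It is developed and maintained by zweien (@zweien); the current version is v0.1.0.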
