功能描述

电商套图生成助手。用户明确提出需要生成电商套图、商品主图、卖点图、场景图、模特图等图片内容时触发。支持国内平台（淘宝、京东、拼多多、抖音）与国际跨境平台（Amazon、独立站）的尺寸规范。触发示例：「帮我生成这件T恤的电商套图」「做一套淘宝主图」「生成亚马逊listing图片」。不应在用户仅上传图片但未明确...

使用说明 (SKILL.md)

电商套图生成助手

Name: ecommerce-image-suite
Author: wzj177

概览

本 Skill 实现从「商品原始图片 + 卖点信息」到「完整电商套图」的一键生成流程：

① 上传商品图片（必须）+ 输入卖点信息（可选）
        ↓
② AI 视觉分析：提取商品主体，智能生成卖点文案（可编辑）
        ↓
③ 选择平台规范 + 套图类型（7种标准图）
        ↓
④ AI 生成每张图的详细 Prompt（可编辑）
        ↓
⑤ 调用图像生成 API，输出完整套图

第一步：收集输入信息

必须项

商品图片：用户上传的原始商品图（平铺图/白底图/实物图均可）

可选项（若用户未提供，AI 将自动从图片中分析生成）

字段	说明
商品名称	如"卡通小狗印花宽松精梳棉短袖T恤"
核心卖点	材质、版型、设计特点等 3-5 条
适用人群	如"追求舒适简约风的青少年"
期望场景	如"校园日常、居家休闲、户外出游"
规格参数	材质、颜色、版型、领型、袖长等

第二步：AI 分析与卖点生成

视觉分析步骤

识别商品类型、颜色、款式、设计元素
提取商品主体轮廓与关键视觉特征
基于视觉特征推断材质、功能卖点
生成结构化卖点（JSON格式，供后续图片生成使用）

卖点 JSON 结构

{
  "product_name": "商品名称",
  "product_type": "服装/3C/家居/其他",
  "visual_features": ["白色", "圆领", "短袖", "卡通小狗印花"],
  "selling_points": [
    {"icon": "fabric", "en": "Combed Cotton", "zh": "精梳棉面料"},
    {"icon": "fit", "en": "Loose & Breathable", "zh": "宽松透气"},
    {"icon": "design", "en": "Cute Design", "zh": "萌趣设计"}
  ],
  "target_audience": "青少年、学生群体",
  "usage_scenes": ["校园", "居家", "户外"],
  "color": "白色",
  "material": "精梳棉"
}

📄 详细分析 Prompt 见 references/analysis-prompts.md

第三步：选择平台与套图配置

📄 各平台规范详见 references/platforms.md

平台选择

平台类型	平台	推荐尺寸	语言
国内	淘宝/天猫	800×800 (1:1)	中文
国内	京东	800×800 (1:1)	中文
国内	拼多多	750×750 (1:1)	中文
国内	抖音/小红书	1080×1350 (4:5) 或 1:1	中文
国际	Amazon	2000×2000 (1:1)	英文
国际	独立站/Shopify	2000×2000 (1:1) 或 16:9	英文

标准套图（7种）

每种图的详细规格见 references/image-types.md

#	图片类型	核心目标	推荐位置
1	白底主图	商品全貌展示，符合平台收录规则	第1张主图
2	核心卖点图	3大卖点图标化呈现	第2张
3	卖点图	单一核心卖点深度展示	第3张
4	材质图	面料/工艺特写，建立品质信任	第4张
5	场景展示图	生活方式场景，激发代入感	第5张
6	模特展示图	真人/AI模特穿搭，直观展示效果	第6张
7	多场景拼图	多场景适用性对比，提升决策	第7张

第四步：生成图片 Prompt

📄 各图类型的 Prompt 模板见 references/image-types.md

Prompt 构建原则

商品一致性：所有图片必须保持商品颜色、结构、比例、细节不变
背景差异化：每张图背景/场景各不相同，形成完整故事线
文字分离：图片本身不含文字，文案通过后处理叠加（除非使用图像生成API支持文字）
品质标准：photorealistic, high quality, studio lighting, 8K, commercial photography

Prompt 结构模板

[商品描述] + [版型/颜色/印花精确描述] + [场景/背景描述] + [光线/氛围] + [拍摄角度] + [品质词]

第五步：多供应商图像生成

📄 各供应商 API 接入详情见 references/providers.md

支持的图像生成供应商（5个）

供应商	默认模型	模型环境变量	国内可用	特点
OpenAI	`dall-e-3`	`OPENAI_MODEL`	需代理	高质量写实，细节清晰
Google	`gemini-3.1-flash-image-preview`	`GEMINI_MODEL`	需代理	原生图像生成，2K 输出
Stability AI	`core`	`STABILITY_MODEL`	需代理	精准控制构图
阿里云	`qwen-image-2.0-pro`	`DASHSCOPE_MODEL`	✅直连	同步接口，中文优化
字节跳动	`doubao-seedream-5-0-260128`	`ARK_MODEL`	✅直连	中文理解好，风格多样

模型名可通过 --model 参数、环境变量或默认值配置，优先级：--model > 环境变量 > 默认值。

供应商检测

python3 scripts/check_providers.py

输出 JSON 包含 configured 数组，显示哪些供应商已配置 API Key。

执行生图脚本

python3 scripts/generate.py \
  --product '{"product_description_for_prompt": "...", "selling_points": [...]}' \
  --provider tongyi \
  --types white_bg,key_features,selling_pt,material,lifestyle,model,multi_scene \
  --output-dir ./output/raw/

generate.py 完整参数

参数	说明	默认值
`--product`	必填，商品 JSON 字符串	—
`--provider`	必填，供应商：`openai` / `gemini` / `stability` / `tongyi` / `doubao`	—
`--api-key`	API Key，也可通过环境变量传入	环境变量
`--base-url`	自定义代理地址，也可通过 `*_BASE_URL` 环境变量传入	官方地址
`--model`	模型名称，也可通过 `*_MODEL` 环境变量传入	见供应商表
`--types`	逗号分隔的套图类型	全部 7 种
`--output-dir`	输出目录	`./output/raw/`

代理 API 使用示例

各供应商均支持通过 --base-url 或环境变量指定代理地址：

# Gemini 通过代理（代理使用 Bearer token 鉴权）
GEMINI_API_KEY="sk-proxy-key" \
GEMINI_BASE_URL="https://my-proxy.com/v1beta/models/gemini-3.1-flash-image-preview:generateContent" \
  python3 scripts/generate.py --provider gemini --product '...'

# 通义通过代理
DASHSCOPE_API_KEY="sk-..." \
DASHSCOPE_BASE_URL="https://my-proxy.com/api/v1/services/aigc/multimodal-generation/generation" \
  python3 scripts/generate.py --provider tongyi --product '...'

# 切换模型版本
DASHSCOPE_MODEL="qwen-image-2.0" \
  python3 scripts/generate.py --provider tongyi --product '...'

第六步：文案叠加

使用 Pillow 将文案叠加到生成图片上：

python3 scripts/overlay.py \
  --input-dir ./output/raw/ \
  --product '{"selling_points": [...], "product_name_zh": "..."}' \
  --lang zh \
  --output-dir ./output/final/

叠加规范（各图类型）

白底主图 / 模特展示图：无文案叠加
核心卖点图：右侧区域，WHY CHOOSE US + 3个卖点标签，深色文字
卖点图：左上主标题 + 左下两条副标题，深色文字
材质图：右上主标题 + 右侧两条副标题，深色文字
场景展示图：左上主标题 + 左下两条副标题，白色文字+阴影
多场景拼图：顶部居中主标题 + 底部两侧场景标注，白色文字+阴影

📄 各图类型叠加坐标规范见 references/providers.md（Canvas规范部分）

CLI 执行流程（Agent 调用）

Agent 或 CLI 环境下的完整流程：

# Step 1: 检测供应商配置
python3 scripts/check_providers.py

# Step 2: Agent 分析商品图片（Claude Vision），输出 product JSON

# Step 3: 执行生图
python3 scripts/generate.py \
  --product '{"product_description_for_prompt": "white T-shirt...", "selling_points": [...]}'  \
  --provider tongyi \
  --output-dir ./output/raw/

# Step 4: 文案叠加
python3 scripts/overlay.py \
  --input-dir ./output/raw/ \
  --product '{"product_description_for_prompt": "white T-shirt...", "selling_points": [...]}' \
  --lang zh \
  --output-dir ./output/final/

执行检查清单

商品图片已上传（必须）
商品卖点已生成或用户已填写
平台已选择（决定语言和尺寸）
套图类型已选择（至少1种）
所有 Prompt 已审核（可选）
图像生成 API 可用

参考文件索引

文件	内容
`references/platforms.md`	各平台尺寸规范、主图要求、文案风格指南
`references/image-types.md`	7种套图的详细视觉规格与 Prompt 模板
`references/analysis-prompts.md`	AI商品分析与卖点提取的系统 Prompt
`references/providers.md`	供应商 API 接入详情与文案叠加规范
`scripts/check_providers.py`	检测已配置供应商（读取环境变量）
`scripts/generate.py`	调用图像生成 API（5个供应商，支持 `--model` / `--base-url` / `--api-key`）
`scripts/overlay.py`	Pillow 文案叠加（动态卖点 + 多语言）

API Key 配置

本 Skill 使用两类 API：

变量	用途	是否必需
`DASHSCOPE_API_KEY`	千问图像生成（国内直连）	✅ 推荐
`ARK_API_KEY`	豆包 Seedream 图像生成（火山方舟，国内直连）	可选
`OPENAI_API_KEY`	DALL·E 3 图像生成（需代理）	可选
`GEMINI_API_KEY`	Gemini 原生图像生成（需代理）	可选
`STABILITY_API_KEY`	Stable Image Core（需代理）	可选
`*_BASE_URL`	各供应商自定义代理地址（`OPENAI_BASE_URL` / `GEMINI_BASE_URL` 等）	可选
`*_MODEL`	各供应商自定义模型名（`DASHSCOPE_MODEL` / `ARK_MODEL` / `GEMINI_MODEL` 等）	可选

安全声明：API Key 仅存于本地环境变量，直接调用各供应商官方 Endpoint，不经过任何第三方服务器中转。建议使用权限最小化的 Key，并定期轮换。

方式一：环境变量

# 至少配置一个图像供应商
export DASHSCOPE_API_KEY="sk-..."       # 阿里云 DashScope（国内直连，推荐）
export ARK_API_KEY="..."                # 字节跳动火山方舟（国内直连）
export OPENAI_API_KEY="sk-..."         # 需代理
export GEMINI_API_KEY="AIzaSy..."      # 需代理
export STABILITY_API_KEY="sk-..."      # 需代理

# 可选：自定义代理地址
export OPENAI_BASE_URL="https://my-proxy.com/v1"
export GEMINI_BASE_URL="https://my-proxy.com/gemini"
export DASHSCOPE_BASE_URL="https://my-proxy.com/dashscope"

# 可选：自定义模型名（不配置则使用默认值）
export DASHSCOPE_MODEL="qwen-image-2.0"       # 默认 qwen-image-2.0-pro
export ARK_MODEL="doubao-seedream-5-0-260128" # 默认 doubao-seedream-5-0-260128
export GEMINI_MODEL="gemini-3.1-flash-image-preview"  # 默认同此

加入 ~/.zshrc 或 ~/.bashrc 后永久生效。

方式二：OpenClaw 配置文件

在 $OPENCLAW_CONFIG_PATH（默认 ~/.openclaw/openclaw.json）中配置 apiKey，对应 primaryEnv（即 DASHSCOPE_API_KEY）：

{
  skills: {
    entries: {
      "ecommerce-image-suite": {
        apiKey: "DASHSCOPE_API_KEY_HERE",
      },
    }
  },
}

安全使用建议

This skill appears to implement the advertised e‑commerce image workflow, but review and be cautious before installing or running it: - Confirm which provider key you will supply. The skill lists DASHSCOPE_API_KEY as required (Tongyi / DashScope). Optional keys for OpenAI, Google Gemini, Stability, ARK/豆包 are also recognized — only set the keys you intend to use. - There are inconsistent environment variable names in the code/documentation (e.g., DOUBAO_API_KEY vs ARK_API_KEY). Double‑check the script you run (check_providers.py vs generate.py) to ensure the environment variable names you set are actually detected. - The generate scripts allow custom --base-url / *_BASE_URL values (proxy endpoints). If you use a proxy URL, the script will send your API key and prompt/image data to that proxy. Only use trusted proxy endpoints (prefer official provider endpoints) — untrusted proxies can harvest your API keys and uploaded images. - The skill will send product images/prompts to external services (image generation and the 'visual analysis' step). Do not supply sensitive or private images unless you accept they will be transmitted to those providers. - check_providers prints a key preview; avoid running it where its stdout could be captured by others if you care about leaking partial key data. - If you plan to use this in production, consider: (1) fixing the env var name mismatches in the code, (2) adding explicit validation/whitelisting for allowed base_url values, and (3) limiting logging of API keys/partials. If you are unsure about trusting provider endpoints or proxies, test the skill in a controlled environment with non-sensitive images and short‑lived API keys that you will rotate afterward.

能力标签

requires-oauth-token

能力评估

ℹ Purpose & Capability

The skill's name/description match the code: it builds prompts, calls image-generation providers, and overlays copy. Requiring a provider API key (DASHSCOPE_API_KEY) is reasonable because the skill targets multiple AIGC providers and lists DashScope/Tongyi as a primary vendor. However there are internal inconsistencies in environment variable names (see instruction_scope/environment_proportionality) which will cause confusion or misconfiguration.

⚠ Instruction Scope

Runtime instructions and scripts explicitly send product descriptions and (implicitly) uploaded product images to external image-generation APIs. The code accepts custom --base-url values (proxy endpoints) and will send Authorization headers or API keys to those URLs; this creates a clear exfiltration vector for both API keys and image/prompt data if a malicious proxy is supplied. The SKILL.md also references using Claude Vision for analysis (prompts provided) but there is no declared env/config for Anthropic — the skill expects the agent/platform to supply that model, which is plausible but worth noting.

✓ Install Mechanism

No install spec; scripts are instruction-driven Python scripts. Dependencies (requests, Pillow) are required but not automatically installed. No remote arbitrary archive downloads or installers were found in the manifest.

ℹ Credentials

Requiring a provider API key is proportionate. The manifest declares DASHSCOPE_API_KEY as required and several other provider keys as optional, which fits a multi-provider generator. However, there are naming inconsistencies: generate.py/refs use ARK_API_KEY / ARK_BASE_URL for the 'doubao' provider in some places while check_providers.py lists DOUBAO_API_KEY / DOUBAO_BASE_URL (typo/inconsistent naming). This can cause failure to detect configured keys or confusion about which secret you must provide. check_providers prints a key_preview (first 8 chars) which may leak partial keys into logs/output.

✓ Persistence & Privilege

always:false and no evidence the skill modifies other skills or system settings. It does not request elevated system presence.

版本历史

v1.0.6

ecommerce-image-suite 1.0.6 - 新增所有图像供应商支持自定义模型名：可通过`--model`参数或`*_MODEL`环境变量切换模型版本（如DASHSCOPE_MODEL、ARK_MODEL等）。 - 更新各供应商默认模型名及使用说明，完善多云API配置方式。 - generate.py 增加`--model`参数，适配自定义模型场景，模型优先级明确：命令行 > 环境变量 > 默认值。 - 扩展SKILL.md说明，详细注明各供应商模型配置、参数用法、代理配置方式及用法示例。 - 优化API Key与代理地址、模型参数的环境变量说明，支持更多国内直连及多版本覆盖。 - 细化各供应商表述与参数文档，提升跨云能力与易用性。

v1.0.5

ecommerce-image-suite v1.0.5 - No code or documentation changes detected in this version. - Behavior, features, and usage remain the same as the previous release.

v1.0.4

No code or documentation changes detected in this release. - Version updated to 1.0.4 with no other modifications. - All features and usage remain unchanged from previous version.

v1.0.3

**Changelog for ecommerce-image-suite v1.0.3** - CLI 支持大升级，新增三个 Python 脚本：`check_providers.py`、`generate.py`、`overlay.py`，分别用于检测 API 配置、多供应商生图、与文案叠加。 - 移除前端专用脚本 `suite_artifact.jsx`，CLI/Agent 场景默认采用 Python 实现的自动化流程。 - SKILL.md 文档全面补充：详细说明 CLI/代理部署支持、环境变量/代理参数配置、新增所有脚本说明与示例命令。 - 主要 API key 环境变量由 `QWEN_API_KEY` 更名为 `DASHSCOPE_API_KEY`，并补充所有可选代理参数说明。 - 技能既可在交互式 Artifact（浏览器）执行，也可在本地或 Agent 场景一键批量自动化执行。

v1.0.2

- Updated required environment variables: now only `QWEN_API_KEY` is mandatory; removed `ANTHROPIC_API_KEY` as a requirement. - Revised API Key configuration instructions and table to reflect the change—product analysis and image generation now unified under 千问 (Qwen). - Clarified description and metadata to match the new API requirements. - Adjusted references to reflect 千问 as the primary analysis and generation model; removed mention of Claude Vision/Anthropic in relevant sections. - Updated OpenClaw configuration example to use only `QWEN_API_KEY` as the primary environment variable.

v1.0.1

ecommerce-image-suite 1.0.1 - 增加 `.env.example` 文件，方便用户参考和配置环境变量。 - 精简和调整触发逻辑说明，只有用户明确提出电商图片生成需求时才会触发该技能。 - 更新环境变量说明：`ANTHROPIC_API_KEY` 现为必须项，`QWEN_API_KEY` 为主要图像生成供应商，其他供应商为可选。 - 描述与实际依赖环境变量保持一致，移除未被依赖的 KEY。 - 优化安全声明和 API Key 配置指引，更突出本地存储与直连安全性。

v1.0.0

Initial release of ecommerce-image-suite — one-click generation of full e-commerce image sets. - Upload product images to automatically generate compliant image suites (main, selling point, material, scene, model, multi-scene images) for major domestic and global e-commerce platforms. - Supports AI analysis of product images: extracts key selling points, generates editable, structured descriptions. - Adapts to platform-specific size, language, and style requirements (e.g. 淘宝, 京东, 拼多多, 抖音, Amazon, Shopify). - Integrates with 5 image generation providers (OpenAI, Gemini, Stability, 千问, 豆包); allows flexible provider selection based on available API keys. - Canvas overlay workflow to add platform-specific text to images, following configurable layout and style per image type. - Interactive React Artifact UI: guides users through API setup, image upload, provider selection, configuration, generation, preview and download.

元数据

Slug ecommerce-image-suite

版本 1.0.6

许可证 MIT-0

累计安装 0

当前安装数 0

历史版本数 7

常见问题

ecommerce-image-suite 是什么？

电商套图生成助手。用户明确提出需要生成电商套图、商品主图、卖点图、场景图、模特图等图片内容时触发。支持国内平台（淘宝、京东、拼多多、抖音）与国际跨境平台（Amazon、独立站）的尺寸规范。触发示例：「帮我生成这件T恤的电商套图」「做一套淘宝主图」「生成亚马逊listing图片」。不应在用户仅上传图片但未明确... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件，目前累计下载 177 次。

如何安装 ecommerce-image-suite？

在 OpenClaw 或 Claude Code 对话框中运行命令「/install ecommerce-image-suite」即可一键安装，无需额外配置。

ecommerce-image-suite 是免费的吗？

是的，ecommerce-image-suite 完全免费，采用 MIT-0 许可证，可自由下载、安装和使用。

ecommerce-image-suite 支持哪些平台？

ecommerce-image-suite 跨平台运行，可在任意部署了 OpenClaw / Claude Code 的环境中使用（cross-platform）。

谁开发了 ecommerce-image-suite？

由 🍡（@wzj177）开发并维护，当前版本 v1.0.6。

ecommerce-image-suite