← 返回 Skills 市场
jau123

AI Image Generator & Editor — GPT Image 2, Nanobanana, ComfyUI

作者 jau123 · GitHub ↗ · v1.0.30 · MIT-0
cross-platform ✓ 安全检测通过
2305
总下载
5
收藏
12
当前安装
31
版本数
在 OpenClaw 中安装
/install creative-toolkit
功能描述
Generate images from text with multi-provider routing — supports GPT Image 2.0 (near-perfect text rendering), Nanobanana 2, Seedream 5.0, Midjourney V7 (phot...
使用说明 (SKILL.md)

Creative Toolkit

Generate professional AI images through a unified interface that routes across multiple providers. Search curated prompts, enhance ideas into production-ready descriptions, and manage local ComfyUI workflows — all from a single MCP server.

Quick Start

Add the MCP server to your mcporter config (~/.config/mcporter/config.json):

{
  "mcpServers": {
    "creative-toolkit": {
      "command": "npx",
      "args": ["-y", "[email protected]"]
    }
  }
}

Free tools (search, enhance, inspire) work immediately — no API key needed:

mcporter call creative-toolkit.search_gallery query="cyberpunk"
mcporter call creative-toolkit.enhance_prompt brief="a cat in space" style="realistic"

To unlock image generation, configure one of these providers:

Provider Config What you need
MeiGen Cloud MEIGEN_API_TOKEN Token from meigen.ai (avatar → Settings → API Keys)
Local ComfyUI comfyuiUrl A running ComfyUI instance — no external API needed
Any OpenAI-compatible API openaiApiKey + openaiBaseUrl + openaiModel Your own key from Together AI, Fireworks AI, etc.

Set credentials in ~/.clawdbot/.env, ~/.config/meigen/config.json, or add an "env" block to the mcporter config above. See references/providers.md for details.

Available Tools

Free — no API key required

Tool What it does
search_gallery Semantic search across 1,300+ AI image prompts. Supports category filtering and curated browsing. Returns prompt text, thumbnails, and metadata.
get_inspiration Get the full prompt and high-res images for any gallery entry. Use after search_gallery to get copyable prompts.
enhance_prompt Expand a brief idea into a detailed, style-aware prompt with lighting, composition, and material directions. Supports realistic, anime, and illustration styles.
list_models List all available models across configured providers with capabilities and supported features.

Requires configured provider

Tool What it does
generate_image Generate an image from a text prompt. Routes to the best available provider. Supports aspect ratio, seed, and reference images.
generate_image (with local paths) Pass local file paths directly in referenceImages — images are auto-compressed locally (max 2MB, 2048px) and prepared for the selected provider. ComfyUI handles local files entirely within the local workflow.
comfyui_workflow List, view, import, modify, and delete ComfyUI workflow templates. Adjust steps, CFG scale, sampler, and checkpoint without editing JSON.
manage_preferences Save and load user preferences (default style, aspect ratio, style notes, favorite prompts).

Important Rules

Never describe generated images

You cannot see generated images. After generation, only present the exact data from the tool response:

**Direction 1: Modern Minimal**
- Image URL: https://images.meigen.art/...
- Saved to: ~/Pictures/meigen/2026-02-08_xxxx.jpg

Do NOT write creative commentary about what the image "looks like".

Never specify model or provider

Do NOT pass model or provider to generate_image unless the user explicitly asks. The server auto-selects the best available provider and model.

Midjourney V7 vs Niji 7

Both take ~60s, accept 1 reference image, and return 4 candidate images per generation. Advanced params (stylize/chaos/weird/raw/iw/sw/sv) run with fixed server-side defaults and cannot be tuned from MCP — the only exception is sref, settable via --sref \x3Ccode> at the end of the prompt (Midjourney style codes only, e.g. 3799554500; no URLs or local paths). They differ in content focus and prompt enhancement style:

  • Midjourney V7 (model: "midjourney-v7") — general / photorealistic. Use for product photography, portraits, landscapes, cinematic and editorial shots. When enhancing, use style: 'realistic' (the default).
  • Midjourney Niji 7 (model: "midjourney-niji7") — anime / illustration ONLY. Do NOT use for photorealistic, product, or non-anime content — use GPT Image 2.0 or Nanobanana 2 instead. The server auto-appends anime illustration style if your prompt lacks anime keywords. When enhancing, ALWAYS pass style: 'anime' to enhance_prompt — the default realistic produces prompts poorly suited for anime models.

Always confirm before generating multiple images

When the user wants multiple variations, present options first and ask which direction(s) to try. Include an "all of the above" option. Never auto-generate all variants without user confirmation.


Workflow Modes

Mode 1: Single Image

User wants one image. Write a prompt (or call enhance_prompt if the description is brief), generate, present URL + path.

Mode 2: Prompt Enhancement + Generation

For brief ideas (under ~30 words, lacking visual details), enhance first:

1. enhance_prompt brief="futuristic city" style="realistic"
   -> Returns detailed prompt with camera lens, lighting, atmospheric effects

2. generate_image prompt="\x3Cenhanced prompt>"
   -> Omit aspectRatio to let MeiGen auto-infer (recommended). Pass an explicit
      value like aspectRatio="16:9" only when the user asked for that ratio.

Mode 3: Parallel Generation (2+ images)

User needs multiple variations — different directions, styles, or concepts.

  1. Plan directions, present as a table
  2. Ask user which direction(s) to try
  3. Write distinct prompts for each — don't just tweak one word
  4. Generate selected directions (max 4 parallel for API providers, 1 at a time for ComfyUI)
  5. Present URLs + paths

Mode 4: Multi-Step Creative (base + extensions)

User wants a base design plus derivatives (e.g., "design a logo and make mockups").

  1. Plan 3-5 directions, ask user which to try
  2. Generate selected direction(s)
  3. Present results, ask user to approve or try another
  4. Plan extensions using the approved Image URL as referenceImages
  5. Generate extensions

Never jump from plan to generating everything at once.

Mode 5: Edit/Modify Existing Image

User provides an image and asks for changes (add text, change background, etc.).

  • Pass the image (URL or local path) as referenceImages, then generate with a short, literal prompt describing ONLY the edit
  • The reference image carries all visual context — do NOT re-describe the original image
  • Example prompt: "Add the text 'meigen.ai' at the bottom of this image"

Mode 6: Inspiration Search

1. search_gallery query="dreamy portrait with soft light"
   -> Finds semantically similar prompts with thumbnails

2. get_inspiration id="\x3Centry_id>"
   -> Get full prompt text — copy and modify for your own generation

Mode 7: Reference Image Generation

Use an existing image to guide visual style. Pass URLs or local file paths directly to referenceImages.

generate_image prompt="coffee mug mockup with this logo" referenceImages=["~/Desktop/my-logo.png"]
   -> Local files are auto-compressed (max 2MB, 2048px) and prepared for the selected provider

Reference image sources: gallery URLs, previous generation URLs, or local file paths. All providers accept local paths — they are automatically handled.

Mode 8: ComfyUI Workflows

1. comfyui_workflow action="list"           -> See saved workflows
2. comfyui_workflow action="view" name="txt2img"  -> See adjustable parameters
3. comfyui_workflow action="modify" name="txt2img" modifications={"steps": 30}
4. generate_image prompt="..." workflow="txt2img"  -> Generate

Alternative Providers

You can use your own OpenAI-compatible API or a local ComfyUI instance instead of — or alongside — the default MeiGen provider. See references/providers.md for detailed configuration and provider comparison. For MeiGen model pricing, see https://www.meigen.ai/model-comparison.

Troubleshooting

See references/troubleshooting.md for common issues, solutions, and security & privacy details.

安全使用建议
This skill is coherent for an image-generation toolkit, but it runs a pinned npm package at runtime (npx [email protected]). Before installing or running: 1) Inspect the referenced npm package and GitHub repository (verify the source code and recent commits). 2) If you plan to use cloud providers, restrict tokens to least privilege and store them in local config files with tight permissions (chmod 600) as suggested. 3) If privacy is a concern, prefer a local ComfyUI provider so image data never leaves your machine. 4) Treat the skill's 'no telemetry' statement as a claim until you review the package source or network behaviour. If you cannot review the package, consider running it in an isolated environment (container or VM) first.
功能分析
Type: OpenClaw Skill Name: creative-toolkit Version: 1.0.30 The skill bundle describes a legitimate AI image generation toolkit that routes requests to various providers including MeiGen, OpenAI-compatible APIs, and local ComfyUI instances. It handles local file paths for reference images and manages API keys in local configuration files (~/.config/meigen/config.json), which is consistent with its stated purpose. The instructions in SKILL.md provide operational constraints for the AI agent (e.g., not describing images) without any malicious prompt injection, and the use of a pinned npm package version ([email protected]) follows security best practices for MCP servers.
能力标签
requires-sensitive-credentials
能力评估
Purpose & Capability
The name/description (multi-provider image generation + prompt tools) matches the declared requirements: mcporter (to run an MCP server), node and npx (to run the pinned npm package). The skill only references provider credentials that are appropriate for image providers (MEIGEN_API_TOKEN, openaiApiKey, comfyuiUrl). No unrelated credentials or binaries are requested.
Instruction Scope
SKILL.md stays on-topic: it instructs adding an MCP server entry, using mcporter calls, and configuring provider credentials or a local ComfyUI URL. It also instructs local operations (in-memory image compression) and explicit user-supplied local file paths. One claim — 'no telemetry' and 'tokens never logged or transmitted elsewhere' — is written in the docs but cannot be verified from the instruction-only skill itself (the actual behaviour depends on the remote npm package that will be executed).
Install Mechanism
There is no install spec inside the skill bundle, but runtime instructions tell the user/agent to run 'npx [email protected]'. Fetching and executing a pinned npm package at runtime is a moderate-risk install mechanism. The package is pinned and the README points to an npm page and GitHub repo, which reduces but does not eliminate risk — you should review the package source before executing.
Credentials
The skill manifest declares no required env vars. SKILL.md documents optional provider credentials (MEIGEN_API_TOKEN, openaiApiKey/openaiBaseUrl/openaiModel) or a local comfyuiUrl — all reasonable and proportional to an image-generation tool. It advises storing tokens locally with restricted permissions, which is appropriate. There are no unexpected or unrelated secret requests.
Persistence & Privilege
The skill is not marked always:true and does not request system-wide privileges. It asks you to add an MCP server entry to your mcporter config (expected for MCP-based tools). Autonomous invocation remains possible (default) but that is platform-default behavior and not a new privilege introduced by this skill.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install creative-toolkit
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /creative-toolkit 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.30
Display name update: lead with GPT Image 2 (the default model), use 'Generator' (matches existing ClawHub branding)
v1.0.29
Restore display name (was reset to 'Openclaw' due to missing --name flag in 1.0.27/1.0.28 publishes)
v1.0.28
Pin npm to [email protected] (gallery library refresh: 1446 entries, 6 new categories)
v1.0.27
Remove hardcoded credit pricing — link to model-comparison page instead
v1.0.26
Pricing hotfix over 1.0.25: gpt-image-2 default resolution now 1K (10 credits) to match platform product default. Previous 2K fallback silently doubled cost. Expose resolution/quality parameters. Pinned to [email protected].
v1.0.25
Default model switched to GPT Image 2.0 (near-perfect text rendering, 10 credits). Retired GPT Image 1.5. Pinned to [email protected].
v1.0.24
Rephrase reference-image handling to emphasize user-initiated, local compression; bump pinned version to [email protected]
v1.0.23
Add Midjourney V7 guidance; aspect ratio auto-inference default; clarify sref usage (codes only)
v1.0.22
Remove standalone upload_reference_image tool; local reference images are now auto-compressed and uploaded inline via generate_image
v1.0.21
Revert to 1.0.16 baseline content (OpenClaw Benign); VirusTotal false positive to be addressed via ClawHub issue
v1.0.20
Minimize trigger keywords in tool descriptions to resolve VirusTotal Code Insights false positive — functional details preserved in MCP server tool schema
v1.0.19
Add inline security notes in SKILL.md and rephrase upload descriptions — resolve VirusTotal Code Insights false positive while preserving safety guardrails for OpenClaw scanner
v1.0.18
Revert 1.0.17 phrasing changes that caused OpenClaw scanner regression; restore safety guardrail language from 1.0.16
v1.0.17
Rephrase upload_reference_image descriptions to avoid false positive security flags — remove command-style phrasing, add explanatory context about compression and format validation
v1.0.16
Add Niji 7 anime-only guidance and multi-image support notes; bump pinned npm version to 1.2.6
v1.0.15
Rephrase upload instructions to avoid security scanner false positives
v1.0.14
Prevent LLM from constructing manual HTTP upload requests — must use upload_reference_image MCP tool directly
v1.0.13
Fix security audit: remove MEIGEN_API_TOKEN from required env (optional, not mandatory), clarify 3 equal provider options in Quick Start, simplify mcporter config example
v1.0.12
Add behavioral rules (never describe images, never specify model/provider, confirm before batch), expand workflow modes from 5 to 8 (parallel generation, multi-step creative, edit/modify), rename Generator to Generation
v1.0.11
Restructure skill: split SKILL.md into core instructions + references/; add 24h upload expiry notice; add ComfyUI local file support; rename OpenAI to OpenAI-compatible
元数据
Slug creative-toolkit
版本 1.0.30
许可证 MIT-0
累计安装 12
当前安装数 12
历史版本数 31
常见问题

AI Image Generator & Editor — GPT Image 2, Nanobanana, ComfyUI 是什么?

Generate images from text with multi-provider routing — supports GPT Image 2.0 (near-perfect text rendering), Nanobanana 2, Seedream 5.0, Midjourney V7 (phot... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 2305 次。

如何安装 AI Image Generator & Editor — GPT Image 2, Nanobanana, ComfyUI?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install creative-toolkit」即可一键安装,无需额外配置。

AI Image Generator & Editor — GPT Image 2, Nanobanana, ComfyUI 是免费的吗?

是的,AI Image Generator & Editor — GPT Image 2, Nanobanana, ComfyUI 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。

AI Image Generator & Editor — GPT Image 2, Nanobanana, ComfyUI 支持哪些平台?

AI Image Generator & Editor — GPT Image 2, Nanobanana, ComfyUI 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 AI Image Generator & Editor — GPT Image 2, Nanobanana, ComfyUI?

由 jau123(@jau123)开发并维护,当前版本 v1.0.30。

💬 留言讨论