← 返回 Skills 市场
dai-shuo

IMA AI Image & Photo Generator — Poster, Thumbnail, Logo, Art, Illustration, Product & Social Media Graphic Design

作者 Dai Shuo · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ✓ 安全检测通过
373
总下载
0
收藏
0
当前安装
1
版本数
在 OpenClaw 中安装
/install ima-ai-image-photo-generator
功能描述
AI image generator and photo generator with SeeDream 4.5, Midjourney, Nano Banana 2, and Nano Banana Pro. Generate AI images for posters, thumbnails, logos,...
使用说明 (SKILL.md)

IMA AI Image & Photo Generator

⚠️ MANDATORY: You MUST read("SKILL-DETAIL.md") (full file — do NOT pass limit parameter) before your first image generation call. It contains the full API payload structure, error translation tables, upload flow, UX protocol, and FAQ that this summary omits. Skipping it causes parameter errors and poor user experience.

Model ID Reference (CRITICAL)

Use exact model_id. Do NOT infer from friendly names.

Name model_id Cost Best For
SeeDream 4.5 🌟 doubao-seedream-4.5 5 pts Default, product photos, posters, social media graphics
Midjourney 🎨 midjourney 8-10 pts Art, illustrations, creative graphic design
Nano Banana2 💚 gemini-3.1-flash-image 4-13 pts Budget thumbnails, quick social media images
Nano Banana Pro gemini-3-pro-image 10-18 pts Premium 4K photos, logos, product images

Aliases: 可梦/SeeDream → doubao-seedream-4.5 · MJ/Midjourney → midjourney · 香蕉/Banana → gemini-3.1-flash-image · 香蕉Pro → gemini-3-pro-image

Task Types

User intent task_type Use case
Text only → image text_to_image Generate poster, thumbnail, logo, art from description
Input image + edit image_to_image Style transfer, product photo editing, graphic design refinement

Visual Consistency (IMPORTANT)

For "same character" / "series" / "same product" across multiple images:

  • Do NOT use text_to_image (produces different-looking results each time)
  • Use image_to_image with previous result as reference
  • Read ima-knowledge-ai/references/visual-consistency.md if available

Knowledge Base (if ima-knowledge-ai installed)

Read before generating: visual-consistency.md (character/product continuity), model-selection.md (cost/quality), workflow-design.md (multi-step projects).

Parameter Support

Model Aspect Ratio Sizes Notes
SeeDream 4.5 ✅ 8 ratios (1:1, 16:9, 9:16, 4:3, 3:4, 2:3, 3:2, 21:9) adaptive Best value for posters & social media
Nano Banana2 ✅ 5 ratios (1:1, 16:9, 9:16, 4:3, 3:4) 512px/1K/2K/4K Budget image & thumbnail generator
Nano Banana Pro ✅ 5 ratios 1K/2K/4K Premium photo & logo generator
Midjourney 🎨 ❌ 1:1 only 480p/720p Art & illustration generator

If user asks for custom aspect ratio with Midjourney → recommend SeeDream 4.5 or Nano Banana. No model supports 8K (max 4K).

Default Models

Scenario Model model_id Cost
General image/photo SeeDream 4.5 doubao-seedream-4.5 5 pts
Art/illustration Midjourney midjourney 8-10 pts
Budget/fast thumbnail Nano Banana2 gemini-3.1-flash-image 4 pts
Premium 4K product photo Nano Banana Pro gemini-3-pro-image 10-18 pts

Model Selection Priority

  1. User preference (if explicitly stated) → highest priority
  2. ima-knowledge-ai recommendation (if installed)
  3. Fallback defaults above

Script Usage

# AI image generator — text to image (poster, thumbnail, social media graphic)
python3 {baseDir}/scripts/ima_image_create.py \
  --api-key $IMA_API_KEY --task-type text_to_image \
  --model-id doubao-seedream-4.5 --prompt "product photo for social media" \
  --user-id {user_id} --output-json

# AI photo generator — image to image (product photo editing, logo refinement)
python3 {baseDir}/scripts/ima_image_create.py \
  --api-key $IMA_API_KEY --task-type image_to_image \
  --model-id doubao-seedream-4.5 --prompt "enhance and refine" \
  --input-images https://example.com/photo.jpg \
  --user-id {user_id} --output-json

# List available image models
python3 {baseDir}/scripts/ima_image_create.py \
  --api-key $IMA_API_KEY --task-type text_to_image --list-models

Sending Results to User

# ✅ CORRECT: Use remote URL directly for inline image display
message(action="send", media=image_url, caption="✅ 图片生成成功!\
• 模型:[Name]\
• 耗时:[X]s\
• 积分:[N pts]\
🔗 原始链接:[url]")

# ❌ WRONG: Never download to local file

UX Protocol (Brief)

  1. Acknowledge: Short reply ("好的!来帮你画 🎨")
  2. Pre-gen: Model, time, cost via message tool
  3. Progress: Every 15-30s: "⏳ [P]%" (cap 95%)
  4. Success: Send media=url + include link in caption
  5. Failure: Natural language + suggest alternatives
  6. Done: No further action

Never expose: script names, API endpoints, attribute_id, technical params.

User Preferences

Storage: ~/.openclaw/memory/ima_prefs.json

  • Save on explicit: "用XXX" / "默认用XXX" / "always use XXX"
  • Clear on: "推荐一个" / "自动选择" / "用最好的"
  • Never save auto-selected models

Core Flow

  1. GET /open/v1/product/list?category=\x3Ctask_type>attribute_id, credit, form_config
  2. [image_to_image only] Upload local files → get CDN URL
  3. POST /open/v1/tasks/createtask_id
  4. POST /open/v1/tasks/detail → poll until resource_status==1

MANDATORY: Always query product list first. Missing attribute_id → task fails.

Environment

Base URL: https://api.imastudio.com Headers: Authorization: Bearer $IMA_API_KEY · x-app-source: ima_skills · x_app_language: en


⚠️ REMINDER: read("SKILL-DETAIL.md") (full file, no limit) is required before generating images. This summary covers model selection and routing — SKILL-DETAIL.md has complete API payloads, error handling, upload flow, parameter details, aspect ratio FAQ, and UX protocol needed for correct execution.

安全使用建议
This skill appears to do what it claims, but review these practical privacy/security points before installing: - The single required secret is IMA_API_KEY. That key will be sent to both api.imastudio.com and imapi.liveme.com (the upload/CDN service) during normal operation. Only provide a key you trust IMA Studio with; do not reuse high-privilege keys for unrelated services. - Image-to-image with local files will upload your local image bytes to the IMA CDN (imapi.liveme.com). If you need to process sensitive images, avoid uploading or confirm the provider's privacy policy. - The skill writes logs and a prefs file under ~/.openclaw. Logs may include filenames, truncated URLs, and status messages. Review or rotate logs if they may contain sensitive info. - The code includes a hardcoded APP_ID/APP_KEY (documented as non-secret) and mentions optional APP_UID/APP_TOKEN env vars in comments; these are not required by the skill as packaged. If you see requests to configure extra environment variables later, treat them as optional and validate why they are needed. - The skill may read optional knowledge files from ~/.openclaw/skills/ima-knowledge-ai/references/* for visual-consistency guidance; this cross-skill read is declared in metadata. If you have sensitive data in other skills' reference folders, be aware of the read scope. If you trust imastudio.com / imapi.liveme.com and are comfortable with uploads of local images and storing logs locally, this skill is proportionate to its stated purpose. If not, consider using a generator with stricter local-only handling or one that documents on-premise operation.
功能分析
Type: OpenClaw Skill Name: ima-ai-image-photo-generator Version: 1.0.0 The skill bundle is a legitimate integration for the IMA Studio AI image generation service. The primary script, `scripts/ima_image_create.py`, facilitates text-to-image and image-to-image tasks by communicating with documented IMA Studio endpoints (api.imastudio.com and imapi.liveme.com). It includes robust error handling, a 'reflection' mechanism for automatic retries, and local preference management. While the script can read local files to upload them for image-to-image processing, this behavior is transparently documented and aligned with the tool's stated purpose. No evidence of malicious intent, data exfiltration to unauthorized parties, or prompt injection attacks was found.
能力评估
Purpose & Capability
Name/description (image & photo generation) matches the declared requirements (python3, requests) and the code: HTTP calls to IMA APIs and an upload service, model selection, text→image and image→image flows. Requiring IMA_API_KEY as the primary credential is expected.
Instruction Scope
Runtime instructions require reading the included SKILL-DETAIL.md and (optionally) cross-skill knowledge files for visual-consistency guidance; the README explicitly instructs agents how to send progress messages. This is within scope for a complex image-generation skill, but notable because the agent is instructed to read local skill-reference files and to not save generation results locally while still supporting uploading local input images.
Install Mechanism
No external install/download steps are present; the skill is instruction- and script-based and only depends on Python and the requests package listed in requirements.txt. No remote arbitrary code downloads or obscure URLs were used.
Credentials
Only one credential is declared (IMA_API_KEY), which is appropriate. The code does send that API key to both api.imastudio.com and a separate upload domain (imapi.liveme.com) when uploading local images — this is documented in the code but is a privacy consideration: local images and the API key are used for authenticated uploads to a CDN. The bundle also hardcodes an APP_KEY/APP_ID (marked non-secret in comments) and writes logs/preferences under the user's home directory.
Persistence & Privilege
The skill requests read/write for its own prefs and log directory (~/.openclaw/memory/ima_prefs.json and ~/.openclaw/logs/ima_skills/) and optionally reads other skill references if present. It does not request always:true nor attempt to modify other skills' configs.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install ima-ai-image-photo-generator
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /ima-ai-image-photo-generator 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.0
Initial release: SDO-optimized image generation skill
元数据
Slug ima-ai-image-photo-generator
版本 1.0.0
许可证 MIT-0
累计安装 0
当前安装数 0
历史版本数 1
常见问题

IMA AI Image & Photo Generator — Poster, Thumbnail, Logo, Art, Illustration, Product & Social Media Graphic Design 是什么?

AI image generator and photo generator with SeeDream 4.5, Midjourney, Nano Banana 2, and Nano Banana Pro. Generate AI images for posters, thumbnails, logos,... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 373 次。

如何安装 IMA AI Image & Photo Generator — Poster, Thumbnail, Logo, Art, Illustration, Product & Social Media Graphic Design?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install ima-ai-image-photo-generator」即可一键安装,无需额外配置。

IMA AI Image & Photo Generator — Poster, Thumbnail, Logo, Art, Illustration, Product & Social Media Graphic Design 是免费的吗?

是的,IMA AI Image & Photo Generator — Poster, Thumbnail, Logo, Art, Illustration, Product & Social Media Graphic Design 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。

IMA AI Image & Photo Generator — Poster, Thumbnail, Logo, Art, Illustration, Product & Social Media Graphic Design 支持哪些平台?

IMA AI Image & Photo Generator — Poster, Thumbnail, Logo, Art, Illustration, Product & Social Media Graphic Design 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 IMA AI Image & Photo Generator — Poster, Thumbnail, Logo, Art, Illustration, Product & Social Media Graphic Design?

由 Dai Shuo(@dai-shuo)开发并维护,当前版本 v1.0.0。

💬 留言讨论