IMA AI Image & Photo Generator — Poster, Thumbnail, Logo, Art, Illustration, Product & Social Media Graphic Design

Name: IMA AI Image & Photo Generator — Poster, Thumbnail, Logo, Art, Illustration, Product & Social Media Graphic Design
Author: dai-shuo

作者 Dai Shuo · GitHub ↗ · v1.0.0 · MIT-0

cross-platform ✓ 安全检测通过

373

总下载

当前安装

版本数

在 OpenClaw 中安装

/install ima-ai-image-photo-generator

功能描述

AI image generator and photo generator with SeeDream 4.5, Midjourney, Nano Banana 2, and Nano Banana Pro. Generate AI images for posters, thumbnails, logos,...

使用说明 (SKILL.md)

IMA AI Image & Photo Generator

⚠️ MANDATORY: You MUST read("SKILL-DETAIL.md") (full file — do NOT pass limit parameter) before your first image generation call. It contains the full API payload structure, error translation tables, upload flow, UX protocol, and FAQ that this summary omits. Skipping it causes parameter errors and poor user experience.

Model ID Reference (CRITICAL)

Use exact model_id. Do NOT infer from friendly names.

Name	model_id	Cost	Best For
SeeDream 4.5 🌟	`doubao-seedream-4.5`	5 pts	Default, product photos, posters, social media graphics
Midjourney 🎨	`midjourney`	8-10 pts	Art, illustrations, creative graphic design
Nano Banana2 💚	`gemini-3.1-flash-image`	4-13 pts	Budget thumbnails, quick social media images
Nano Banana Pro	`gemini-3-pro-image`	10-18 pts	Premium 4K photos, logos, product images

Aliases: 可梦/SeeDream → doubao-seedream-4.5 · MJ/Midjourney → midjourney · 香蕉/Banana → gemini-3.1-flash-image · 香蕉Pro → gemini-3-pro-image

Task Types

User intent	task_type	Use case
Text only → image	`text_to_image`	Generate poster, thumbnail, logo, art from description
Input image + edit	`image_to_image`	Style transfer, product photo editing, graphic design refinement

Visual Consistency (IMPORTANT)

For "same character" / "series" / "same product" across multiple images:

Do NOT use text_to_image (produces different-looking results each time)
Use image_to_image with previous result as reference
Read ima-knowledge-ai/references/visual-consistency.md if available

Knowledge Base (if ima-knowledge-ai installed)

Read before generating: visual-consistency.md (character/product continuity), model-selection.md (cost/quality), workflow-design.md (multi-step projects).

Parameter Support

Model	Aspect Ratio	Sizes	Notes
SeeDream 4.5	✅ 8 ratios (1:1, 16:9, 9:16, 4:3, 3:4, 2:3, 3:2, 21:9)	adaptive	Best value for posters & social media
Nano Banana2	✅ 5 ratios (1:1, 16:9, 9:16, 4:3, 3:4)	512px/1K/2K/4K	Budget image & thumbnail generator
Nano Banana Pro	✅ 5 ratios	1K/2K/4K	Premium photo & logo generator
Midjourney 🎨	❌ 1:1 only	480p/720p	Art & illustration generator

If user asks for custom aspect ratio with Midjourney → recommend SeeDream 4.5 or Nano Banana. No model supports 8K (max 4K).

Default Models

Scenario	Model	model_id	Cost
General image/photo	SeeDream 4.5	`doubao-seedream-4.5`	5 pts
Art/illustration	Midjourney	`midjourney`	8-10 pts
Budget/fast thumbnail	Nano Banana2	`gemini-3.1-flash-image`	4 pts
Premium 4K product photo	Nano Banana Pro	`gemini-3-pro-image`	10-18 pts

Model Selection Priority

User preference (if explicitly stated) → highest priority
ima-knowledge-ai recommendation (if installed)
Fallback defaults above

Script Usage

# AI image generator — text to image (poster, thumbnail, social media graphic)
python3 {baseDir}/scripts/ima_image_create.py \
  --api-key $IMA_API_KEY --task-type text_to_image \
  --model-id doubao-seedream-4.5 --prompt "product photo for social media" \
  --user-id {user_id} --output-json

# AI photo generator — image to image (product photo editing, logo refinement)
python3 {baseDir}/scripts/ima_image_create.py \
  --api-key $IMA_API_KEY --task-type image_to_image \
  --model-id doubao-seedream-4.5 --prompt "enhance and refine" \
  --input-images https://example.com/photo.jpg \
  --user-id {user_id} --output-json

# List available image models
python3 {baseDir}/scripts/ima_image_create.py \
  --api-key $IMA_API_KEY --task-type text_to_image --list-models

Sending Results to User

# ✅ CORRECT: Use remote URL directly for inline image display
message(action="send", media=image_url, caption="✅ 图片生成成功！\
• 模型：[Name]\
• 耗时：[X]s\
• 积分：[N pts]\
🔗 原始链接：[url]")

# ❌ WRONG: Never download to local file

UX Protocol (Brief)

Acknowledge: Short reply ("好的！来帮你画 🎨")
Pre-gen: Model, time, cost via message tool
Progress: Every 15-30s: "⏳ [P]%" (cap 95%)
Success: Send media=url + include link in caption
Failure: Natural language + suggest alternatives
Done: No further action

Never expose: script names, API endpoints, attribute_id, technical params.

User Preferences

Storage: ~/.openclaw/memory/ima_prefs.json

Save on explicit: "用XXX" / "默认用XXX" / "always use XXX"
Clear on: "推荐一个" / "自动选择" / "用最好的"
Never save auto-selected models

Core Flow

GET /open/v1/product/list?category=\x3Ctask_type> → attribute_id, credit, form_config
[image_to_image only] Upload local files → get CDN URL
POST /open/v1/tasks/create → task_id
POST /open/v1/tasks/detail → poll until resource_status==1

MANDATORY: Always query product list first. Missing attribute_id → task fails.

Environment

Base URL: https://api.imastudio.com Headers: Authorization: Bearer $IMA_API_KEY · x-app-source: ima_skills · x_app_language: en

⚠️ REMINDER: read("SKILL-DETAIL.md") (full file, no limit) is required before generating images. This summary covers model selection and routing — SKILL-DETAIL.md has complete API payloads, error handling, upload flow, parameter details, aspect ratio FAQ, and UX protocol needed for correct execution.

安全使用建议

This skill appears to do what it claims, but review these practical privacy/security points before installing: - The single required secret is IMA_API_KEY. That key will be sent to both api.imastudio.com and imapi.liveme.com (the upload/CDN service) during normal operation. Only provide a key you trust IMA Studio with; do not reuse high-privilege keys for unrelated services. - Image-to-image with local files will upload your local image bytes to the IMA CDN (imapi.liveme.com). If you need to process sensitive images, avoid uploading or confirm the provider's privacy policy. - The skill writes logs and a prefs file under ~/.openclaw. Logs may include filenames, truncated URLs, and status messages. Review or rotate logs if they may contain sensitive info. - The code includes a hardcoded APP_ID/APP_KEY (documented as non-secret) and mentions optional APP_UID/APP_TOKEN env vars in comments; these are not required by the skill as packaged. If you see requests to configure extra environment variables later, treat them as optional and validate why they are needed. - The skill may read optional knowledge files from ~/.openclaw/skills/ima-knowledge-ai/references/* for visual-consistency guidance; this cross-skill read is declared in metadata. If you have sensitive data in other skills' reference folders, be aware of the read scope. If you trust imastudio.com / imapi.liveme.com and are comfortable with uploads of local images and storing logs locally, this skill is proportionate to its stated purpose. If not, consider using a generator with stricter local-only handling or one that documents on-premise operation.

功能分析

Type: OpenClaw Skill Name: ima-ai-image-photo-generator Version: 1.0.0 The skill bundle is a legitimate integration for the IMA Studio AI image generation service. The primary script, `scripts/ima_image_create.py`, facilitates text-to-image and image-to-image tasks by communicating with documented IMA Studio endpoints (api.imastudio.com and imapi.liveme.com). It includes robust error handling, a 'reflection' mechanism for automatic retries, and local preference management. While the script can read local files to upload them for image-to-image processing, this behavior is transparently documented and aligned with the tool's stated purpose. No evidence of malicious intent, data exfiltration to unauthorized parties, or prompt injection attacks was found.

能力评估

✓ Purpose & Capability

Name/description (image & photo generation) matches the declared requirements (python3, requests) and the code: HTTP calls to IMA APIs and an upload service, model selection, text→image and image→image flows. Requiring IMA_API_KEY as the primary credential is expected.

ℹ Instruction Scope

Runtime instructions require reading the included SKILL-DETAIL.md and (optionally) cross-skill knowledge files for visual-consistency guidance; the README explicitly instructs agents how to send progress messages. This is within scope for a complex image-generation skill, but notable because the agent is instructed to read local skill-reference files and to not save generation results locally while still supporting uploading local input images.

✓ Install Mechanism

No external install/download steps are present; the skill is instruction- and script-based and only depends on Python and the requests package listed in requirements.txt. No remote arbitrary code downloads or obscure URLs were used.

ℹ Credentials

Only one credential is declared (IMA_API_KEY), which is appropriate. The code does send that API key to both api.imastudio.com and a separate upload domain (imapi.liveme.com) when uploading local images — this is documented in the code but is a privacy consideration: local images and the API key are used for authenticated uploads to a CDN. The bundle also hardcodes an APP_KEY/APP_ID (marked non-secret in comments) and writes logs/preferences under the user's home directory.

✓ Persistence & Privilege

The skill requests read/write for its own prefs and log directory (~/.openclaw/memory/ima_prefs.json and ~/.openclaw/logs/ima_skills/) and optionally reads other skill references if present. It does not request always:true nor attempt to modify other skills' configs.

如何使用

确保已安装 OpenClaw（本地或 Docker 部署）
在对话框中输入安装命令：/install ima-ai-image-photo-generator
安装完成后，直接呼叫该 Skill 的名称或使用 /ima-ai-image-photo-generator 触发
根据 Skill 的参数说明提供必要输入，即可获得结构化输出

版本历史

v1.0.0

Initial release: SDO-optimized image generation skill

元数据

Slug ima-ai-image-photo-generator

版本 1.0.0

许可证 MIT-0

累计安装 0

当前安装数 0

历史版本数 1

常见问题