← Back to Skills Marketplace
dai-shuo

IMA AI Image & Photo Generator — Poster, Thumbnail, Logo, Art, Illustration, Product & Social Media Graphic Design

by Dai Shuo · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ✓ Security Clean
373
Downloads
0
Stars
0
Active Installs
1
Versions
Install in OpenClaw
/install ima-ai-image-photo-generator
Description
AI image generator and photo generator with SeeDream 4.5, Midjourney, Nano Banana 2, and Nano Banana Pro. Generate AI images for posters, thumbnails, logos,...
README (SKILL.md)

IMA AI Image & Photo Generator

⚠️ MANDATORY: You MUST read("SKILL-DETAIL.md") (full file — do NOT pass limit parameter) before your first image generation call. It contains the full API payload structure, error translation tables, upload flow, UX protocol, and FAQ that this summary omits. Skipping it causes parameter errors and poor user experience.

Model ID Reference (CRITICAL)

Use exact model_id. Do NOT infer from friendly names.

Name model_id Cost Best For
SeeDream 4.5 🌟 doubao-seedream-4.5 5 pts Default, product photos, posters, social media graphics
Midjourney 🎨 midjourney 8-10 pts Art, illustrations, creative graphic design
Nano Banana2 💚 gemini-3.1-flash-image 4-13 pts Budget thumbnails, quick social media images
Nano Banana Pro gemini-3-pro-image 10-18 pts Premium 4K photos, logos, product images

Aliases: 可梦/SeeDream → doubao-seedream-4.5 · MJ/Midjourney → midjourney · 香蕉/Banana → gemini-3.1-flash-image · 香蕉Pro → gemini-3-pro-image

Task Types

User intent task_type Use case
Text only → image text_to_image Generate poster, thumbnail, logo, art from description
Input image + edit image_to_image Style transfer, product photo editing, graphic design refinement

Visual Consistency (IMPORTANT)

For "same character" / "series" / "same product" across multiple images:

  • Do NOT use text_to_image (produces different-looking results each time)
  • Use image_to_image with previous result as reference
  • Read ima-knowledge-ai/references/visual-consistency.md if available

Knowledge Base (if ima-knowledge-ai installed)

Read before generating: visual-consistency.md (character/product continuity), model-selection.md (cost/quality), workflow-design.md (multi-step projects).

Parameter Support

Model Aspect Ratio Sizes Notes
SeeDream 4.5 ✅ 8 ratios (1:1, 16:9, 9:16, 4:3, 3:4, 2:3, 3:2, 21:9) adaptive Best value for posters & social media
Nano Banana2 ✅ 5 ratios (1:1, 16:9, 9:16, 4:3, 3:4) 512px/1K/2K/4K Budget image & thumbnail generator
Nano Banana Pro ✅ 5 ratios 1K/2K/4K Premium photo & logo generator
Midjourney 🎨 ❌ 1:1 only 480p/720p Art & illustration generator

If user asks for custom aspect ratio with Midjourney → recommend SeeDream 4.5 or Nano Banana. No model supports 8K (max 4K).

Default Models

Scenario Model model_id Cost
General image/photo SeeDream 4.5 doubao-seedream-4.5 5 pts
Art/illustration Midjourney midjourney 8-10 pts
Budget/fast thumbnail Nano Banana2 gemini-3.1-flash-image 4 pts
Premium 4K product photo Nano Banana Pro gemini-3-pro-image 10-18 pts

Model Selection Priority

  1. User preference (if explicitly stated) → highest priority
  2. ima-knowledge-ai recommendation (if installed)
  3. Fallback defaults above

Script Usage

# AI image generator — text to image (poster, thumbnail, social media graphic)
python3 {baseDir}/scripts/ima_image_create.py \
  --api-key $IMA_API_KEY --task-type text_to_image \
  --model-id doubao-seedream-4.5 --prompt "product photo for social media" \
  --user-id {user_id} --output-json

# AI photo generator — image to image (product photo editing, logo refinement)
python3 {baseDir}/scripts/ima_image_create.py \
  --api-key $IMA_API_KEY --task-type image_to_image \
  --model-id doubao-seedream-4.5 --prompt "enhance and refine" \
  --input-images https://example.com/photo.jpg \
  --user-id {user_id} --output-json

# List available image models
python3 {baseDir}/scripts/ima_image_create.py \
  --api-key $IMA_API_KEY --task-type text_to_image --list-models

Sending Results to User

# ✅ CORRECT: Use remote URL directly for inline image display
message(action="send", media=image_url, caption="✅ 图片生成成功!\
• 模型:[Name]\
• 耗时:[X]s\
• 积分:[N pts]\
🔗 原始链接:[url]")

# ❌ WRONG: Never download to local file

UX Protocol (Brief)

  1. Acknowledge: Short reply ("好的!来帮你画 🎨")
  2. Pre-gen: Model, time, cost via message tool
  3. Progress: Every 15-30s: "⏳ [P]%" (cap 95%)
  4. Success: Send media=url + include link in caption
  5. Failure: Natural language + suggest alternatives
  6. Done: No further action

Never expose: script names, API endpoints, attribute_id, technical params.

User Preferences

Storage: ~/.openclaw/memory/ima_prefs.json

  • Save on explicit: "用XXX" / "默认用XXX" / "always use XXX"
  • Clear on: "推荐一个" / "自动选择" / "用最好的"
  • Never save auto-selected models

Core Flow

  1. GET /open/v1/product/list?category=\x3Ctask_type>attribute_id, credit, form_config
  2. [image_to_image only] Upload local files → get CDN URL
  3. POST /open/v1/tasks/createtask_id
  4. POST /open/v1/tasks/detail → poll until resource_status==1

MANDATORY: Always query product list first. Missing attribute_id → task fails.

Environment

Base URL: https://api.imastudio.com Headers: Authorization: Bearer $IMA_API_KEY · x-app-source: ima_skills · x_app_language: en


⚠️ REMINDER: read("SKILL-DETAIL.md") (full file, no limit) is required before generating images. This summary covers model selection and routing — SKILL-DETAIL.md has complete API payloads, error handling, upload flow, parameter details, aspect ratio FAQ, and UX protocol needed for correct execution.

Usage Guidance
This skill appears to do what it claims, but review these practical privacy/security points before installing: - The single required secret is IMA_API_KEY. That key will be sent to both api.imastudio.com and imapi.liveme.com (the upload/CDN service) during normal operation. Only provide a key you trust IMA Studio with; do not reuse high-privilege keys for unrelated services. - Image-to-image with local files will upload your local image bytes to the IMA CDN (imapi.liveme.com). If you need to process sensitive images, avoid uploading or confirm the provider's privacy policy. - The skill writes logs and a prefs file under ~/.openclaw. Logs may include filenames, truncated URLs, and status messages. Review or rotate logs if they may contain sensitive info. - The code includes a hardcoded APP_ID/APP_KEY (documented as non-secret) and mentions optional APP_UID/APP_TOKEN env vars in comments; these are not required by the skill as packaged. If you see requests to configure extra environment variables later, treat them as optional and validate why they are needed. - The skill may read optional knowledge files from ~/.openclaw/skills/ima-knowledge-ai/references/* for visual-consistency guidance; this cross-skill read is declared in metadata. If you have sensitive data in other skills' reference folders, be aware of the read scope. If you trust imastudio.com / imapi.liveme.com and are comfortable with uploads of local images and storing logs locally, this skill is proportionate to its stated purpose. If not, consider using a generator with stricter local-only handling or one that documents on-premise operation.
Capability Analysis
Type: OpenClaw Skill Name: ima-ai-image-photo-generator Version: 1.0.0 The skill bundle is a legitimate integration for the IMA Studio AI image generation service. The primary script, `scripts/ima_image_create.py`, facilitates text-to-image and image-to-image tasks by communicating with documented IMA Studio endpoints (api.imastudio.com and imapi.liveme.com). It includes robust error handling, a 'reflection' mechanism for automatic retries, and local preference management. While the script can read local files to upload them for image-to-image processing, this behavior is transparently documented and aligned with the tool's stated purpose. No evidence of malicious intent, data exfiltration to unauthorized parties, or prompt injection attacks was found.
Capability Assessment
Purpose & Capability
Name/description (image & photo generation) matches the declared requirements (python3, requests) and the code: HTTP calls to IMA APIs and an upload service, model selection, text→image and image→image flows. Requiring IMA_API_KEY as the primary credential is expected.
Instruction Scope
Runtime instructions require reading the included SKILL-DETAIL.md and (optionally) cross-skill knowledge files for visual-consistency guidance; the README explicitly instructs agents how to send progress messages. This is within scope for a complex image-generation skill, but notable because the agent is instructed to read local skill-reference files and to not save generation results locally while still supporting uploading local input images.
Install Mechanism
No external install/download steps are present; the skill is instruction- and script-based and only depends on Python and the requests package listed in requirements.txt. No remote arbitrary code downloads or obscure URLs were used.
Credentials
Only one credential is declared (IMA_API_KEY), which is appropriate. The code does send that API key to both api.imastudio.com and a separate upload domain (imapi.liveme.com) when uploading local images — this is documented in the code but is a privacy consideration: local images and the API key are used for authenticated uploads to a CDN. The bundle also hardcodes an APP_KEY/APP_ID (marked non-secret in comments) and writes logs/preferences under the user's home directory.
Persistence & Privilege
The skill requests read/write for its own prefs and log directory (~/.openclaw/memory/ima_prefs.json and ~/.openclaw/logs/ima_skills/) and optionally reads other skill references if present. It does not request always:true nor attempt to modify other skills' configs.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install ima-ai-image-photo-generator
  3. After installation, invoke the skill by name or use /ima-ai-image-photo-generator
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
Initial release: SDO-optimized image generation skill
Metadata
Slug ima-ai-image-photo-generator
Version 1.0.0
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 1
Frequently Asked Questions

What is IMA AI Image & Photo Generator — Poster, Thumbnail, Logo, Art, Illustration, Product & Social Media Graphic Design?

AI image generator and photo generator with SeeDream 4.5, Midjourney, Nano Banana 2, and Nano Banana Pro. Generate AI images for posters, thumbnails, logos,... It is an AI Agent Skill for Claude Code / OpenClaw, with 373 downloads so far.

How do I install IMA AI Image & Photo Generator — Poster, Thumbnail, Logo, Art, Illustration, Product & Social Media Graphic Design?

Run "/install ima-ai-image-photo-generator" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is IMA AI Image & Photo Generator — Poster, Thumbnail, Logo, Art, Illustration, Product & Social Media Graphic Design free?

Yes, IMA AI Image & Photo Generator — Poster, Thumbnail, Logo, Art, Illustration, Product & Social Media Graphic Design is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does IMA AI Image & Photo Generator — Poster, Thumbnail, Logo, Art, Illustration, Product & Social Media Graphic Design support?

IMA AI Image & Photo Generator — Poster, Thumbnail, Logo, Art, Illustration, Product & Social Media Graphic Design is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created IMA AI Image & Photo Generator — Poster, Thumbnail, Logo, Art, Illustration, Product & Social Media Graphic Design?

It is built and maintained by Dai Shuo (@dai-shuo); the current version is v1.0.0.

💬 Comments