Multimodal Generate Image
/install linkfox-multimodal-generate-image
AI Image Generation
This skill guides you on how to generate and edit images using the AI image generation service, helping users create high-quality product images, modify existing images, and perform creative visual transformations.
Core Concepts
The AI Image Generation tool produces new images based on a text prompt and optional reference images. It supports a wide range of use cases:
- Text-to-image: Generate a brand-new image purely from a text description.
- Image-to-image: Provide one or more reference images and a prompt to generate a new image that preserves elements from the references.
- Image editing: Modify specific elements, colors, backgrounds, or styles in an existing image.
- Product compositing: Place a product from one image into a scene from another image.
- Model swapping: Replace the model or mannequin in a product photo.
Reference images are strongly recommended when the user wants the output to closely resemble an existing product or scene. Up to 3 reference image URLs can be provided, separated by commas.
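As a minimal sketch of the comma-separated format described above, the helper below joins a list of URLs into a single referenceImageUrl value and enforces the 3-image limit. The function name and the choice to raise ValueError are illustrative assumptions, not part of the LinkFox API.

```python
MAX_REFERENCE_IMAGES = 3  # limit stated in this skill


def join_reference_urls(urls):
    """Join reference image URLs into one comma-separated string.

    Hypothetical helper: blank entries are dropped and surrounding
    whitespace is stripped before joining.
    """
    cleaned = [u.strip() for u in urls if u and u.strip()]
    if len(cleaned) > MAX_REFERENCE_IMAGES:
        raise ValueError(
            f"at most {MAX_REFERENCE_IMAGES} reference images are supported, "
            f"got {len(cleaned)}"
        )
    return ",".join(cleaned)
```

The order of the list is preserved, which matters because prompts refer to the images as "image 1", "image 2", and so on.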
Parameter Guide
| Parameter | Required | Description | Default |
|---|---|---|---|
| prompt | Yes | Text description of the desired image. Supports text-to-image, image-to-image, editing, model swapping, and more. Max 1000 characters. | -- |
| referenceImageUrl | No | URL(s) of reference image(s). Separate multiple URLs with commas. Up to 3 images supported. Max 1000 characters. | -- |
| aspectRatio | No | Aspect ratio of the output image. | 1:1 |
Supported Aspect Ratios
| Value | Description |
|---|---|
| 1:1 | Square (default) |
| 3:4 | Portrait |
| 4:3 | Landscape |
| 9:16 | Vertical fullscreen |
| 16:9 | Horizontal fullscreen |
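The two tables above can be sketched as a small parameter builder. The parameter names (prompt, referenceImageUrl, aspectRatio), the 1000-character limits, and the supported ratios come from this document; the function name and the flat dict shape are assumptions, since the actual request envelope is defined in references/api.md.

```python
SUPPORTED_ASPECT_RATIOS = {"1:1", "3:4", "4:3", "9:16", "16:9"}
MAX_FIELD_LENGTH = 1000  # documented limit for prompt and referenceImageUrl


def build_request(prompt, reference_image_url=None, aspect_ratio="1:1"):
    """Return validated request parameters as a dict (hypothetical shape)."""
    if not prompt:
        raise ValueError("prompt is required")
    if len(prompt) > MAX_FIELD_LENGTH:
        raise ValueError("prompt exceeds 1000 characters")
    if aspect_ratio not in SUPPORTED_ASPECT_RATIOS:
        raise ValueError(f"unsupported aspect ratio: {aspect_ratio}")

    params = {"prompt": prompt, "aspectRatio": aspect_ratio}
    if reference_image_url:
        if len(reference_image_url) > MAX_FIELD_LENGTH:
            raise ValueError("referenceImageUrl exceeds 1000 characters")
        # Optional parameter: only included when a reference is supplied.
        params["referenceImageUrl"] = reference_image_url
    return params
```

For example, `build_request("Change the background color of image 1 to pure white", "https://example.com/a.png", "4:3")` yields a dict with all three keys, while omitting the last two arguments falls back to the default 1:1 ratio with no reference image.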
Prompt Writing Tips
- Be specific and descriptive: Clearly describe the subject, scene, lighting, style, and mood you want.
- Reference images by number: When using reference images, refer to them as "image 1", "image 2", etc., in the order they appear in referenceImageUrl.
- State the operation explicitly: Use clear action verbs like "replace", "change", "put", "combine", "generate".
- Keep within 1000 characters: Prompts have a maximum length of 1000 characters.
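One way to apply these tips before sending a request is a quick pre-flight check on the prompt. The sketch below is a hypothetical helper, not part of the skill: it flags prompts over 1000 characters and any "image N" mention that has no matching reference image.

```python
import re


def referenced_image_numbers(prompt):
    """Return the sorted, distinct N values from 'image N' mentions."""
    matches = re.findall(r"\bimage\s+(\d+)\b", prompt, flags=re.IGNORECASE)
    return sorted({int(n) for n in matches})


def check_prompt(prompt, reference_count):
    """Return a list of human-readable problems; empty if none were found."""
    problems = []
    if len(prompt) > 1000:
        problems.append("prompt exceeds 1000 characters")
    for n in referenced_image_numbers(prompt):
        if n < 1 or n > reference_count:
            problems.append(
                f"prompt mentions image {n} but only "
                f"{reference_count} reference image(s) were given"
            )
    return problems
```

The check is purely textual, so it catches a missing reference image but cannot tell whether the images are listed in the intended order.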
Prompt Examples by Scenario
Object replacement:
Replace the vase on the table in image 1 with a potted plant
Background color change:
Change the background color of image 1 to pure white
Product compositing:
Place the product from image 2 onto the marble countertop in image 1
Style transfer:
Transform image 1 into the artistic style shown in image 2
Text-to-image (no reference):
A professional product photo of a sleek black wireless headphone on a gradient blue background, studio lighting, 8K quality
Model swapping:
Replace the model in image 1 with a different model while keeping the same clothing and pose
API Usage
This tool calls the LinkFox tool gateway API. See references/api.md for calling conventions, request parameters, and response structure. You can also execute scripts/multimodal_generate_image.py directly to run image generation.
Display Rules
- Show the generated image: When the response contains image content in the text field, display it directly to the user using markdown image syntax.
- Status reporting: Check the status and finished fields. If image generation is still in progress, inform the user and advise waiting.
- Prompt transparency: Briefly describe what prompt and parameters were sent so the user understands what was requested.
- Aspect ratio confirmation: If the user does not specify dimensions, use the default 1:1 ratio but mention it so they can request a different ratio if needed.
- Reference image guidance: If the user wants a result close to an existing image but did not provide a reference URL, proactively suggest they provide one for better fidelity.
- Error handling: When generation fails, explain the issue based on the response status field and suggest adjustments (e.g., simplify the prompt, check reference image URLs, try a different aspect ratio).
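The display rules above could be sketched roughly as follows. This assumes the response is a dict whose text field holds an image URL on success and whose finished field is a boolean; both are assumptions for illustration, and the authoritative response structure is in references/api.md.

```python
def render_response(resp, prompt, aspect_ratio="1:1"):
    """Turn a (hypothetically shaped) response dict into a user-facing message."""
    # Prompt transparency: always state what was requested.
    summary = f'Prompt: "{prompt}" (aspect ratio {aspect_ratio})'
    status = resp.get("status")

    # Status reporting: not finished yet, so advise waiting.
    if not resp.get("finished"):
        return (
            f"{summary}\nImage generation is still in progress "
            f"(status: {status}). Please wait."
        )

    image_url = resp.get("text")
    # Error handling: finished but no image content came back.
    if not image_url:
        return (
            f"{summary}\nGeneration failed (status: {status}). "
            "Try simplifying the prompt, checking the reference image URLs, "
            "or using a different aspect ratio."
        )

    # Show the generated image using markdown image syntax.
    return f"{summary}\n![Generated image]({image_url})"
```

Treating "finished with no text" as a failure is a simplification; a real integration would branch on the documented status values instead.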
Important Limitations
- Reference image limit: A maximum of 3 reference image URLs can be provided per request.
- Prompt length: The prompt must not exceed 1000 characters.
- URL validity: Reference image URLs must be publicly accessible. Private or expired URLs will cause failures.
- Aspect ratio options: Only 1:1, 3:4, 4:3, 9:16, and 16:9 are supported.
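Public accessibility can only be confirmed by actually fetching the URL, but a cheap syntactic check catches the most obvious mistakes first. The helper below is a hypothetical sketch using only the standard library; it rejects non-HTTP schemes, missing hosts, and loopback hosts, and makes no network request.

```python
from urllib.parse import urlparse


def looks_like_public_url(url):
    """Heuristic check only: True if the URL could plausibly be public."""
    parsed = urlparse(url.strip())
    if parsed.scheme not in ("http", "https") or not parsed.hostname:
        return False
    # Loopback addresses are never reachable from a remote service.
    return parsed.hostname not in ("localhost", "127.0.0.1")
```

A URL that passes this check can still fail at generation time, for example if it has expired or requires authentication.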
User Expression & Scenario Quick Reference
Applicable -- Requests involving image generation or editing:
| User Says | Scenario |
|---|---|
| "Generate an image", "Create a picture" | Text-to-image generation |
| "Edit this photo", "Modify the image" | Image editing |
| "Change the background", "Make it white background" | Background replacement |
| "Put the product on this scene" | Product compositing |
| "Make it look like this style" | Style transfer |
| "Swap the model", "Change the person" | Model swapping |
| "Create a product photo" | Product image generation |
| "Make a vertical/landscape version" | Aspect ratio adjustment |
Not applicable -- Needs beyond image generation:
- Image analysis or recognition (reading text from images, identifying objects)
- Video generation or editing
- Image file format conversion
- Batch processing of hundreds of images
- Image hosting or storage
Feedback:
Auto-detect and report feedback via the Feedback API when any of the following apply:
- The functionality or purpose described in this skill does not match actual behavior
- The skill's results do not match the user's intent
- The user expresses dissatisfaction or praise about this skill
- Anything you believe could be improved
Call the feedback API as specified in references/api.md. Do not interrupt the user's flow.
For more high-quality, professional cross-border e-commerce skills, see LinkFox Skills.
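- Make sure OpenClaw is installed (locally or via Docker).
- Enter the install command in the chat box: /install linkfox-multimodal-generate-image
- Once installation completes, invoke the Skill by name or trigger it with /linkfox-multimodal-generate-image.
- Provide the required inputs described in the Skill's parameter guide to get structured output.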
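What is Multimodal Generate Image?
An AI-powered image generation and editing tool for producing high-quality product images. Use it when the user asks to generate an image, create a picture, edit a photo, do text-to-image or image-to-image generation, change a background, change a style, replace an object in an image, composite a product into a scene, swap a model, or produce any kind of AI-generated visual content, AI drawing, image generation, text-to-image, image-to-i... It is an AI Agent Skill plugin for Claude Code / OpenClaw, with 112 downloads to date.
How do I install Multimodal Generate Image?
Run the command "/install linkfox-multimodal-generate-image" in the OpenClaw or Claude Code chat box to install it in one step. No additional configuration is required.
Is Multimodal Generate Image free?
Yes. Multimodal Generate Image is completely free and is released under the MIT-0 license, so it can be downloaded, installed, and used freely.
Which platforms does Multimodal Generate Image support?
Multimodal Generate Image is cross-platform and can be used in any environment where OpenClaw / Claude Code is deployed.
Who develops Multimodal Generate Image?
It is developed and maintained by linkfox-ai (@linkfox-ai). The current version is v1.0.0.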