Generate images & videos with: Gemini 3 Pro Image (image) + Qwen Wan 2.6 (video) via one API key

Author: 0xjordansg-yolo

by 0xjordansg-yolo · GitHub ↗ · v1.0.0

cross-platform ✓ Security Clean

Downloads: 2235

Install in OpenClaw

/install openclaw-aisa-image-video-models-wan2-6-gemini-3-pro-image-nano-banana

Description

Generate images & videos with AIsa. Gemini 3 Pro Image (image) + Qwen Wan 2.6 (video) via one API key.

README (SKILL.md)

OpenClaw Media Gen 🎬

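Generate images and videos with a single AIsa API key:

Image: gemini-3-pro-image-preview (Gemini GenerateContent)
Video: wan2.6-t2v (Tongyi Wanxiang / Qwen Wan 2.6, asynchronous task)

For the API documentation index, see the AIsa API Reference (all pages are listed at https://aisa.mintlify.app/llms.txt).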

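🔥 What you can do

Image generation (Gemini)

"Generate a cyberpunk-style city nightscape: neon lights, rainy night, cinematic feel"

Video generation (Wan 2.6)

"Generate a 5-second shot from a reference image: slow camera push-in, wind blowing through hair, cinematic feel, shallow depth of field"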

Quick Start

export AISA_API_KEY="your-key"

🖼️ Image Generation (Gemini)

Endpoint

Base URL: https://api.aisa.one/v1
POST /models/{model}:generateContent

Docs: google-gemini-chat (GenerateContent), see https://aisa.mintlify.app/api-reference/chat/chat-api/google-gemini-chat.md.

curl example (an image is returned as inline_data)

curl -X POST "https://api.aisa.one/v1/models/gemini-3-pro-image-preview:generateContent" \
  -H "Authorization: Bearer $AISA_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "contents":[
      {"role":"user","parts":[{"text":"A cute red panda, ultra-detailed, cinematic lighting"}]}
    ]
  }'

Note: the response may contain candidates[].parts[].inline_data (typically base64 data plus a MIME type); the client script parses it and saves the file automatically.
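For readers who call the endpoint without the bundled client, the following is a minimal sketch of how inline image data might be pulled out of a GenerateContent response and written to disk. It is not the bundled script. The helper names are hypothetical, and because the exact response layout is an assumption here, the sketch accepts parts either directly on the candidate or under "content", and both the inline_data and inlineData spellings.

```python
import base64


def extract_inline_images(response: dict) -> list[tuple[str, bytes]]:
    """Return (mime_type, raw_bytes) for every inline image part found.

    Hypothetical helper: the response layout is assumed, not confirmed.
    """
    images = []
    for candidate in response.get("candidates", []):
        # Parts may sit directly on the candidate or under "content".
        parts = candidate.get("parts") or candidate.get("content", {}).get("parts", [])
        for part in parts:
            inline = part.get("inline_data") or part.get("inlineData")
            if not inline or "data" not in inline:
                continue  # Text parts and empty parts are skipped.
            mime = (
                inline.get("mime_type")
                or inline.get("mimeType")
                or "application/octet-stream"
            )
            images.append((mime, base64.b64decode(inline["data"])))
    return images


def save_first_image(response: dict, out_path: str) -> bool:
    """Write the first inline image to out_path; return False if none exists."""
    images = extract_inline_images(response)
    if not images:
        return False
    with open(out_path, "wb") as f:
        f.write(images[0][1])
    return True
```

A response with no inline image parts (for example, a text-only reply) yields an empty list, so callers can tell "no image returned" apart from a decoding error.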

🎞️ Video Generation (Qwen Wan 2.6 / Tongyi Wanxiang)

Create task

Base URL: https://api.aisa.one/apis/v1
POST /services/aigc/video-generation/video-synthesis
Header: X-DashScope-Async: enable (required; the task runs asynchronously)

Docs: video-generation, see https://aisa.mintlify.app/api-reference/aliyun/video/video-generation.md.

curl -X POST "https://api.aisa.one/apis/v1/services/aigc/video-generation/video-synthesis" \
  -H "Authorization: Bearer $AISA_API_KEY" \
  -H "Content-Type: application/json" \
  -H "X-DashScope-Async: enable" \
  -d '{
    "model":"wan2.6-t2v",
    "input":{
      "prompt":"cinematic close-up, slow push-in, shallow depth of field",
      "img_url":"https://upload.wikimedia.org/wikipedia/commons/thumb/3/3a/Cat03.jpg/320px-Cat03.jpg"
    },
    "parameters":{
      "resolution":"720P",
      "duration":5,
      "shot_type":"single",
      "watermark":false
    }
  }'
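The same create-task request can be assembled in Python with only the standard library. This is a sketch mirroring the curl example above, not the bundled client. The function name build_video_task_request is hypothetical, and the payload fields are copied from the curl example rather than verified against the API.

```python
import json
import urllib.request

VIDEO_CREATE_URL = (
    "https://api.aisa.one/apis/v1/services/aigc/video-generation/video-synthesis"
)


def build_video_task_request(api_key, prompt, img_url, duration=5, resolution="720P"):
    """Build (but do not send) the POST request that creates a video task."""
    payload = {
        "model": "wan2.6-t2v",
        "input": {"prompt": prompt, "img_url": img_url},
        "parameters": {
            "resolution": resolution,
            "duration": duration,
            "shot_type": "single",
            "watermark": False,
        },
    }
    return urllib.request.Request(
        VIDEO_CREATE_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
            # Required: marks the task as asynchronous.
            "X-DashScope-Async": "enable",
        },
        method="POST",
    )
```

To send it, pass the request to urllib.request.urlopen(req, timeout=30) and decode the JSON body. Where the task ID sits in that body is not shown in this document, so check the video-generation docs before relying on a specific field.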

Poll task

GET /services/aigc/tasks?task_id=...

Docs: task, see https://aisa.mintlify.app/api-reference/aliyun/video/task.md.

curl "https://api.aisa.one/apis/v1/services/aigc/tasks?task_id=YOUR_TASK_ID" \
  -H "Authorization: Bearer $AISA_API_KEY"
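Because the task is asynchronous, a caller typically repeats the status request until the task finishes. The sketch below shows one way to structure that loop. It assumes the response carries output.task_status and output.video_url and uses DashScope-style status values (PENDING, RUNNING, SUCCEEDED, FAILED, CANCELED, UNKNOWN); none of these are confirmed by this document. The fetch_status callable is a hypothetical stand-in for the GET request shown above.

```python
import time

# Assumed terminal failure states (DashScope-style naming).
TERMINAL_FAILURES = {"FAILED", "CANCELED", "UNKNOWN"}


def wait_for_video(
    fetch_status,
    task_id,
    poll_seconds=10,
    timeout_seconds=600,
    sleep=time.sleep,
    clock=time.monotonic,
):
    """Poll fetch_status(task_id) until success, failure, or timeout.

    fetch_status must return the decoded JSON body of the task endpoint.
    Returns the video URL on success (None if the field is absent).
    """
    deadline = clock() + timeout_seconds
    while True:
        output = fetch_status(task_id).get("output", {})
        status = output.get("task_status")
        if status == "SUCCEEDED":
            return output.get("video_url")
        if status in TERMINAL_FAILURES:
            raise RuntimeError(f"task {task_id} ended with status {status}")
        if clock() >= deadline:
            raise TimeoutError(f"task {task_id} still {status} after {timeout_seconds}s")
        sleep(poll_seconds)
```

Passing sleep and clock as parameters keeps the loop testable without real waiting; the defaults of 10 and 600 seconds match the --poll and --timeout values used in the Python client examples.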

Python Client

# Generate an image (saved to a local file)
python3 {baseDir}/scripts/media_gen_client.py image \
  --prompt "A cute red panda, cinematic lighting" \
  --out "out.png"

# Create a video task (img_url is required)
python3 {baseDir}/scripts/media_gen_client.py video-create \
  --prompt "cinematic close-up, slow push-in" \
  --img-url "https://upload.wikimedia.org/wikipedia/commons/thumb/3/3a/Cat03.jpg/320px-Cat03.jpg" \
  --duration 5

# Poll the task status
python3 {baseDir}/scripts/media_gen_client.py video-status --task-id YOUR_TASK_ID

# Wait until the task succeeds (optional: print video_url on success)
python3 {baseDir}/scripts/media_gen_client.py video-wait --task-id YOUR_TASK_ID --poll 10 --timeout 600

# Wait until the task succeeds, then download the mp4 automatically
python3 {baseDir}/scripts/media_gen_client.py video-wait --task-id YOUR_TASK_ID --download --out out.mp4

Usage Guidance

This skill is coherent: it uses AISA_API_KEY to call api.aisa.one for image and async video generation and saves the returned media locally. Before installing, confirm that you trust the AIsa service (api.aisa.one), because all prompts, images, and any URLs you provide are transmitted there. Treat AISA_API_KEY like any API secret: use least-privilege keys if possible, and rotate the key if it is compromised. Note that the script downloads files from URLs you provide and from URLs returned by the service, so avoid passing internal or sensitive URLs. If you need stronger assurance, review the full Python script locally and consider running it in an isolated environment (e.g., a container) with network access limited as appropriate.
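One way to act on the advice about internal URLs is to screen a URL before handing it to the client. The sketch below is an illustrative pre-check only and is not part of the skill. The function name looks_public_http_url is hypothetical, and the check does not resolve DNS names, so a public-looking hostname that points at an internal address would still pass.

```python
import ipaddress
from urllib.parse import urlsplit

# Hostname suffixes commonly used for internal services (an assumption).
INTERNAL_SUFFIXES = (".localhost", ".local", ".internal")


def looks_public_http_url(url: str) -> bool:
    """Return True if url is http(s) and does not obviously target an internal host."""
    parts = urlsplit(url)
    host = parts.hostname  # Lowercased by urlsplit; None if missing.
    if parts.scheme not in ("http", "https") or not host:
        return False
    if host == "localhost" or host.endswith(INTERNAL_SUFFIXES):
        return False
    try:
        ip = ipaddress.ip_address(host)
    except ValueError:
        # A DNS name: this sketch does not resolve it, so it is accepted.
        return True
    # Rejects loopback, private, link-local, and other non-global ranges.
    return ip.is_global
```

A check like this reduces accidental leaks of intranet or metadata-service URLs, but it is not a substitute for running the script in an isolated environment as suggested above.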

Capability Analysis

Type: OpenClaw Skill
Name: openclaw-aisa-image-video-models-wan2-6-gemini-3-pro-image-nano-banana
Version: 1.0.0

The skill is designed to generate images and videos using the AIsa API. The `SKILL.md` and `README.md` provide clear instructions and examples for this purpose. The Python script `scripts/media_gen_client.py` handles API calls to `https://api.aisa.one` using the `AISA_API_KEY` and saves generated media (images, videos) to local files. There is no evidence of data exfiltration, malicious execution, persistence mechanisms, or prompt injection attempts against the agent. All network communication is directed to the stated `aisa.one` domain, and file operations are limited to saving the generated output.

Capability Assessment

✓ Purpose & Capability

Name/description promise (Gemini image + Qwen Wan video via AIsa) matches the code and SKILL.md: network calls target api.aisa.one and video endpoints, and the single required env var is AISA_API_KEY. Requiring curl and python3 is reasonable given the curl examples and the Python client.

✓ Instruction Scope

SKILL.md instructs only how to call AIsa endpoints, poll async tasks, and save returned media. The included Python client only sends requests to api.aisa.one, parses responses, saves files, and downloads returned media URLs. It does not read unrelated system files or additional environment variables.

✓ Install Mechanism

No install spec; this is an instruction-only skill with a bundled Python script. No downloads from arbitrary URLs or archive extraction are performed at install time.

✓ Credentials

Only AISA_API_KEY is required and is the primary credential; that is proportional to an API client. The code does not request other secrets or config paths.

✓ Persistence & Privilege

Skill is not always-enabled and does not request elevated persistence or modify other skills. It can be invoked by the agent (default), which is normal for skills of this type.

How to Use

Make sure OpenClaw is installed (local or Docker)
Run the install command in chat: /install openclaw-aisa-image-video-models-wan2-6-gemini-3-pro-image-nano-banana
After installation, invoke the skill by name or use /openclaw-aisa-image-video-models-wan2-6-gemini-3-pro-image-nano-banana
Provide required inputs per the skill's parameter spec and get structured output

Version History

v1.0.0

OpenClaw Media Gen 1.0.0: unified image & video generation via AIsa API
- Initial release combining image (Gemini 3 Pro Image) and video (Qwen Wan 2.6) generation in one skill.
- Supports both image prompts and video tasks using a single API key.
- Provides bash curl and Python client command examples for quick image and video generation, task polling, and result downloading.
- Requires AISA_API_KEY and standard tools (python3, curl) for usage.
- Documentation links and usage instructions included for both endpoints.

Metadata

Slug openclaw-aisa-image-video-models-wan2-6-gemini-3-pro-image-nano-banana

Version 1.0.0

License —

All-time Installs 11

Active Installs 11

Total Versions 1

Frequently Asked Questions

What is Generate images & videos with: Gemini 3 Pro Image (image) + Qwen Wan 2.6 (video) via one API key?

Generate images & videos with AIsa. Gemini 3 Pro Image (image) + Qwen Wan 2.6 (video) via one API key. It is an AI Agent Skill for Claude Code / OpenClaw, with 2235 downloads so far.

How do I install Generate images & videos with: Gemini 3 Pro Image (image) + Qwen Wan 2.6 (video) via one API key?

Run "/install openclaw-aisa-image-video-models-wan2-6-gemini-3-pro-image-nano-banana" in the OpenClaw or Claude Code chat to install it in one step. To use the skill afterwards, set the AISA_API_KEY environment variable and make sure python3 and curl are available.

Is Generate images & videos with: Gemini 3 Pro Image (image) + Qwen Wan 2.6 (video) via one API key free?

Yes, the skill itself is free to download, install, and use. It calls the AIsa API, so an AISA_API_KEY is required; the listing does not state a license.

Which platforms does Generate images & videos with: Gemini 3 Pro Image (image) + Qwen Wan 2.6 (video) via one API key support?

It is cross-platform and runs anywhere OpenClaw / Claude Code is available.

Who created Generate images & videos with: Gemini 3 Pro Image (image) + Qwen Wan 2.6 (video) via one API key?

It is built and maintained by 0xjordansg-yolo (@0xjordansg-yolo); the current version is v1.0.0.
