← Back to Skills Marketplace

AI Media Generation En

Name: AI Media Generation En
Author: bowen-dotcom

by bowen-dotcom · GitHub ↗ · v1.0.0

cross-platform ✓ Security Clean

805

Downloads

Stars

Active Installs

Versions

Install in OpenClaw

/install aisa-media-gen-en-skill

Description

Generate images & videos with AIsa. Gemini 3 Pro Image (image) + Qwen Wan 2.6 (video) via one API key.

README (SKILL.md)

OpenClaw Media Gen 🎬

Generate images and videos with one AIsa API key:

Image: gemini-3-pro-image-preview (Gemini GenerateContent)
Video: wan2.6-t2v (Qwen Wan 2.6 / Tongyi Wanxiang, async task)

API Reference: AIsa API Reference (all pages available at https://aisa.mintlify.app/llms.txt)

🎯 Pricing Advantage

Video Generation (WAN) - Cost Comparison

Resolution	AIsa (Contract)	AIsa (Official)	Bailian (Official)	OpenRouter
720P	$0.06/sec	~$0.08	~$0.10	❌
1080P	$0.09/sec	~$0.12	~$0.15	❌
Pro/Animate	$0.108–0.156	~$0.18	~$0.25	❌

Key Benefits:

25-40% cheaper than Bailian official pricing
OpenRouter doesn't support video - AIsa is the only unified API with video generation
Contract pricing available for production workloads
Single API key for both image and video generation

🔥 What You Can Do

Image Generation (Gemini)

"Generate a cyberpunk cityscape at night, neon lights, rainy, cinematic"

Video Generation (Wan 2.6)

"Use a reference image to generate a 5-second shot: slow camera push-in, wind blowing hair, cinematic, shallow depth of field"

Quick Start

export AISA_API_KEY="your-key"

🖼️ Image Generation (Gemini)

Endpoint

Base URL: https://api.aisa.one/v1
POST /models/{model}:generateContent

Documentation: google-gemini-chat (GenerateContent) at https://aisa.mintlify.app/api-reference/chat/chat-api/google-gemini-chat.md

curl Example (returns inline_data for images)

curl -X POST "https://api.aisa.one/v1/models/gemini-3-pro-image-preview:generateContent" \
  -H "Authorization: Bearer $AISA_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "contents":[
      {"role":"user","parts":[{"text":"A cute red panda, ultra-detailed, cinematic lighting"}]}
    ]
  }'

Note: Response may contain candidates[].parts[].inline_data (typically with base64 data and mime type); client script automatically parses and saves the file.

🎞️ Video Generation (Qwen Wan 2.6 / Tongyi Wanxiang)

Create Task

Base URL: https://api.aisa.one/apis/v1
POST /services/aigc/video-generation/video-synthesis
Header: X-DashScope-Async: enable (required for async)

Documentation: video-generation at https://aisa.mintlify.app/api-reference/aliyun/video/video-generation.md

curl -X POST "https://api.aisa.one/apis/v1/services/aigc/video-generation/video-synthesis" \
  -H "Authorization: Bearer $AISA_API_KEY" \
  -H "Content-Type: application/json" \
  -H "X-DashScope-Async: enable" \
  -d '{
    "model":"wan2.6-t2v",
    "input":{
      "prompt":"cinematic close-up, slow push-in, shallow depth of field",
      "img_url":"https://upload.wikimedia.org/wikipedia/commons/thumb/3/3a/Cat03.jpg/320px-Cat03.jpg"
    },
    "parameters":{
      "resolution":"720P",
      "duration":5,
      "shot_type":"single",
      "watermark":false
    }
  }'

Poll Task Status

GET /services/aigc/tasks?task_id=...

Documentation: task at https://aisa.mintlify.app/api-reference/aliyun/video/task.md

curl "https://api.aisa.one/apis/v1/services/aigc/tasks?task_id=YOUR_TASK_ID" \
  -H "Authorization: Bearer $AISA_API_KEY"

Python Client

# Generate image (save to local file)
python3 {baseDir}/scripts/media_gen_client.py image \
  --prompt "A cute red panda, cinematic lighting" \
  --out "out.png"

# Create video task (requires img_url)
python3 {baseDir}/scripts/media_gen_client.py video-create \
  --prompt "cinematic close-up, slow push-in" \
  --img-url "https://upload.wikimedia.org/wikipedia/commons/thumb/3/3a/Cat03.jpg/320px-Cat03.jpg" \
  --duration 5

# Poll task status
python3 {baseDir}/scripts/media_gen_client.py video-status --task-id YOUR_TASK_ID

# Wait until success (optional: print video_url on success)
python3 {baseDir}/scripts/media_gen_client.py video-wait --task-id YOUR_TASK_ID --poll 10 --timeout 600

# Wait until success and auto-download mp4
python3 {baseDir}/scripts/media_gen_client.py video-wait --task-id YOUR_TASK_ID --download --out out.mp4

💡 Use Cases

AI Agents: Automate visual content generation for social media, marketing materials
Content Creators: Generate custom images and videos programmatically
Developers: Build apps with multimodal generation capabilities
Businesses: Cost-effective alternative to Bailian with better pricing

🚀 Why AIsa for Media Generation?

Unified API: Single key for both images (Gemini) and videos (WAN)
Best Pricing: 25-40% cheaper than alternatives
Production Ready: Contract pricing and enterprise support available
No Competition: OpenRouter doesn't support video generation
Simple Integration: Python client with async task management built-in

Usage Guidance

This skill appears coherent and implements what it claims, but consider the following before installing: 1) You will be sending whatever key you provide (AISA_API_KEY) to api.aisa.one — only use an API key you trust to be used with this third party and avoid reusing high-privilege or long-lived credentials. 2) The client will download media (including potentially large video files) to local disk when you use the video-download options — ensure you trust the returned URLs and have disk space. 3) Review the AIsa service's privacy/terms and pricing (the SKILL.md advertises third-party "contract" pricing); this is a trust decision outside the code. 4) If you have any doubt about the source, run the included Python script in an isolated environment (container) and inspect network traffic or use a throwaway API key.

Capability Analysis

Type: OpenClaw Skill Name: aisa-media-gen-en-skill Version: 1.0.0 The skill bundle is benign. The `SKILL.md` and `README.md` provide clear instructions for media generation using the AIsa API, without any prompt injection attempts or instructions for malicious actions. The `scripts/media_gen_client.py` Python code correctly implements the client logic, making HTTP requests to `https://api.aisa.one` for image and video generation, reading the `AISA_API_KEY` environment variable for authentication, and saving generated media files locally. All observed behaviors are directly aligned with the stated purpose of generating images and videos, with no evidence of data exfiltration, malicious execution, persistence mechanisms, or obfuscation.

Capability Assessment

✓ Purpose & Capability

Name/description, required binaries (python3 and curl) and the single required env var (AISA_API_KEY) match the implementation: the Python client and curl examples call the AIsa endpoints. (Minor note: curl is used only in SKILL.md examples; the shipped client uses python3.)

✓ Instruction Scope

SKILL.md instructs the agent to call documented AIsa endpoints, export the AISA_API_KEY, create/poll video tasks, and save returned media. The included Python client only uses the declared env var and network calls to api.aisa.one; it does not access other local files or unrelated environment variables. It will download media URLs returned by the API to disk when requested (expected behavior for a media client).

✓ Install Mechanism

No install spec is present (instruction-only with a bundled Python script). This is low-risk: nothing is downloaded/installed automatically by the skill beyond running the provided script.

✓ Credentials

Only AISA_API_KEY is required (primaryEnv). The code accepts an explicit --api-key override and otherwise reads only AISA_API_KEY from the environment. No other credentials or unrelated secrets are requested.

✓ Persistence & Privilege

The skill does not request always:true or any elevated platform presence and does not modify other skills or system settings. It runs on-demand using the provided client and network calls.

How to Use

Make sure OpenClaw is installed (local or Docker)
Run the install command in chat: /install aisa-media-gen-en-skill
After installation, invoke the skill by name or use /aisa-media-gen-en-skill
Provide required inputs per the skill's parameter spec and get structured output

Version History

v1.0.0

AI Image & Video Generation: Create images with Gemini and videos with Qwen Wan 2.6 via single API key. Async video processing, automatic file saving, parameter control for resolution, duration, and artistic styles.

Metadata

Slug aisa-media-gen-en-skill

Version 1.0.0

License —

All-time Installs 0

Active Installs 0

Total Versions 1

Frequently Asked Questions

What is AI Media Generation En?

Generate images & videos with AIsa. Gemini 3 Pro Image (image) + Qwen Wan 2.6 (video) via one API key. It is an AI Agent Skill for Claude Code / OpenClaw, with 805 downloads so far.

How do I install AI Media Generation En?

Run "/install aisa-media-gen-en-skill" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is AI Media Generation En free?

Yes, AI Media Generation En is completely free (open-source). You can download, install and use it at no cost.

Which platforms does AI Media Generation En support?

AI Media Generation En is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created AI Media Generation En?

It is built and maintained by bowen-dotcom (@bowen-dotcom); the current version is v1.0.0.

More Skills