AI Media Generation En

by bowen-dotcom · GitHub ↗ · v1.0.0
cross-platform ✓ Security Clean
805 Downloads · 0 Stars · 0 Active Installs · 1 Version
Install in OpenClaw
/install aisa-media-gen-en-skill
Description
Generate images & videos with AIsa. Gemini 3 Pro Image (image) + Qwen Wan 2.6 (video) via one API key.
README (SKILL.md)

OpenClaw Media Gen 🎬

Generate images and videos with one AIsa API key:

  • Image: gemini-3-pro-image-preview (Gemini GenerateContent)
  • Video: wan2.6-t2v (Qwen Wan 2.6 / Tongyi Wanxiang, async task)

API Reference: all pages of the AIsa API Reference are listed at https://aisa.mintlify.app/llms.txt

🎯 Pricing Advantage

Video Generation (WAN) - Cost Comparison

Resolution    AIsa (Contract)     AIsa (Official)   Bailian (Official)   OpenRouter
720P          $0.06/sec           ~$0.08/sec        ~$0.10/sec           not supported
1080P         $0.09/sec           ~$0.12/sec        ~$0.15/sec           not supported
Pro/Animate   $0.108–0.156/sec    ~$0.18/sec        ~$0.25/sec           not supported

Key Benefits:

  • 25-40% cheaper than Bailian official pricing
  • OpenRouter doesn't support video generation; AIsa covers image and video through one unified API
  • Contract pricing available for production workloads
  • Single API key for both image and video generation
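
To make the per-second rates concrete, here is a minimal sketch that estimates what a clip would cost and how the AIsa tiers compare with Bailian. The dictionary and function names are illustrative and not part of the AIsa client. It assumes that the three figures in each row of the comparison belong to the AIsa (Contract), AIsa (Official), and Bailian (Official) columns in that order, and it treats the approximate (~) figures as exact.

```python
# Per-second USD rates copied from the cost comparison above.
# Assumption: each row's three figures map to the first three columns in order,
# and the "~" (approximate) figures are treated as exact for this arithmetic.
RATES_PER_SEC = {
    "720P": {"aisa_contract": 0.06, "aisa_official": 0.08, "bailian_official": 0.10},
    "1080P": {"aisa_contract": 0.09, "aisa_official": 0.12, "bailian_official": 0.15},
}


def estimate_cost(resolution: str, seconds: float, tier: str = "aisa_contract") -> float:
    """Return the estimated USD cost of one clip, rounded to 4 decimals."""
    return round(RATES_PER_SEC[resolution][tier] * seconds, 4)


def savings_vs_bailian(resolution: str, tier: str = "aisa_contract") -> float:
    """Return the fractional saving of an AIsa tier relative to Bailian official."""
    rates = RATES_PER_SEC[resolution]
    return round(1 - rates[tier] / rates["bailian_official"], 4)
```

For example, `estimate_cost("720P", 5)` prices the 5-second 720P clip used throughout this README at the contract rate, and `savings_vs_bailian("720P", "aisa_official")` gives the saving at the official rate.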

🔥 What You Can Do

Image Generation (Gemini)

"Generate a cyberpunk cityscape at night, neon lights, rainy, cinematic"

Video Generation (Wan 2.6)

"Use a reference image to generate a 5-second shot: slow camera push-in, wind blowing hair, cinematic, shallow depth of field"

Quick Start

export AISA_API_KEY="your-key"

🖼️ Image Generation (Gemini)

Endpoint

  • Base URL: https://api.aisa.one/v1
  • POST /models/{model}:generateContent

Documentation: google-gemini-chat (GenerateContent) at https://aisa.mintlify.app/api-reference/chat/chat-api/google-gemini-chat.md

curl Example (returns inline_data for images)

curl -X POST "https://api.aisa.one/v1/models/gemini-3-pro-image-preview:generateContent" \
  -H "Authorization: Bearer $AISA_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "contents":[
      {"role":"user","parts":[{"text":"A cute red panda, ultra-detailed, cinematic lighting"}]}
    ]
  }'

Note: The response may contain image data under candidates[].content.parts[].inline_data (the Gemini API also spells this key inlineData), typically as base64 data plus a MIME type. The bundled client script parses this and saves the file automatically.
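
For readers who prefer Python over curl, the following is a minimal sketch of the same request plus the decoding step, using only the standard library. It is not the bundled client's implementation. The function names are hypothetical, and the response layout is an assumption: the parser accepts parts under either `content.parts` or `parts`, and either `inline_data` or `inlineData`, because the exact shape returned by the AIsa endpoint is not shown here.

```python
import base64
import json
import os
import urllib.request

BASE_URL = "https://api.aisa.one/v1"
MODEL = "gemini-3-pro-image-preview"


def build_image_request(prompt: str, api_key: str) -> urllib.request.Request:
    """Build (but do not send) the POST request shown in the curl example."""
    body = {"contents": [{"role": "user", "parts": [{"text": prompt}]}]}
    return urllib.request.Request(
        f"{BASE_URL}/models/{MODEL}:generateContent",
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def extract_images(response: dict) -> list[tuple[str, bytes]]:
    """Collect (mime_type, raw_bytes) pairs from a GenerateContent response.

    Assumption: both the nested (content.parts) and flat (parts) layouts and
    both key spellings (inline_data / inlineData) are tolerated.
    """
    images = []
    for candidate in response.get("candidates", []):
        parts = (candidate.get("content") or {}).get("parts") or candidate.get("parts") or []
        for part in parts:
            blob = part.get("inline_data") or part.get("inlineData")
            if blob and "data" in blob:
                mime = blob.get("mime_type") or blob.get("mimeType") or "application/octet-stream"
                images.append((mime, base64.b64decode(blob["data"])))
    return images


def generate_image(prompt: str, out_prefix: str = "out") -> list[str]:
    """Send the request and write each returned image to disk (needs network)."""
    request = build_image_request(prompt, os.environ["AISA_API_KEY"])
    with urllib.request.urlopen(request, timeout=120) as reply:
        payload = json.load(reply)
    paths = []
    for index, (mime, data) in enumerate(extract_images(payload)):
        path = f"{out_prefix}_{index}.{mime.split('/')[-1]}"
        with open(path, "wb") as handle:
            handle.write(data)
        paths.append(path)
    return paths
```

Calling `generate_image("A cute red panda, cinematic lighting")` with `AISA_API_KEY` exported would write one file per returned image. Separating `extract_images` from the network call keeps the decoding logic testable offline.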


🎞️ Video Generation (Qwen Wan 2.6 / Tongyi Wanxiang)

Create Task

  • Base URL: https://api.aisa.one/apis/v1
  • POST /services/aigc/video-generation/video-synthesis
  • Header: X-DashScope-Async: enable (required for async)

Documentation: video-generation at https://aisa.mintlify.app/api-reference/aliyun/video/video-generation.md

curl -X POST "https://api.aisa.one/apis/v1/services/aigc/video-generation/video-synthesis" \
  -H "Authorization: Bearer $AISA_API_KEY" \
  -H "Content-Type: application/json" \
  -H "X-DashScope-Async: enable" \
  -d '{
    "model":"wan2.6-t2v",
    "input":{
      "prompt":"cinematic close-up, slow push-in, shallow depth of field",
      "img_url":"https://upload.wikimedia.org/wikipedia/commons/thumb/3/3a/Cat03.jpg/320px-Cat03.jpg"
    },
    "parameters":{
      "resolution":"720P",
      "duration":5,
      "shot_type":"single",
      "watermark":false
    }
  }'
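
The same task-creation call can be sketched in Python with the standard library. The model name, body fields, and headers are copied from the curl example. The function names are hypothetical, and `extract_task_id` assumes a DashScope-style reply of the form `{"output": {"task_id": ...}}`, which is not shown in this README and should be checked against the video-generation documentation.

```python
import json
import urllib.request

VIDEO_BASE_URL = "https://api.aisa.one/apis/v1"


def build_video_task_request(
    prompt: str,
    img_url: str,
    api_key: str,
    resolution: str = "720P",
    duration: int = 5,
) -> urllib.request.Request:
    """Build (but do not send) the async task-creation request."""
    body = {
        "model": "wan2.6-t2v",
        "input": {"prompt": prompt, "img_url": img_url},
        "parameters": {
            "resolution": resolution,
            "duration": duration,
            "shot_type": "single",
            "watermark": False,
        },
    }
    return urllib.request.Request(
        f"{VIDEO_BASE_URL}/services/aigc/video-generation/video-synthesis",
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
            # Required so the service queues the job instead of blocking.
            "X-DashScope-Async": "enable",
        },
        method="POST",
    )


def extract_task_id(response: dict) -> str:
    """Pull the task id out of the reply (assumed shape: output.task_id)."""
    return response["output"]["task_id"]
```

Passing the request to `urllib.request.urlopen` and the decoded JSON reply to `extract_task_id` would yield the id to use when polling.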

Poll Task Status

  • GET /services/aigc/tasks?task_id=...

Documentation: task at https://aisa.mintlify.app/api-reference/aliyun/video/task.md

curl "https://api.aisa.one/apis/v1/services/aigc/tasks?task_id=YOUR_TASK_ID" \
  -H "Authorization: Bearer $AISA_API_KEY"
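
Because the task is asynchronous, a caller normally repeats this request until the task finishes. Below is a minimal polling sketch, not the bundled client's logic. It assumes the reply follows the DashScope convention of `output.task_status` (with values such as `PENDING`, `RUNNING`, `SUCCEEDED`, `FAILED`) and `output.video_url` on success; neither field is shown in this README, so treat both as assumptions. The function names are hypothetical.

```python
import json
import time
import urllib.request
from typing import Callable
from urllib.parse import quote

VIDEO_BASE_URL = "https://api.aisa.one/apis/v1"
# Assumed terminal failure states, following DashScope naming.
TERMINAL_FAILURES = {"FAILED", "CANCELED", "UNKNOWN"}


def make_status_fetcher(task_id: str, api_key: str) -> Callable[[], dict]:
    """Return a function that performs one GET against the tasks endpoint."""
    def fetch() -> dict:
        request = urllib.request.Request(
            f"{VIDEO_BASE_URL}/services/aigc/tasks?task_id={quote(task_id)}",
            headers={"Authorization": f"Bearer {api_key}"},
        )
        with urllib.request.urlopen(request, timeout=30) as reply:
            return json.load(reply)
    return fetch


def wait_for_video(
    fetch_status: Callable[[], dict],
    poll_seconds: float = 10,
    timeout_seconds: float = 600,
    sleep: Callable[[float], None] = time.sleep,
    clock: Callable[[], float] = time.monotonic,
) -> str:
    """Poll until the task succeeds and return its video URL.

    Raises RuntimeError on a terminal failure status and TimeoutError when
    the deadline passes while the task is still pending or running.
    """
    deadline = clock() + timeout_seconds
    while True:
        output = fetch_status().get("output", {})
        status = output.get("task_status")
        if status == "SUCCEEDED":
            return output["video_url"]
        if status in TERMINAL_FAILURES:
            raise RuntimeError(f"video task ended with status {status}")
        if clock() >= deadline:
            raise TimeoutError(f"video task still {status} after {timeout_seconds}s")
        sleep(poll_seconds)
```

A typical call would be `wait_for_video(make_status_fetcher("YOUR_TASK_ID", api_key))`. Injecting `fetch_status`, `sleep`, and `clock` keeps the loop testable without network access or real delays.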

Python Client

# Generate image (save to local file)
python3 {baseDir}/scripts/media_gen_client.py image \
  --prompt "A cute red panda, cinematic lighting" \
  --out "out.png"

# Create video task (requires img_url)
python3 {baseDir}/scripts/media_gen_client.py video-create \
  --prompt "cinematic close-up, slow push-in" \
  --img-url "https://upload.wikimedia.org/wikipedia/commons/thumb/3/3a/Cat03.jpg/320px-Cat03.jpg" \
  --duration 5

# Poll task status
python3 {baseDir}/scripts/media_gen_client.py video-status --task-id YOUR_TASK_ID

# Wait until success (optional: print video_url on success)
python3 {baseDir}/scripts/media_gen_client.py video-wait --task-id YOUR_TASK_ID --poll 10 --timeout 600

# Wait until success and auto-download mp4
python3 {baseDir}/scripts/media_gen_client.py video-wait --task-id YOUR_TASK_ID --download --out out.mp4
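
If you would rather fetch the finished video yourself than use `--download`, a guarded download can be sketched as follows. This is an illustration under stated assumptions, not the bundled client's code: the function name, the 500 MB default cap, and the HTTPS-only default are all hypothetical choices made to limit disk usage and avoid fetching from unexpected URL schemes.

```python
import urllib.request
from urllib.parse import urlparse


def download_media(
    url: str,
    out_path: str,
    max_bytes: int = 500 * 1024 * 1024,
    allowed_schemes: tuple = ("https",),
    chunk_size: int = 1 << 16,
) -> int:
    """Stream url to out_path and return the number of bytes written.

    Raises ValueError if the URL scheme is not allowed or if the body is
    larger than max_bytes. On a size error the partial file is left on disk,
    so callers may want to delete it.
    """
    scheme = urlparse(url).scheme
    if scheme not in allowed_schemes:
        raise ValueError(f"refusing to download from scheme {scheme!r}")
    written = 0
    with urllib.request.urlopen(url, timeout=60) as reply, open(out_path, "wb") as handle:
        while True:
            chunk = reply.read(chunk_size)
            if not chunk:
                break
            written += len(chunk)
            if written > max_bytes:
                raise ValueError(f"download exceeds {max_bytes} bytes")
            handle.write(chunk)
    return written
```

For example, `download_media(video_url, "out.mp4")` would save the result of a succeeded task, streaming in chunks so the whole video is never held in memory.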

💡 Use Cases

  • AI Agents: Automate visual content generation for social media and marketing materials
  • Content Creators: Generate custom images and videos programmatically
  • Developers: Build apps with multimodal generation capabilities
  • Businesses: Lower-cost alternative to Bailian official pricing

🚀 Why AIsa for Media Generation?

  1. Unified API: Single key for both images (Gemini) and videos (WAN)
  2. Best Pricing: 25-40% cheaper than Bailian official pricing
  3. Production Ready: Contract pricing and enterprise support available
  4. No Competition: OpenRouter doesn't support video generation
  5. Simple Integration: Python client with async task management built-in
Usage Guidance
This skill appears coherent and implements what it claims, but consider the following before installing:
  1. Whatever key you provide (AISA_API_KEY) is sent to api.aisa.one. Only use a key you are comfortable sharing with this third party, and avoid reusing high-privilege or long-lived credentials.
  2. The client downloads media (including potentially large video files) to local disk when you use the video-download options. Make sure you trust the returned URLs and have enough disk space.
  3. Review the AIsa service's privacy policy, terms, and pricing (the SKILL.md advertises third-party "contract" pricing). This is a trust decision outside the code.
  4. If you have any doubt about the source, run the included Python script in an isolated environment (such as a container) and inspect its network traffic, or use a throwaway API key.
Capability Analysis
Type: OpenClaw Skill
Name: aisa-media-gen-en-skill
Version: 1.0.0
The skill bundle is benign. The `SKILL.md` and `README.md` provide clear instructions for media generation using the AIsa API, without any prompt injection attempts or instructions for malicious actions. The `scripts/media_gen_client.py` Python code correctly implements the client logic, making HTTP requests to `https://api.aisa.one` for image and video generation, reading the `AISA_API_KEY` environment variable for authentication, and saving generated media files locally. All observed behaviors are directly aligned with the stated purpose of generating images and videos, with no evidence of data exfiltration, malicious execution, persistence mechanisms, or obfuscation.
Capability Assessment
Purpose & Capability
Name/description, required binaries (python3 and curl) and the single required env var (AISA_API_KEY) match the implementation: the Python client and curl examples call the AIsa endpoints. (Minor note: curl is used only in SKILL.md examples; the shipped client uses python3.)
Instruction Scope
SKILL.md instructs the agent to call documented AIsa endpoints, export the AISA_API_KEY, create/poll video tasks, and save returned media. The included Python client only uses the declared env var and network calls to api.aisa.one; it does not access other local files or unrelated environment variables. It will download media URLs returned by the API to disk when requested (expected behavior for a media client).
Install Mechanism
No install spec is present (instruction-only with a bundled Python script). This is low-risk: nothing is downloaded/installed automatically by the skill beyond running the provided script.
Credentials
Only AISA_API_KEY is required (primaryEnv). The code accepts an explicit --api-key override and otherwise reads only AISA_API_KEY from the environment. No other credentials or unrelated secrets are requested.
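
The lookup order described here (explicit flag first, then the environment variable) can be sketched as below. This illustrates the behavior as the review describes it and is not the actual code in `scripts/media_gen_client.py`; the function name and error message are hypothetical.

```python
import argparse
import os


def resolve_api_key(argv=None, environ=None) -> str:
    """Return the key from --api-key if given, else from AISA_API_KEY.

    argv and environ are injectable so the lookup order can be checked
    without touching the real command line or environment.
    """
    environ = os.environ if environ is None else environ
    parser = argparse.ArgumentParser(add_help=False)
    parser.add_argument("--api-key", default=None)
    # parse_known_args ignores any other flags the real client may define.
    args, _unknown = parser.parse_known_args(argv)
    key = args.api_key or environ.get("AISA_API_KEY")
    if not key:
        raise SystemExit("Set AISA_API_KEY or pass --api-key")
    return key
```

Reading only this one variable, with no fallback to other credentials, is what keeps the skill's credential surface limited to a single key.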
Persistence & Privilege
The skill does not request always:true or any elevated platform presence and does not modify other skills or system settings. It runs on-demand using the provided client and network calls.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install aisa-media-gen-en-skill
  3. After installation, invoke the skill by name or use /aisa-media-gen-en-skill
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
AI Image & Video Generation: Create images with Gemini and videos with Qwen Wan 2.6 via single API key. Async video processing, automatic file saving, parameter control for resolution, duration, and artistic styles.
Metadata
Slug aisa-media-gen-en-skill
Version 1.0.0
License
All-time Installs 0
Active Installs 0
Total Versions 1
Frequently Asked Questions

What is AI Media Generation En?

Generate images & videos with AIsa. Gemini 3 Pro Image (image) + Qwen Wan 2.6 (video) via one API key. It is an AI Agent Skill for Claude Code / OpenClaw, with 805 downloads so far.

How do I install AI Media Generation En?

Run "/install aisa-media-gen-en-skill" in the OpenClaw or Claude Code chat to install it in one step. Before first use, set the AISA_API_KEY environment variable.

Is AI Media Generation En free?

The skill itself is free to download and install. Generation requests go through the AIsa API, which requires an AISA_API_KEY and is billed at the rates listed under Pricing Advantage.

Which platforms does AI Media Generation En support?

AI Media Generation En is cross-platform and runs anywhere OpenClaw / Claude Code is available.

Who created AI Media Generation En?

It is built and maintained by bowen-dotcom (@bowen-dotcom); the current version is v1.0.0.
