veya2ztn

kameo

Author: veya2ztn · GitHub · v1.0.1
cross-platform ⚠ suspicious
Total downloads: 1928
Favorites: 5
Current installs: 2
Versions: 2
Install in OpenClaw
/install kameo
Description
Generate expressive talking-head videos from static images using Kameo AI. Converts static avatars/portraits into dynamic 5-second videos with realistic facial expressions, lip-sync, and motion. Use when you need to bring static images to life, create AI character videos, demonstrate visual communication, or generate talking avatars from photos.
Usage (SKILL.md)

Kameo AI - Talking Head Video Generation

Transform static images into expressive talking-head videos with realistic motion and lip-sync.

Quick Start

scripts/generate_video.sh <image_path> <prompt> [output_file]

Example:

scripts/generate_video.sh avatar.jpg "Hello, I am an AI assistant" output.mp4

What It Does

  • Takes a static image (portrait/avatar)
  • Adds realistic facial motion, expressions, and lip-sync based on your prompt
  • Generates a 5-second video (the default; 10 seconds is also supported) in 9:16, 16:9, or 1:1 aspect ratio
  • Returns a CDN URL in the API response once processing finishes (roughly 10-30 seconds)

Authentication

Set your Kameo API key:

export KAMEO_API_KEY="kam_..."

Or store in ~/.config/kameo/credentials.json:

{
  "api_key": "kam_..."
}
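
The two key locations above could be checked in order by a small helper. This is a minimal sketch, not the logic of the shipped scripts: the function name kameo_api_key, the KAMEO_CREDENTIALS_FILE override, and the choice to let the environment variable take precedence over the file are all assumptions made for illustration.

```shell
# Print the Kameo API key, or fail with a message on stderr.
# Assumption: the environment variable wins over the credentials file.
# KAMEO_CREDENTIALS_FILE is a hypothetical override, not part of the skill.
kameo_api_key() {
  if [ -n "${KAMEO_API_KEY:-}" ]; then
    printf '%s\n' "$KAMEO_API_KEY"
    return 0
  fi
  creds_file="${KAMEO_CREDENTIALS_FILE:-$HOME/.config/kameo/credentials.json}"
  key=""
  if [ -r "$creds_file" ]; then
    # Extract the "api_key" string value without depending on jq.
    key=$(sed -n 's/.*"api_key"[[:space:]]*:[[:space:]]*"\([^"]*\)".*/\1/p' "$creds_file" | head -n 1)
  fi
  if [ -z "$key" ]; then
    echo "No Kameo API key found in the environment or in $creds_file" >&2
    return 1
  fi
  printf '%s\n' "$key"
}
```

A caller might then write `key=$(kameo_api_key) || exit 1` before issuing any request, so a missing key fails early rather than producing a 401.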

Getting an API Key:

  1. Register at kameo.chat (requires email verification)
  2. Log in to get a JWT token
  3. Create API key via /api/public/keys endpoint
  4. Or use the registration helper: scripts/register.sh

Prompt Engineering

Basic Prompts (Simple)

Just the dialogue:

"Hello, I'm here to help you today"
"こんにちは、私はガッキーです。愛してます。"

This works, but the results are generic.

Enhanced Prompts (Recommended)

Format:

[Detailed scene/environment], [person's complete appearance and expression], speaking in [tone], "[DIALOGUE]". [Camera and lighting details].

Example:

In a bright outdoor winter setting with soft, overcast daylight, a young woman with long dark hair wearing a white knitted winter hat with ear flaps and a colorful patterned sweater stands centered in frame. She looks directly into the camera with a warm, genuine smile, her eyes crinkling with joy, speaking in a cheerful, affectionate tone, "こんにちは、私はガッキーです。愛してます。" The scene is captured in a medium close-up shot, framed at eye level. The lighting is natural and diffused from above, creating soft, even illumination.
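
The format above can be assembled from its parts with a one-line helper. This is only a sketch: build_enhanced_prompt and its argument order are hypothetical, and the shipped enhance_prompt.sh may work quite differently.

```shell
# Assemble an enhanced prompt following the bracketed format above.
# Arguments: 1=scene, 2=appearance and expression, 3=tone, 4=dialogue, 5=camera and lighting.
build_enhanced_prompt() {
  printf '%s, %s, speaking in %s, "%s". %s\n' "$1" "$2" "$3" "$4" "$5"
}

# Example call with made-up descriptions:
build_enhanced_prompt \
  "In a sunlit office" \
  "a man in a grey suit smiles at the camera" \
  "a calm, friendly tone" \
  "Hello, I'm here to help you today" \
  "Medium close-up at eye level with soft, even lighting."
```

Keeping the dialogue as its own argument makes it easy to reuse one scene description with different lines of speech.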

Why Enhanced Prompts Matter:

  • Better facial expressions matching the scene context
  • More natural motion and gestures
  • Improved lip-sync quality
  • Contextual emotional delivery

Prompt Enhancement Workflow

For best results, use vision AI to analyze the image first:

  1. Feed the image to a vision model (Gemini, GPT-4V, Claude)
  2. Ask it to describe the scene in cinematic detail
  3. Insert your dialogue into the description
  4. Use the enhanced prompt for Kameo

See: scripts/enhance_prompt.sh for automated enhancement.

API Details

Base URL: https://api.kameo.chat/api/public

Generate Video

curl -X POST https://api.kameo.chat/api/public/generate \
  -H "X-API-Key: $KAMEO_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "image_base64": "<base64_encoded_image>",
    "prompt": "Your detailed prompt here",
    "seconds": 5,
    "aspect_ratio": "9:16"
  }'

Parameters:

  • image_base64 (required): Base64-encoded JPEG/PNG
  • prompt (required): Dialogue and/or scene description
  • seconds (optional): 5 (default) or 10
  • aspect_ratio (optional): "9:16" (default), "16:9", or "1:1"
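
As a rough illustration of how the four parameters fit together, the request body could be built from an image file like this. The function name build_generate_payload is hypothetical, and the escaping is deliberately minimal (backslashes and double quotes only); a prompt containing newlines or control characters would need a real JSON encoder such as jq.

```shell
# Print the JSON body for the generate endpoint.
# Usage: build_generate_payload <image_path> <prompt> [seconds] [aspect_ratio]
build_generate_payload() {
  # Some base64 builds wrap their output; strip the newlines to get one string.
  image_b64=$(base64 < "$1" | tr -d '\n')
  # Escape backslashes first, then double quotes, so the prompt stays valid JSON.
  prompt_json=$(printf '%s' "$2" | sed 's/\\/\\\\/g; s/"/\\"/g')
  # Defaults mirror the documented ones: 5 seconds and 9:16.
  printf '{"image_base64":"%s","prompt":"%s","seconds":%s,"aspect_ratio":"%s"}\n' \
    "$image_b64" "$prompt_json" "${3:-5}" "${4:-9:16}"
}
```

The output can be piped into the curl call shown earlier by replacing the inline -d argument with --data-binary @-, which avoids putting a large base64 string on the command line.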

Response:

{
  "job_id": "uuid",
  "status": "completed",
  "video_url": "https://cdn.kameo.chat/videos/{uuid}.mp4",
  "duration_seconds": 5,
  "processing_time_ms": 15000
}
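
A caller will usually want just the video_url from this response. The sketch below reads the response on stdin and prints the URL only when status is "completed"; the helper names are hypothetical, and treating every other status value as "not ready" is an assumption, since the source shows only the completed case. Where jq is installed, `jq -r '.video_url'` is the more robust way to read the field.

```shell
# Read a JSON object on stdin and print the string value of one field.
# Pattern-based, so it only suits simple, flat responses like the one above.
json_string_field() {
  sed -n 's/.*"'"$1"'"[[:space:]]*:[[:space:]]*"\([^"]*\)".*/\1/p' | head -n 1
}

# Print the video URL when the job is completed; otherwise fail.
video_url_if_completed() {
  response=$(cat)
  job_status=$(printf '%s\n' "$response" | json_string_field status)
  if [ "$job_status" != "completed" ]; then
    echo "Job not completed (status: ${job_status:-unknown})" >&2
    return 1
  fi
  printf '%s\n' "$response" | json_string_field video_url
}
```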

Check Credits

curl -H "X-API-Key: kam_..." \
  https://api.kameo.chat/api/public/credits

Response:

{
  "permanent_credits": 294,
  "subscription_credits": 0,
  "total_available": 294
}

Pricing

curl https://api.kameo.chat/api/public/pricing

Cost: 3 credits per video
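
At 3 credits per video, the number of videos a balance covers is a simple integer division. The sketch below assumes a flat per-video cost; the source does not say whether 10-second videos are priced differently, so the optional second argument is a hypothetical escape hatch.

```shell
# Number of whole videos a credit balance covers.
# Usage: videos_affordable <total_available> [credits_per_video]
videos_affordable() {
  echo $(( $1 / ${2:-3} ))
}

videos_affordable 294   # prints 98 for the sample balance shown above
```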

Performance

  • Processing time: 8-35 seconds (depends on aspect ratio and queue)
  • 9:16 (portrait): ~30-35s
  • 16:9 (landscape): ~15-20s
  • 1:1 (square): ~10-15s
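
Because processing time varies by aspect ratio, a client could pick its request timeout accordingly. In this sketch the values are the upper bound of each range above, doubled as a safety margin; the doubling and the function name timeout_for_aspect are assumptions, not something the skill prescribes.

```shell
# Suggest a request timeout in seconds for a given aspect ratio.
timeout_for_aspect() {
  case "$1" in
    9:16) echo 70 ;;   # ~30-35s listed, doubled
    16:9) echo 40 ;;   # ~15-20s listed, doubled
    1:1)  echo 30 ;;   # ~10-15s listed, doubled
    *)    echo "unknown aspect ratio: $1" >&2; return 1 ;;
  esac
}
```

The result can be passed to curl as `--max-time "$(timeout_for_aspect 9:16)"`.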

Best Practices

  1. Optimize image size - Resize large images before encoding (saves bandwidth, faster upload)

    ffmpeg -i large.jpg -vf scale=720:-1 optimized.jpg
    
  2. Use descriptive prompts - Enhanced prompts = better results

  3. Choose aspect ratio wisely

    • 9:16: Mobile/social media (TikTok, Instagram Stories)
    • 16:9: Desktop/YouTube
    • 1:1: Profile pictures, square posts
  4. Monitor credits - Check balance with scripts/check_credits.sh

Limitations

  • CDN access: Video URLs may have time-limited access or require authentication
  • Download: Videos may return 403 when downloaded via curl (use browser or authenticated session)
  • Rate limits: 10 generations per minute
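
To stay under the limit of 10 generations per minute when processing a batch, requests can be spaced at least 6 seconds apart. The following is a sketch of that idea only; both function names are hypothetical, and fixed spacing is one simple strategy among several.

```shell
# Minimum spacing in seconds between requests for a per-minute limit, rounded up.
min_interval_seconds() {
  echo $(( (60 + $1 - 1) / $1 ))
}

# Run a command once per argument, sleeping between calls to respect the limit.
# Usage: throttled_each <per_minute_limit> <command> <arg>...
throttled_each() {
  interval=$(min_interval_seconds "$1")
  cmd=$2
  shift 2
  first=1
  for arg in "$@"; do
    # Sleep before every call except the first.
    [ "$first" -eq 1 ] || sleep "$interval"
    first=0
    "$cmd" "$arg"
  done
}
```

For example, `throttled_each 10 my_generate a.jpg b.jpg c.jpg` would call a user-defined my_generate function three times with a 6-second pause between calls.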

Troubleshooting

"401 Unauthorized"

  • Check your API key is set correctly
  • Verify key hasn't been revoked

"402 Insufficient credits"

  • Check credit balance: scripts/check_credits.sh
  • Add credits at kameo.chat

"Timeout errors"

  • 9:16 videos take longer (~30s)
  • Increase timeout in scripts
  • Retry if server is busy

"403 when downloading video"

  • CDN URLs may be time-limited
  • Try accessing in browser immediately after generation
  • Or save the base64 response if available
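
The status-code advice above can be folded into a lookup that a wrapper script prints on failure. This is a sketch with a hypothetical function name; the hint text paraphrases the troubleshooting entries and covers only the codes listed there.

```shell
# Map an HTTP status code to the matching troubleshooting hint.
kameo_error_hint() {
  case "$1" in
    401) echo "Check that your API key is set correctly and has not been revoked" ;;
    402) echo "Insufficient credits: check your balance and add credits at kameo.chat" ;;
    403) echo "The CDN URL may be time-limited: open it in a browser right after generation" ;;
    *)   echo "Unexpected HTTP status $1" ;;
  esac
}
```

With curl, the status code can be captured separately from the body using `-o response.json -w '%{http_code}'` and then passed to this function.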

Use Cases

  • AI character videos - Bring bot avatars to life
  • Social media content - Dynamic profile videos
  • Demos and presentations - Talking product demos
  • Educational content - Video tutorials with AI presenters
  • Multilingual content - Same avatar speaking different languages
Security recommendations
Things to consider before installing or running this skill:
  • Do not run the scripts blindly. Review generate_video.sh, enhance_prompt.sh, and register.sh before executing.
  • The skill requires a KAMEO_API_KEY (env or ~/.config/kameo/credentials.json), though the registry metadata omits it. Set a dedicated key you control, and be ready to revoke it if needed.
  • The enhance_prompt.sh script uploads your image to Google Generative Language (Gemini) and requires GOOGLE_API_KEY. If you don't want your images sent to Google (privacy/sensitivity), avoid the enhanced workflow or modify the script to use a local or alternative vision model.
  • SKILL.md and USAGE.md include a plaintext API key-like string (kam_...); treat it as potentially exposed. Do not assume it is safe: if you paste that key into your environment, you may be using someone else's credential. Prefer creating your own key via the service, and rotate or revoke keys if you suspect reuse.
  • register.sh uses SUPABASE_URL and SUPABASE_ANON_KEY placeholders. Do not run it without ensuring those values point to a project you control; the script will transmit email/password and create API keys via that backend.
  • The scripts require curl, jq, and base64; ensure those binaries are present and review their usage.
  • If you plan to proceed: run the scripts in an isolated environment (container or throwaway VM), avoid exposing sensitive images/identities, and monitor/limit network egress.
If you need more assurance, ask the publisher for a canonical homepage/repository and a statement of which external services are contacted (Gemini vs optional) and why the example API key appears in the docs.
Confidence is medium: the code matches the declared high-level purpose, but the undisclosed env requirements and embedded key are clear inconsistencies that warrant caution.
Functional analysis
Type: OpenClaw Skill
Name: kameo
Version: 1.0.1
The skill is designed to interact with the Kameo AI API for video generation and Google Gemini for prompt enhancement. It sends user-provided images and prompts to these external services, which is explicitly documented and central to its stated purpose. While this involves sending sensitive data (images, API keys, and potentially user credentials during registration) to third-party APIs, there is no evidence of intentional malicious behavior such as unauthorized data exfiltration, backdoors, or prompt injection against the OpenClaw agent. The `register.sh` script contains placeholder API keys, preventing immediate functionality or malicious use, and its intended purpose is legitimate account management.
Capability assessment
Purpose & Capability
The scripts implement the stated purpose (encode an image, call api.kameo.chat/generate), which is coherent. However the published registry metadata says no required env vars or binaries, while package.json lists required binaries (curl, jq, base64) and the scripts actually expect KAMEO_API_KEY (env or ~/.config/kameo/credentials.json) and optionally GOOGLE_API_KEY for prompt enhancement — these required credentials/env vars are not declared in the skill metadata, which is inconsistent.
Instruction Scope
Runtime instructions & scripts do more than just call Kameo: the enhance_prompt.sh script uploads the user's image to Google Generative Language (Gemini) using GOOGLE_API_KEY, and register.sh interacts with a Supabase instance to create accounts/keys. These external calls mean user images and credentials may be transmitted to services beyond api.kameo.chat; that behaviour is not fully documented in the skill metadata and is potentially privacy-sensitive.
Install Mechanism
There is no install spec (instruction-only), which minimizes arbitrary code downloads. However the package ships multiple shell scripts that will run locally; the lack of an install step is low risk in itself but you still must review and run these scripts manually. package.json lists required binaries (curl, jq, base64) which the runtime scripts actually use — the registry metadata omitted these.
Credentials
The scripts require KAMEO_API_KEY (and will read/write ~/.config/kameo/credentials.json). The prompt-enhancement path requires GOOGLE_API_KEY (used to call Gemini) but the skill metadata does not declare it. Additionally, the SKILL.md/USAGE.md embed a plaintext API key string (kam_...), which may be a leaked or placeholder credential — having a key visible in docs is risky. register.sh relies on SUPABASE_URL and SUPABASE_ANON_KEY placeholders; if these are left pointing to a third-party project they could send user credentials to an external Supabase instance.
Persistence & Privilege
The skill is not always-enabled and does not request elevated platform privileges. It stores its own API key into ~/.config/kameo/credentials.json (chmod 600) which is normal for CLI helpers. It does not attempt to modify other skills or system-wide settings.
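How to use
  1. Make sure OpenClaw is installed (locally or via Docker)
  2. Enter the install command in the chat box: /install kameo
  3. Once installed, invoke the Skill by name or trigger it with /kameo
  4. Provide the inputs described in the Skill's parameter documentation to get structured output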
Version history
v1.0.1
  • Updated documentation examples to use a new API key format.
  • No changes to code or features.
v1.0.0
Kameo 1.0.0 - Initial Release
  • Generate expressive talking-head videos from static images with realistic facial motion, expressions, and lip-sync.
  • Supports 5-second video creation in 9:16, 16:9, or 1:1 aspect ratios, returning a CDN URL for instant access.
  • Includes CLI scripts for video generation, prompt enhancement, registration, and credit checks.
  • Offers detailed guidance on API usage, prompt engineering, best practices, and troubleshooting.
  • Use cases: create AI character videos, social media content, demos, educational videos, and multilingual avatars.
Metadata
Slug: kameo
Version: 1.0.1
License: not listed
Total installs: 2
Current installs: 2
Versions: 2
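FAQ

What is kameo?

Generate expressive talking-head videos from static images using Kameo AI. Converts static avatars/portraits into dynamic 5-second videos with realistic facial expressions, lip-sync, and motion. Use when you need to bring static images to life, create AI character videos, demonstrate visual communication, or generate talking avatars from photos. It is an AI Agent Skill plugin for Claude Code / OpenClaw, with 1928 total downloads to date.

How do I install kameo?

Run "/install kameo" in the OpenClaw or Claude Code chat box for a one-step install. The listing states that no extra configuration is needed, but the scripts still expect a KAMEO_API_KEY (see Authentication).

Is kameo free?

Yes, the skill itself is free and open source to download, install, and use. Generating videos consumes Kameo credits (3 per video, per the Pricing section).

Which platforms does kameo support?

kameo is cross-platform and runs in any environment where OpenClaw / Claude Code is deployed.

Who develops kameo?

Developed and maintained by veya2ztn (@veya2ztn); the current version is v1.0.1.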
