← Back to Skills Marketplace
xiaoxiaole2025

Volcengine Digital Human Video Generator

by xiaoxiaole2025 · GitHub ↗ · v1.0.4 · MIT-0
cross-platform ⚠ suspicious
112
Downloads
0
Stars
0
Active Installs
5
Versions
Install in OpenClaw
/install volc-digital-human
Description
火山引擎数字人视频生成技能。当用户发送照片并提供对白或配音文案,要求生成数字人口播视频时触发。全自动完成:图片上传、形象创建、TTS配音(自动性别检测、多音色匹配)、视频合成、最后发回给用户。触发词包括数字人、视频合成、口播视频、数字人视频。
Usage Guidance
Key things to consider before installing or using this skill: - Do not upload sensitive or private images/audio. The skill uploads user-provided media to public file hosts (catbox.moe, 0x0.st / references mention uguu.se) so anyone with the URL can access them during processing. - The package contains a config.json with hard-coded AK/SK credentials. Treat this as insecure: either remove that file, replace the credentials with your own, or set VOLC_AK/VOLC_SK in environment variables. If you cannot verify those keys' ownership, do not rely on them — they may be leaked or abused. - Consider rotating any Volcengine keys you plan to use for this skill and use a minimal-permission RAM user for the Digital Human service only. - The script can download ML models at runtime (deepface/retinaface) and calls external services; run it in an isolated environment (container) if you need to limit network/file-system exposure. - Verify and/or pin dependency installation (edge-tts, ffmpeg, OpenCV, deepface) in a controlled environment; the package does not provide an install step. If you need this capability but are uncomfortable with public uploads or embedded credentials, ask the skill author to remove the bundled config.json, provide clear metadata declaring required env vars, and offer an option to use private storage (your own S3/minio) instead of public file hosts.
Capability Analysis
Type: OpenClaw Skill Name: volc-digital-human Version: 1.0.4 The skill poses significant privacy and security risks by uploading user-provided images and generated audio to public anonymous file-sharing hosts (catbox.moe and 0x0.st) to facilitate Volcengine API calls, as seen in scripts/volc_digital_human.py. It also contains hardcoded Volcengine credentials in config.json, which contradicts security warnings in SKILL.md. Furthermore, the use of subprocess.run to execute ffmpeg and edge-tts with potentially unsanitized user input presents a command injection vulnerability.
Capability Assessment
Purpose & Capability
The name/description (Volcengine digital human video generator) match the code and instructions: image upload → create avatar → TTS → synthesize video. Requiring Volcengine AK/SK, TTS (edge-tts) and ffmpeg is coherent. However the registry metadata at the top claimed no required env vars/credentials while SKILL.md and the script explicitly require VOLC_AK/VOLC_SK and even include a config.json with AK/SK — that metadata mismatch is unexpected and should be explained by the author.
Instruction Scope
The SKILL.md and script instruct the agent to read images from /root/.openclaw/media/inbound and to upload user images/audio/video to public file hosts (catbox.moe, 0x0.st; references also mention uguu.se). Reading inbound media and calling external APIs is necessary for the task, but automatic public hosting of user-supplied images/audio is a significant privacy risk. The SKILL.md warns about this, but the automation will still expose content publicly during processing — verify users understand this before use.
Install Mechanism
No install spec (instruction-only), so nothing is written by an installer. The script has heavy runtime dependencies (opencv, deepface/retinaface, numpy, edge-tts, ffmpeg) and deepface may download models at runtime. Lack of an install spec means dependency installation/behavior (and model downloads) will happen outside the package and should be managed explicitly.
Credentials
Requesting VOLC_AK and VOLC_SK is appropriate for calling Volcengine. However the included config.json in the package contains ak/sk values (hard-coded credentials). Shipping credentials in a skill package is a serious red flag: it may be a leaked/shared key or intentionally embedded account credentials. The script will read a config.json in its directory if env vars are not set, causing accidental use of those embedded credentials. This is disproportionate and may grant the package author (or whoever controls that account) access to usage and uploaded content.
Persistence & Privilege
always:false and normal autonomous invocation are fine. The skill reads from the agent's inbound media directory and writes temporary files under /tmp and its own workspace; it does not modify other skills or system-wide configs. Still, the combination of autonomous invocation plus public uploads means the agent could automatically expose user media when invoked — be cautious about enabling it for unattended runs.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install volc-digital-human
  3. After installation, invoke the skill by name or use /volc-digital-human
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.4
Add required_env_vars, runtime_dependencies, file_upload_hosts, and privacy notice to manifest. Fix docs inconsistency (uguu->catbox). Add English version of documentation.
v1.0.3
Remove hardcoded credentials from references/volc_api.md and clarify user setup instructions
v1.0.2
Fix: Add .skillignore to exclude config.json with hardcoded credentials
v1.0.1
Initial release: Generate talking avatar videos from photos
v1.0.0
首个版本
Metadata
Slug volc-digital-human
Version 1.0.4
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 5
Frequently Asked Questions

What is Volcengine Digital Human Video Generator?

火山引擎数字人视频生成技能。当用户发送照片并提供对白或配音文案,要求生成数字人口播视频时触发。全自动完成:图片上传、形象创建、TTS配音(自动性别检测、多音色匹配)、视频合成、最后发回给用户。触发词包括数字人、视频合成、口播视频、数字人视频。 It is an AI Agent Skill for Claude Code / OpenClaw, with 112 downloads so far.

How do I install Volcengine Digital Human Video Generator?

Run "/install volc-digital-human" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Volcengine Digital Human Video Generator free?

Yes, Volcengine Digital Human Video Generator is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Volcengine Digital Human Video Generator support?

Volcengine Digital Human Video Generator is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Volcengine Digital Human Video Generator?

It is built and maintained by xiaoxiaole2025 (@xiaoxiaole2025); the current version is v1.0.4.

💬 Comments