← 返回 Skills 市场
pauldelavallaz

UGC Manual

作者 Paul de Lavallaz · GitHub ↗ · v1.0.2
cross-platform ⚠ suspicious
1003
总下载
2
收藏
1
当前安装
2
版本数
在 OpenClaw 中安装
/install ugc-manual
功能描述
Generate lip-sync video from image + user's own audio recording. ✅ USE WHEN: - User provides their OWN audio file (voice recording) - Want to sync image to specific audio/voice - User recorded the script themselves - Need exact audio timing preserved ❌ DON'T USE WHEN: - User provides text script (not audio) → use veed-ugc - Need AI to generate the voice → use veed-ugc - Don't have audio file yet → use veed-ugc with script INPUT: Image + audio file (user's recording) OUTPUT: MP4 video with lip-sync to provided audio KEY DIFFERENCE: veed-ugc = script → AI voice → video ugc-manual = user audio → video (no voice generation)
安全使用建议
Key things to check before installing or running: - Metadata mismatch: the package metadata claims no required env vars/binaries, but the script requires COMFY_DEPLOY_API_KEY and ffmpeg. Expect to supply an API key and have ffmpeg available. - Data privacy: the script uploads user-provided images and audio to https://api.comfydeploy.com and queues a fixed deployment ID. If your audio or image is sensitive or private, do not use this skill unless you trust ComfyDeploy and understand their retention/privacy policy. - Verify API key provenance: only provide COMFY_DEPLOY_API_KEY if you obtained it from a trusted ComfyDeploy account; otherwise the key could be misused to upload/queue jobs under your account. - Source provenance: the skill lists no homepage and the owner is not human-readable. If provenance is important, request the upstream source or inspect the repo before use. - Testing suggestion: run the script in a sandbox or with non-sensitive sample media first. Confirm network endpoints, deployment ID, and resulting behavior match expectations. - If you need offline or local-only processing (no upload), do not use this skill — it is designed to use ComfyDeploy and will transmit media off your machine. If you want, I can extract the exact places the code will contact the network and show the minimal set of commands/requests it will make, or help you modify the script to use a different endpoint or to run locally if available.
功能分析
Type: OpenClaw Skill Name: ugc-manual Version: 1.0.2 The skill bundle is benign. The `SKILL.md` provides clear instructions for generating lip-sync videos using an image and audio, and does not contain any prompt injection attempts against the AI agent. The `scripts/generate.py` Python script implements this functionality by interacting with the `https://api.comfydeploy.com` API for file uploads, workflow queuing, and video downloads. It also uses `subprocess.run` to execute the `ffmpeg` command for audio conversion, which is explicitly mentioned in the documentation and implemented safely using a list of arguments, mitigating shell injection risks. All network and file system operations are directly related to the skill's stated purpose, and there is no evidence of malicious intent such as data exfiltration, unauthorized remote execution, or persistence mechanisms.
能力评估
Purpose & Capability
The script implements exactly what the description says: it uploads an image and converted audio to ComfyDeploy, queues a specific deployment, polls for completion, and downloads the MP4. Requiring an API key for ComfyDeploy (COMFY_DEPLOY_API_KEY) is coherent with this purpose. However, the registry metadata claims 'Required env vars: none' and 'Required binaries: none' while the code requires COMFY_DEPLOY_API_KEY and ffmpeg; that metadata mismatch is unexpected.
Instruction Scope
Runtime instructions and code stay within the expected scope: they may download audio from a provided URL, convert audio locally with ffmpeg, upload files to https://api.comfydeploy.com, queue a workflow using a fixed deployment ID, poll status, and download the resulting video. The agent is instructed to send user-provided image/audio to a third-party service (ComfyDeploy) — this is necessary for the stated workflow but is an important privacy/transfer decision and should be made explicit to users.
Install Mechanism
There is no install spec (instruction-only with included scripts). A pyproject.toml lists only the 'requests' dependency. No third-party binary download or obscure URL extraction is used. Running the script executes local ffmpeg and Python requests calls; the install surface is low but the script will perform network I/O at runtime.
Credentials
The code requires a single environment variable COMFY_DEPLOY_API_KEY (used to authenticate file uploads and workflow queueing), which is proportionate to the functionality. However, the skill's declared requirements in the registry do not list this credential (metadata states 'none'), creating an inconsistency. The missing declaration makes it harder for users to know what secrets they must provide and trust.
Persistence & Privilege
The skill is not configured as always: true and does not request persistent/system-level privileges. It only uses one environment variable and does not modify other skills or global agent configuration.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install ugc-manual
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /ugc-manual 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.2
Auto-convert audio to WAV PCM 16-bit mono 48kHz for FabricLipsync compatibility
v1.0.1
Improved routing: user audio vs AI voice differentiation
元数据
Slug ugc-manual
版本 1.0.2
许可证
累计安装 1
当前安装数 1
历史版本数 2
常见问题

UGC Manual 是什么?

Generate lip-sync video from image + user's own audio recording. ✅ USE WHEN: - User provides their OWN audio file (voice recording) - Want to sync image to specific audio/voice - User recorded the script themselves - Need exact audio timing preserved ❌ DON'T USE WHEN: - User provides text script (not audio) → use veed-ugc - Need AI to generate the voice → use veed-ugc - Don't have audio file yet → use veed-ugc with script INPUT: Image + audio file (user's recording) OUTPUT: MP4 video with lip-sync to provided audio KEY DIFFERENCE: veed-ugc = script → AI voice → video ugc-manual = user audio → video (no voice generation). 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 1003 次。

如何安装 UGC Manual?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install ugc-manual」即可一键安装,无需额外配置。

UGC Manual 是免费的吗?

是的,UGC Manual 完全免费(开源免费),可自由下载、安装和使用。

UGC Manual 支持哪些平台?

UGC Manual 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 UGC Manual?

由 Paul de Lavallaz(@pauldelavallaz)开发并维护,当前版本 v1.0.2。

💬 留言讨论