← Back to Skills Marketplace
pauldelavallaz

UGC Manual

by Paul de Lavallaz · GitHub ↗ · v1.0.2
cross-platform ⚠ suspicious
1003
Downloads
2
Stars
1
Active Installs
2
Versions
Install in OpenClaw
/install ugc-manual
Description
Generate lip-sync video from image + user's own audio recording. ✅ USE WHEN: - User provides their OWN audio file (voice recording) - Want to sync image to specific audio/voice - User recorded the script themselves - Need exact audio timing preserved ❌ DON'T USE WHEN: - User provides text script (not audio) → use veed-ugc - Need AI to generate the voice → use veed-ugc - Don't have audio file yet → use veed-ugc with script INPUT: Image + audio file (user's recording) OUTPUT: MP4 video with lip-sync to provided audio KEY DIFFERENCE: veed-ugc = script → AI voice → video ugc-manual = user audio → video (no voice generation)
Usage Guidance
Key things to check before installing or running: - Metadata mismatch: the package metadata claims no required env vars/binaries, but the script requires COMFY_DEPLOY_API_KEY and ffmpeg. Expect to supply an API key and have ffmpeg available. - Data privacy: the script uploads user-provided images and audio to https://api.comfydeploy.com and queues a fixed deployment ID. If your audio or image is sensitive or private, do not use this skill unless you trust ComfyDeploy and understand their retention/privacy policy. - Verify API key provenance: only provide COMFY_DEPLOY_API_KEY if you obtained it from a trusted ComfyDeploy account; otherwise the key could be misused to upload/queue jobs under your account. - Source provenance: the skill lists no homepage and the owner is not human-readable. If provenance is important, request the upstream source or inspect the repo before use. - Testing suggestion: run the script in a sandbox or with non-sensitive sample media first. Confirm network endpoints, deployment ID, and resulting behavior match expectations. - If you need offline or local-only processing (no upload), do not use this skill — it is designed to use ComfyDeploy and will transmit media off your machine. If you want, I can extract the exact places the code will contact the network and show the minimal set of commands/requests it will make, or help you modify the script to use a different endpoint or to run locally if available.
Capability Analysis
Type: OpenClaw Skill Name: ugc-manual Version: 1.0.2 The skill bundle is benign. The `SKILL.md` provides clear instructions for generating lip-sync videos using an image and audio, and does not contain any prompt injection attempts against the AI agent. The `scripts/generate.py` Python script implements this functionality by interacting with the `https://api.comfydeploy.com` API for file uploads, workflow queuing, and video downloads. It also uses `subprocess.run` to execute the `ffmpeg` command for audio conversion, which is explicitly mentioned in the documentation and implemented safely using a list of arguments, mitigating shell injection risks. All network and file system operations are directly related to the skill's stated purpose, and there is no evidence of malicious intent such as data exfiltration, unauthorized remote execution, or persistence mechanisms.
Capability Assessment
Purpose & Capability
The script implements exactly what the description says: it uploads an image and converted audio to ComfyDeploy, queues a specific deployment, polls for completion, and downloads the MP4. Requiring an API key for ComfyDeploy (COMFY_DEPLOY_API_KEY) is coherent with this purpose. However, the registry metadata claims 'Required env vars: none' and 'Required binaries: none' while the code requires COMFY_DEPLOY_API_KEY and ffmpeg; that metadata mismatch is unexpected.
Instruction Scope
Runtime instructions and code stay within the expected scope: they may download audio from a provided URL, convert audio locally with ffmpeg, upload files to https://api.comfydeploy.com, queue a workflow using a fixed deployment ID, poll status, and download the resulting video. The agent is instructed to send user-provided image/audio to a third-party service (ComfyDeploy) — this is necessary for the stated workflow but is an important privacy/transfer decision and should be made explicit to users.
Install Mechanism
There is no install spec (instruction-only with included scripts). A pyproject.toml lists only the 'requests' dependency. No third-party binary download or obscure URL extraction is used. Running the script executes local ffmpeg and Python requests calls; the install surface is low but the script will perform network I/O at runtime.
Credentials
The code requires a single environment variable COMFY_DEPLOY_API_KEY (used to authenticate file uploads and workflow queueing), which is proportionate to the functionality. However, the skill's declared requirements in the registry do not list this credential (metadata states 'none'), creating an inconsistency. The missing declaration makes it harder for users to know what secrets they must provide and trust.
Persistence & Privilege
The skill is not configured as always: true and does not request persistent/system-level privileges. It only uses one environment variable and does not modify other skills or global agent configuration.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install ugc-manual
  3. After installation, invoke the skill by name or use /ugc-manual
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.2
Auto-convert audio to WAV PCM 16-bit mono 48kHz for FabricLipsync compatibility
v1.0.1
Improved routing: user audio vs AI voice differentiation
Metadata
Slug ugc-manual
Version 1.0.2
License
All-time Installs 1
Active Installs 1
Total Versions 2
Frequently Asked Questions

What is UGC Manual?

Generate lip-sync video from image + user's own audio recording. ✅ USE WHEN: - User provides their OWN audio file (voice recording) - Want to sync image to specific audio/voice - User recorded the script themselves - Need exact audio timing preserved ❌ DON'T USE WHEN: - User provides text script (not audio) → use veed-ugc - Need AI to generate the voice → use veed-ugc - Don't have audio file yet → use veed-ugc with script INPUT: Image + audio file (user's recording) OUTPUT: MP4 video with lip-sync to provided audio KEY DIFFERENCE: veed-ugc = script → AI voice → video ugc-manual = user audio → video (no voice generation). It is an AI Agent Skill for Claude Code / OpenClaw, with 1003 downloads so far.

How do I install UGC Manual?

Run "/install ugc-manual" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is UGC Manual free?

Yes, UGC Manual is completely free (open-source). You can download, install and use it at no cost.

Which platforms does UGC Manual support?

UGC Manual is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created UGC Manual?

It is built and maintained by Paul de Lavallaz (@pauldelavallaz); the current version is v1.0.2.

💬 Comments