← Back to Skills Marketplace
Talking Head Production
by
Ömer Karışman
· GitHub ↗
· v0.1.5
814
Downloads
0
Stars
2
Active Installs
2
Versions
Install in OpenClaw
/install talking-head-production
Description
Talking head video production with AI avatars, lipsync, and voiceover. Covers portrait requirements, audio quality, OmniHuman, PixVerse lipsync, Dia TTS. Use...
Usage Guidance
This skill appears to do what it claims (create talking‑head videos) but it directs you to run a remote installer (curl | sh) from cli.inference.sh and to upload images/audio to third‑party inference services. Before installing or running anything: (1) do not run curl | sh from an unfamiliar domain without verifying the source — prefer official release pages or GitHub releases and verify checksums yourself; (2) research the inference.sh service (privacy, data retention, who runs dist.inference.sh), and create a throwaway/test account first if you want to experiment; (3) confirm where and how infsh stores login credentials and what scopes the login grants; (4) avoid uploading sensitive personal data (faces, private audio) until you’ve confirmed data handling policies; (5) consider running the CLI in a sandbox or VM and inspect the installer script before executing. If you want, provide the cli.inference.sh and dist.inference.sh URLs or any homepage/source repository you can find and I can review those artifacts for stronger assurance.
Capability Analysis
Type: OpenClaw Skill
Name: talking-head-production
Version: 0.1.5
The skill bundle is classified as suspicious due to the `curl -fsSL https://cli.inference.sh | sh` command provided in `SKILL.md` for installing the `inference.sh` CLI. While this instruction is intended for the user to set up their environment and the agent's `allowed-tools` are strictly limited to `Bash(infsh *)` (preventing the agent from executing arbitrary shell commands), the `curl | sh` method itself represents a significant supply chain vulnerability. If `cli.inference.sh` were compromised, executing this command could lead to arbitrary code execution on the user's system. There is no evidence of intentional malicious behavior or prompt injection attempts against the agent.
Capability Assessment
Purpose & Capability
The name, description and runtime instructions consistently describe generating TTS, lipsync and avatar videos via the infsh CLI and specific apps (falai, bytedance). The requested operations (generate audio, run OmniHuman, PixVerse lipsync) align with the stated purpose.
Instruction Scope
SKILL.md instructs the agent/user to curl|sh a remote installer and to run infsh commands that will read local media files and submit them to remote apps. It does not instruct reading unrelated system files, but it implicitly requires logging into inference.sh and uploading potentially sensitive media without documenting privacy, storage, or retention policies.
Install Mechanism
Although the registry has no formal install spec, SKILL.md tells users to run curl -fsSL https://cli.inference.sh | sh which is a high‑risk pattern (download-and-execute). The domains (cli.inference.sh, dist.inference.sh) are not standard well-known release hosts like GitHub releases; the doc claims checksum verification but the initial bootstrap still executes remote script — this is disproportionate risk for an instruction-only skill.
Credentials
The skill declares no required env vars, which is consistent, but it calls infsh login (implying credentials are needed) and invokes third‑party apps (bytedance, falai). The skill does not declare where credentials are stored, what scopes are required, or whether user media will be uploaded to third parties — this lack of detail about credentials and data handling is a proportionality/privacy concern.
Persistence & Privilege
The skill is not always-included and does not request elevated platform privileges. It does cause installation of an external CLI (per SKILL.md), but it does not request to modify other skills or system-wide config in the documented instructions.
How to Use
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat:
/install talking-head-production - After installation, invoke the skill by name or use
/talking-head-production - Provide required inputs per the skill's parameter spec and get structured output
Version History
v0.1.5
- Initial release of talking-head-production skill.
- Provides workflows for creating AI avatar talking head videos, including requirements for portraits and audio.
- Documents models supported: OmniHuman, PixVerse lipsync, Dia TTS, and more.
- Includes best practices, sample commands, segmenting long videos, and multi-character support.
- Covers common mistakes and practical troubleshooting steps.
v0.1.0
Initial release: Create AI-powered talking head videos from portraits and voiceover.
- Supports OmniHuman and PixVerse models for lipsync and animation.
- Details professional portrait and audio requirements for best results.
- Provides CLI-based production workflows for basic, captioned, multi-segment, and multi-character videos.
- Includes guidelines to avoid common mistakes and maximize video quality.
- Lists related skills and tools for extended functionality.
Metadata
Frequently Asked Questions
What is Talking Head Production?
Talking head video production with AI avatars, lipsync, and voiceover. Covers portrait requirements, audio quality, OmniHuman, PixVerse lipsync, Dia TTS. Use... It is an AI Agent Skill for Claude Code / OpenClaw, with 814 downloads so far.
How do I install Talking Head Production?
Run "/install talking-head-production" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.
Is Talking Head Production free?
Yes, Talking Head Production is completely free (open-source). You can download, install and use it at no cost.
Which platforms does Talking Head Production support?
Talking Head Production is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).
Who created Talking Head Production?
It is built and maintained by Ömer Karışman (@okaris); the current version is v0.1.5.
More Skills