slide-to-video-converter
by hzsunzixiang · v1.0.0 · MIT-0
124 Downloads · 0 Stars · 0 Active Installs · 1 Version
Install in OpenClaw
/install slide-to-video-converter
Description
End-to-end pipeline for converting PPT/PPTX/PDF slides with speaker notes into narrated MP4 videos. Defaults to Edge TTS (Microsoft free online API) for univ...
Usage Guidance
This skill appears to do what it claims: convert PPT/PPTX/PDF + JSON speaker notes into narrated MP4s. Before installing/running, consider:
- Privacy: default Edge TTS will send your slide text to Microsoft's online service; the HTTP TTS option will POST slide text to whatever server you configure. If your slides contain sensitive information, avoid Edge/HTTP modes or point the HTTP mode at a trusted local server.
- Offline use: the 'direct' Qwen3-TTS mode loads a local model which may require large downloads and GPU/Metal compilation; expect heavy resource use and follow tts_server.py's instructions.
- Dependencies & system tools: you must install ffmpeg, poppler, and (for PPTX→PDF) libreoffice; the pipeline uses subprocess to invoke these tools. Run inside a controlled environment (virtualenv, container) if you have security or reproducibility concerns.
- Review tts_server.py if you plan to run the included HTTP server or point to a remote HTTP TTS endpoint; verify it doesn't expose unintended network interfaces.
If you understand the data-flow (local files → optional network TTS) and are comfortable with dependencies and privacy trade-offs, the skill is coherent and reasonable to use.
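The dependency point above can be checked before running anything. The following is a minimal sketch, not part of the skill: the executable names are assumptions (pdftoppm is a utility shipped with poppler, and LibreOffice is commonly exposed as soffice or libreoffice), and missing_tools is a hypothetical helper name.

```python
import shutil

# Executable names are assumptions: pdftoppm ships with poppler, and
# LibreOffice is usually exposed as "soffice" or "libreoffice".
REQUIRED_TOOLS = {
    "ffmpeg": ["ffmpeg"],
    "poppler": ["pdftoppm"],
    "libreoffice": ["soffice", "libreoffice"],
}


def missing_tools(required=REQUIRED_TOOLS, which=shutil.which):
    """Return the tool names for which no candidate executable is on PATH."""
    return [
        name
        for name, candidates in required.items()
        if not any(which(exe) for exe in candidates)
    ]


if __name__ == "__main__":
    missing = missing_tools()
    print("missing:", ", ".join(missing) if missing else "none")
```

The which parameter exists only so the lookup can be swapped out in tests; in normal use the default shutil.which searches the current PATH.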
Capability Analysis
Type: OpenClaw Skill
Name: slide-to-video-converter
Version: 1.0.0
The slide-to-video-converter skill bundle is a legitimate and well-structured tool for converting presentations into narrated videos. It implements a multi-stage pipeline built on standard tools and libraries such as MoviePy, FFmpeg, and Edge TTS. The scripts (e.g., pipeline.py, compose_video.py, and tts_server.py) perform the expected tasks: PDF/PPTX conversion, speech synthesis, and video encoding. The bundle includes a shell script (restart_tts.sh) that manages local processes and Python scripts that execute subprocesses, but these are used exclusively for the stated purpose of media processing and local service management; there is no evidence of malicious intent, data exfiltration, or unauthorized access.
Capability Assessment
Purpose & Capability
The name/description (slide → narrated video) matches the included scripts (PDF/PPTX→images, TTS, per-slide composition, merge). Requested system commands (ffmpeg, libreoffice, poppler) and Python packages are appropriate for the described work. No unrelated binaries, env vars, or config paths are required.
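To make the per-slide composition step concrete, here is an illustrative sketch of how one slide image and one narration file could be joined with the ffmpeg CLI. It is not the skill's implementation (the listing says the bundle uses MoviePy in compose_video.py); still_image_video_cmd and the file names are hypothetical, and the function only builds the argument list without running it.

```python
def still_image_video_cmd(image, audio, output):
    """Build an ffmpeg argv that shows one image for the length of one audio file."""
    return [
        "ffmpeg",
        "-y",                    # overwrite the output file if it exists
        "-loop", "1",            # repeat the single input image as video frames
        "-i", str(image),
        "-i", str(audio),
        "-c:v", "libx264",
        "-tune", "stillimage",
        "-pix_fmt", "yuv420p",   # widely compatible pixel format
        "-c:a", "aac",
        "-shortest",             # stop when the shorter stream (the audio) ends
        str(output),
    ]


if __name__ == "__main__":
    # Hypothetical file names, printed rather than executed.
    print(" ".join(still_image_video_cmd("slide_001.png", "slide_001.mp3", "slide_001.mp4")))
```

Returning a list rather than a shell string means the command can be passed to subprocess.run without shell quoting concerns.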
Instruction Scope
SKILL.md and scripts operate on local slide files (slides/presentation.pdf or .pptx), a subtitles JSON, and assets/config.json. They do perform network I/O for TTS: default Edge TTS uses an online Microsoft endpoint via the edge-tts package, and the HTTP mode will POST the text payload to whatever _tts_http_url you configure. This network behavior is expected by the skill but is a privacy consideration (your slide text will be transmitted).
Install Mechanism
No automated install spec is provided (instruction-only). Dependencies are installed via common package managers and pip as documented. The 'direct' Qwen3-TTS mode implies loading a large local model and native GPU/Metal compilation steps, which is expected for local TTS. The install instructions contain no downloads from obscure URLs.
Credentials
The skill declares no required environment variables or credentials, which is consistent. However: (1) Edge TTS transmits text to Microsoft's online service (edge-tts) without an API key by design; (2) HTTP mode will send JSON containing the slide text to the configured HTTP server (default localhost:8100, but can be set to any host). If you set the HTTP URL to an external server, you will be sending content externally. These behaviors are expected but important to consider.
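One way to guard against sending slide text off-machine by accident is to check the configured HTTP TTS URL before use. The sketch below is an assumption-laden illustration, not code from the skill: is_local_endpoint is a hypothetical helper, and it treats only "localhost" and loopback IP addresses as local.

```python
import ipaddress
from urllib.parse import urlsplit


def is_local_endpoint(url):
    """Return True if the URL's host is localhost or a loopback IP address."""
    host = urlsplit(url).hostname
    if host is None:
        return False
    if host == "localhost":
        return True
    try:
        return ipaddress.ip_address(host).is_loopback
    except ValueError:
        # A non-IP hostname other than "localhost" is treated as external.
        return False


if __name__ == "__main__":
    # Host and port follow the default named in the listing.
    print(is_local_endpoint("http://localhost:8100"))
```

The check is deliberately conservative: private LAN addresses and other hostnames are reported as non-local, so a trusted server on your own network would need an explicit allowance.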
Persistence & Privilege
The skill does not request 'always: true' and does not appear to modify other skills or system-wide agent settings. It creates/uses local output directories under the project (output/images, output/audio, output/video) which is normal for a pipeline of this type.
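For reference, the project-local layout described above can be reproduced with a few lines of pathlib. This is a sketch under the assumption that the three directories sit under a single output/ folder as listed; ensure_output_dirs is a hypothetical helper, not a function from the skill.

```python
from pathlib import Path

OUTPUT_SUBDIRS = ("images", "audio", "video")  # names taken from the listing


def ensure_output_dirs(project_root):
    """Create output/images, output/audio and output/video under project_root."""
    root = Path(project_root) / "output"
    paths = [root / name for name in OUTPUT_SUBDIRS]
    for path in paths:
        # exist_ok=True makes repeated runs harmless.
        path.mkdir(parents=True, exist_ok=True)
    return paths
```

Everything stays inside the given project root, which matches the observation that the skill does not touch system-wide locations.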
How to Use
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat: /install slide-to-video-converter
- After installation, invoke the skill by name or use /slide-to-video-converter
- Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
Initial release of Slide to Video Converter.
- Converts PPT, PPTX, or PDF slides with speaker notes into high-quality narrated MP4 videos.
- Features auto-synced subtitles, audio validation, and batch processing.
- Supports three TTS modes: Edge TTS (default, online), Qwen3-TTS (local GPU), and HTTP service.
- Includes optimized PPTX support via LibreOffice, with fallback to python-pptx if needed.
- Flexible pipeline allows per-slide partial processing, custom audio/video options, and quality control settings.
Frequently Asked Questions
What is slide-to-video-converter?
End-to-end pipeline for converting PPT/PPTX/PDF slides with speaker notes into narrated MP4 videos. Defaults to Edge TTS (Microsoft free online API) for univ... It is an AI Agent Skill for Claude Code / OpenClaw, with 124 downloads so far.
How do I install slide-to-video-converter?
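Run "/install slide-to-video-converter" in the OpenClaw or Claude Code chat to install the skill in one step. The system tools it relies on (ffmpeg, poppler, and LibreOffice for PPTX→PDF conversion) must be installed separately, as noted under Usage Guidance.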
Is slide-to-video-converter free?
Yes, slide-to-video-converter is completely free, licensed under MIT-0. You can download, install and use it at no cost.
Which platforms does slide-to-video-converter support?
slide-to-video-converter is cross-platform and runs anywhere OpenClaw / Claude Code is available.
Who created slide-to-video-converter?
It is built and maintained by hzsunzixiang (@hzsunzixiang); the current version is v1.0.0.