← 返回 Skills 市场
hzsunzixiang

slide-to-video-converter

作者 hzsunzixiang · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ✓ 安全检测通过
124
总下载
0
收藏
0
当前安装
1
版本数
在 OpenClaw 中安装
/install slide-to-video-converter
功能描述
End-to-end pipeline for converting PPT/PPTX/PDF slides with speaker notes into narrated MP4 videos. Defaults to Edge TTS (Microsoft free online API) for univ...
安全使用建议
This skill appears to do what it claims: convert PPT/PPTX/PDF + JSON speaker notes into narrated MP4s. Before installing/running, consider: - Privacy: default Edge TTS will send your slide text to Microsoft's online service; the HTTP TTS option will POST slide text to whatever server you configure. If your slides contain sensitive information, avoid Edge/HTTP modes or point the HTTP mode at a trusted local server. - Offline use: the 'direct' Qwen3-TTS mode loads a local model which may require large downloads and GPU/Metal compilation; expect heavy resource use and follow tts_server.py's instructions. - Dependencies & system tools: you must install ffmpeg, poppler, and (for PPTX→PDF) libreoffice; the pipeline uses subprocess to invoke these tools. Run inside a controlled environment (virtualenv, container) if you have security or reproducibility concerns. - Review tts_server.py if you plan to run the included HTTP server or point to a remote HTTP TTS endpoint; verify it doesn't expose unintended network interfaces. If you understand the data-flow (local files → optional network TTS) and are comfortable with dependencies and privacy trade-offs, the skill is coherent and reasonable to use.
功能分析
Type: OpenClaw Skill Name: slide-to-video-converter Version: 1.0.0 The slide-to-video-converter skill bundle is a legitimate and well-structured tool for converting presentations into narrated videos. It implements a multi-stage pipeline using standard libraries like MoviePy, FFmpeg, and Edge TTS. The scripts (e.g., pipeline.py, compose_video.py, and tts_server.py) perform expected tasks such as PDF/PPTX conversion, speech synthesis, and video encoding. While the bundle includes a shell script (restart_tts.sh) that manages local processes and Python scripts that execute subprocesses, these are used exclusively for the stated purpose of media processing and local service management, with no evidence of malicious intent, data exfiltration, or unauthorized access.
能力评估
Purpose & Capability
The name/description (slide → narrated video) matches the included scripts (PDF/PPTX→images, TTS, per-slide composition, merge). Requested system commands (ffmpeg, libreoffice, poppler) and Python packages are appropriate for the described work. No unrelated binaries, env vars, or config paths are required.
Instruction Scope
SKILL.md and scripts operate on local slide files (slides/presentation.pdf or .pptx), a subtitles JSON, and assets/config.json. They do perform network I/O for TTS: default Edge TTS uses an online Microsoft endpoint via the edge-tts package, and the HTTP mode will POST the text payload to whatever _tts_http_url you configure. This network behavior is expected by the skill but is a privacy consideration (your slide text will be transmitted).
Install Mechanism
No automated install spec is provided (instruction-only). Dependencies are installed via common package managers and pip as documented. The 'direct' Qwen3-TTS mode implies large local model loading and native GPU/metal compilation steps (expected for local TTS). There are no downloads from obscure URLs in the install instructions.
Credentials
The skill declares no required environment variables or credentials, which is consistent. However: (1) Edge TTS transmits text to Microsoft's online service (edge-tts) without an API key by design; (2) HTTP mode will send JSON containing the slide text to the configured HTTP server (default localhost:8100, but can be set to any host). If you set the HTTP URL to an external server, you will be sending content externally. These behaviors are expected but important to consider.
Persistence & Privilege
The skill does not request 'always: true' and does not appear to modify other skills or system-wide agent settings. It creates/uses local output directories under the project (output/images, output/audio, output/video) which is normal for a pipeline of this type.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install slide-to-video-converter
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /slide-to-video-converter 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.0
Initial release of Slide to Video Converter. - Converts PPT, PPTX, or PDF slides with speaker notes into high-quality narrated MP4 videos. - Features auto-synced subtitles, audio validation, and batch processing. - Supports three TTS modes: Edge TTS (default, online), Qwen3-TTS (local GPU), and HTTP service. - Includes optimized PPTX support via LibreOffice, with fallback to python-pptx if needed. - Flexible pipeline allows per-slide partial processing, custom audio/video options, and quality control settings.
元数据
Slug slide-to-video-converter
版本 1.0.0
许可证 MIT-0
累计安装 0
当前安装数 0
历史版本数 1
常见问题

slide-to-video-converter 是什么?

End-to-end pipeline for converting PPT/PPTX/PDF slides with speaker notes into narrated MP4 videos. Defaults to Edge TTS (Microsoft free online API) for univ... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 124 次。

如何安装 slide-to-video-converter?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install slide-to-video-converter」即可一键安装,无需额外配置。

slide-to-video-converter 是免费的吗?

是的,slide-to-video-converter 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。

slide-to-video-converter 支持哪些平台?

slide-to-video-converter 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 slide-to-video-converter?

由 hzsunzixiang(@hzsunzixiang)开发并维护,当前版本 v1.0.0。

💬 留言讨论