← 返回 Skills 市场
Ppt Video
作者
vincentlau2046-sudo
· GitHub ↗
· v1.4.1
· MIT-0
96
总下载
0
收藏
1
当前安装
2
版本数
在 OpenClaw 中安装
/install ppt-video
功能描述
将PPTX/PDF/HTML演示文稿及背景材料自动转换成1280×720口语化播报视频,支持内容风格识别和语速调节。
安全使用建议
This skill appears to do what it claims, but proceed with caution: 1) Provide a tightly-scoped input directory containing only files you intend the skill to process (avoid passing root or broad workspace paths). 2) The Node script uses execSync with concatenated command strings; malicious or specially-crafted filenames might cause command injection or break quoting—sanitize filenames before running, or run the skill in a sandboxed environment (container or VM). 3) The docs disagree on resolution and framerate—verify the produced video matches your requirements before using in production. 4) edge-tts likely requires network access; confirm privacy/usage expectations for TTS output. 5) If you need higher assurance, review the remainder of generate.js (the truncated part) for any network calls, hidden endpoints, or remote uploads. If you lack the ability to sandbox, consider running only on non-sensitive test material or asking the author for clarifications/patches that avoid shell concatenation (use execFile or spawn with argument arrays and strict sanitization).
功能分析
Type: OpenClaw Skill
Name: ppt-video
Version: 1.4.1
The skill bundle provides a functional pipeline for converting presentations to videos using Node.js, Python, and external CLI tools like FFmpeg and edge-tts. It is classified as suspicious due to significant command injection vulnerabilities in `scripts/generate.js`. The script uses `execSync` to execute shell commands with strings constructed from user-provided file content and paths. While it attempts basic sanitization by removing backticks and escaping double quotes, it fails to sanitize other shell metacharacters (notably `$`), which could allow arbitrary code execution via command substitution if a malicious input file is processed. Repository: https://github.com/vincentlau2046-sudo/ppt-video.git.
能力评估
Purpose & Capability
Name/description match the included code: the scripts extract text, render screenshots, run TTS and assemble video. Declared runtime dependencies in SKILL.md (libreoffice, pdftoppm/poppler, ffmpeg, ImageMagick/inkscape, edge-tts) align with the conversion tasks. Minor inconsistencies exist in documented output specs (some docs say 1280×720, others 1024×720; frame rate and resolution differ across README/SKILL.md/package.json and generate.js). These are likely documentation drift rather than malicious intent.
Instruction Scope
SKILL.md and scripts instruct the agent to recursively scan the provided input directory for PPT/MD/TXT files and then run external commands (libreoffice, pdftoppm, convert/inkscape, edge-tts, ffmpeg). That behavior is necessary for the task but broad: recursive scanning may pick up unexpected files if the input path is not tightly scoped, and generate.js constructs shell commands by concatenating user-controlled paths into execSync strings. While many variables are wrapped in quotes, concatenation of arbitrary file names into shell commands creates a potential command-injection/escaping risk and could lead to unintended command execution or processing of sensitive files if the input directory is not controlled.
Install Mechanism
No install spec in the registry (instruction-only with code files). The skill relies on common system packages and Python/Node packages documented in SKILL.md/README. No remote downloads or obscure install URLs are used in the metadata; installation is manual via apt/pip and npm runtime, which is proportionate to the functionality.
Credentials
The skill does not request environment variables, secrets, or external credentials in registry metadata. Scripts use standard paths (HOME/.openclaw/workspace examples) and allow optional NODE_BIN override in tests. There are no declared or required API keys—TTS is expected to use the local 'edge-tts' tool which may require network access but no secret is requested by the skill.
Persistence & Privilege
Skill flags are default: always=false and agent invocation allowed. The skill does not request persistent platform privileges and does not attempt to modify other skills or system-wide agent configuration. It writes output to configured directories and temporary files within the project/output directories as expected.
如何使用
- 确保已安装 OpenClaw(本地或 Docker 部署)
- 在对话框中输入安装命令:
/install ppt-video - 安装完成后,直接呼叫该 Skill 的名称或使用
/ppt-video触发 - 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.4.1
修复硬编码路径安全问题,移除 /home/Vincent/ 引用,改用环境变量和相对路径
v3.2.0
ppt-video v3.2.0
- 新增讲稿口语化重写功能,自动将书面语转换为汇报/新闻风格
- 优化句子长度(自动调整为 15–25 字/句)
- 自动添加连接词,使内容更自然流畅(先重点后事实)
- 完善多格式支持与输入优先级说明
- 优化 TTS 语速逻辑和标点停顿策略
- 加强页面对齐及音画同步验证
元数据
常见问题
Ppt Video 是什么?
将PPTX/PDF/HTML演示文稿及背景材料自动转换成1280×720口语化播报视频,支持内容风格识别和语速调节。 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 96 次。
如何安装 Ppt Video?
在 OpenClaw 或 Claude Code 对话框中运行命令「/install ppt-video」即可一键安装,无需额外配置。
Ppt Video 是免费的吗?
是的,Ppt Video 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。
Ppt Video 支持哪些平台?
Ppt Video 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。
谁开发了 Ppt Video?
由 vincentlau2046-sudo(@vincentlau2046-sudo)开发并维护,当前版本 v1.4.1。
推荐 Skills