← 返回 Skills 市场
101
总下载
0
收藏
0
当前安装
1
版本数
在 OpenClaw 中安装
/install video-to-doc
功能描述
将操作视频自动转换为图文并茂的Word操作指南文档,支持智能截图、语音转录、LLM内容提炼和流程图生成
安全使用建议
This package appears coherent for converting tutorial videos into Word guides. Before installing:
- Ensure you have ffmpeg and Python available; the included install script uses apt/brew and pip to add expected packages.
- read_image is referenced as a platform built-in: confirm your agent environment provides that tool before relying on the main-dialog analysis step.
- Optional API keys (ANTHROPIC_API_KEY, OPENAI_API_KEY, and OpenAI Whisper API key) improve LLM/refinement performance; do not provide keys unless you trust the runtime environment. Keys are used by scripts to call remote LLMs — providing them grants those scripts access to those services.
- The scripts process local files (extracted frames, audio, transcripts). Make sure you run the skill only on videos you are allowed to process (they may contain sensitive UI or PII).
- If you need stricter isolation, run the pipeline locally without supplying external API keys and avoid enabling remote model calls.
功能分析
Type: OpenClaw Skill
Name: video-to-doc
Version: 1.0.0
The skill bundle is a legitimate tool designed to convert video tutorials into Word documentation. It utilizes a multi-step process involving frame extraction (via ffmpeg), audio transcription (via Whisper), and content refinement using LLMs (Claude/OpenAI). The code includes a license verification and usage tracking mechanism that communicates with 'xiaping.coze.site', which is consistent with the commercial metadata found in 'skill.json'. All scripts (Python, Node.js, Shell) and the 'SKILL.md' instructions are aligned with the stated purpose, and no evidence of malicious intent, unauthorized data exfiltration, or persistence was found.
能力评估
Purpose & Capability
Name/description (video → Word guide) match the included scripts: frame extraction, transcription, frame analysis, LLM refinement, and doc generation. The SKILL.md and code reference only relevant tools (ffmpeg, whisper/faster-whisper, python-docx, read_image) consistent with the stated capability.
Instruction Scope
Runtime instructions focus on extracting frames, transcribing audio, calling the platform's read_image for visual analysis, merging results, and generating a Word doc. They instruct the main agent to read files under the frames directory and run read_image on those images — this is expected for the stated task and does not ask for unrelated system data or arbitrary file paths.
Install Mechanism
No install spec in registry (instruction-only), but the repository includes a sensible install script (scripts/install_deps.sh) that uses apt/brew and pip to install expected dependencies. There are no downloads from unknown hosts or URL shorteners; no extracted arbitrary archives. Installation behavior is proportionate to the task.
Credentials
The skill declares no required env vars, which matches registry metadata. Several scripts optionally use third-party API keys (ANTHROPIC_API_KEY, OPENAI_API_KEY) for improved LLM refinement, and SKILL.md mentions optional OpenAI Whisper API and local faster-whisper. These optional keys are reasonable for higher-quality processing but are not required for basic local operation.
Persistence & Privilege
Skill is not always-enabled (always: false) and uses normal autonomous invocation settings. It does not request system-wide configuration changes or other skills' credentials. The skill operates on local files it creates (frames, transcripts) and does not attempt to modify agent configuration.
如何使用
- 确保已安装 OpenClaw(本地或 Docker 部署)
- 在对话框中输入安装命令:
/install video-to-doc - 安装完成后,直接呼叫该 Skill 的名称或使用
/video-to-doc触发 - 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.0
Initial release: 支持视频转图文Word文档
元数据
常见问题
视频转操作指南 是什么?
将操作视频自动转换为图文并茂的Word操作指南文档,支持智能截图、语音转录、LLM内容提炼和流程图生成. 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 101 次。
如何安装 视频转操作指南?
在 OpenClaw 或 Claude Code 对话框中运行命令「/install video-to-doc」即可一键安装,无需额外配置。
视频转操作指南 是免费的吗?
是的,视频转操作指南 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。
视频转操作指南 支持哪些平台?
视频转操作指南 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。
谁开发了 视频转操作指南?
由 SIYOU315(@siyou315)开发并维护,当前版本 v1.0.0。
推荐 Skills