← Back to Skills Marketplace
101
Downloads
0
Stars
0
Active Installs
1
Versions
Install in OpenClaw
/install video-to-doc
Description
将操作视频自动转换为图文并茂的Word操作指南文档,支持智能截图、语音转录、LLM内容提炼和流程图生成
Usage Guidance
This package appears coherent for converting tutorial videos into Word guides. Before installing:
- Ensure you have ffmpeg and Python available; the included install script uses apt/brew and pip to add expected packages.
- read_image is referenced as a platform built-in: confirm your agent environment provides that tool before relying on the main-dialog analysis step.
- Optional API keys (ANTHROPIC_API_KEY, OPENAI_API_KEY, and OpenAI Whisper API key) improve LLM/refinement performance; do not provide keys unless you trust the runtime environment. Keys are used by scripts to call remote LLMs — providing them grants those scripts access to those services.
- The scripts process local files (extracted frames, audio, transcripts). Make sure you run the skill only on videos you are allowed to process (they may contain sensitive UI or PII).
- If you need stricter isolation, run the pipeline locally without supplying external API keys and avoid enabling remote model calls.
Capability Analysis
Type: OpenClaw Skill
Name: video-to-doc
Version: 1.0.0
The skill bundle is a legitimate tool designed to convert video tutorials into Word documentation. It utilizes a multi-step process involving frame extraction (via ffmpeg), audio transcription (via Whisper), and content refinement using LLMs (Claude/OpenAI). The code includes a license verification and usage tracking mechanism that communicates with 'xiaping.coze.site', which is consistent with the commercial metadata found in 'skill.json'. All scripts (Python, Node.js, Shell) and the 'SKILL.md' instructions are aligned with the stated purpose, and no evidence of malicious intent, unauthorized data exfiltration, or persistence was found.
Capability Assessment
Purpose & Capability
Name/description (video → Word guide) match the included scripts: frame extraction, transcription, frame analysis, LLM refinement, and doc generation. The SKILL.md and code reference only relevant tools (ffmpeg, whisper/faster-whisper, python-docx, read_image) consistent with the stated capability.
Instruction Scope
Runtime instructions focus on extracting frames, transcribing audio, calling the platform's read_image for visual analysis, merging results, and generating a Word doc. They instruct the main agent to read files under the frames directory and run read_image on those images — this is expected for the stated task and does not ask for unrelated system data or arbitrary file paths.
Install Mechanism
No install spec in registry (instruction-only), but the repository includes a sensible install script (scripts/install_deps.sh) that uses apt/brew and pip to install expected dependencies. There are no downloads from unknown hosts or URL shorteners; no extracted arbitrary archives. Installation behavior is proportionate to the task.
Credentials
The skill declares no required env vars, which matches registry metadata. Several scripts optionally use third-party API keys (ANTHROPIC_API_KEY, OPENAI_API_KEY) for improved LLM refinement, and SKILL.md mentions optional OpenAI Whisper API and local faster-whisper. These optional keys are reasonable for higher-quality processing but are not required for basic local operation.
Persistence & Privilege
Skill is not always-enabled (always: false) and uses normal autonomous invocation settings. It does not request system-wide configuration changes or other skills' credentials. The skill operates on local files it creates (frames, transcripts) and does not attempt to modify agent configuration.
How to Use
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat:
/install video-to-doc - After installation, invoke the skill by name or use
/video-to-doc - Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
Initial release: 支持视频转图文Word文档
Metadata
Frequently Asked Questions
What is 视频转操作指南?
将操作视频自动转换为图文并茂的Word操作指南文档,支持智能截图、语音转录、LLM内容提炼和流程图生成. It is an AI Agent Skill for Claude Code / OpenClaw, with 101 downloads so far.
How do I install 视频转操作指南?
Run "/install video-to-doc" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.
Is 视频转操作指南 free?
Yes, 视频转操作指南 is completely free, licensed under MIT-0. You can download, install and use it at no cost.
Which platforms does 视频转操作指南 support?
视频转操作指南 is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).
Who created 视频转操作指南?
It is built and maintained by SIYOU315 (@siyou315); the current version is v1.0.0.
More Skills