← 返回 Skills 市场
michaelgold

Video Skill

作者 Michael Gold · GitHub ↗ · v0.1.2
cross-platform ✓ 安全检测通过
360
总下载
1
收藏
2
当前安装
3
版本数
在 OpenClaw 中安装
/install video-skill
功能描述
Run the video-skill pipeline to convert narrated videos into structured step data and enriched timeline-ready outputs. Use when a user asks to process a vide...
安全使用建议
This package appears coherent for its stated purpose, but review these before running: 1) Provider trust: the enrichment steps will send transcript text and base64-encoded frame images to whichever base_url you configure for 'reasoning' and 'vlm' — only point those at services you control or trust. 2) Model downloads & docker: bootstrap_models.sh will download large model files (requires HF CLI and an authenticated account) and the docker-compose file pulls images from GHCR — verify sources and run in a machine with sufficient disk/GPU or use an isolated VM/container. 3) Local commands & subprocesses: the tool invokes ffmpeg and subprocess.run (clip extraction); run on files you trust and consider limiting permissions/using a sandbox. 4) Config review: config.example.json leaves api_key_env null; if you populate api_key_env make sure env vars are set appropriately and contain only credentials for intended providers. 5) Minor docs inconsistency: SKILL.md says 'no repo clone required' but many commands expect a local repo — follow the README/INSTRUCTIONS for correct setup. If you need higher assurance, review the remaining truncated source files (settings and any network code) and run the pipeline in a disposable container before using with sensitive data.
功能分析
Type: OpenClaw Skill Name: video-skill Version: 0.1.2 The OpenClaw AgentSkills bundle 'video-skill' appears benign. Its primary function is to process videos using AI models, involving file I/O, external API calls to configured endpoints, and local execution of `ffmpeg` for video/frame extraction. API keys are handled securely via environment variables. The `SKILL.md` and other documentation provide instructions for using the skill's CLI, without any evidence of prompt injection attempts designed to manipulate the AI agent into unauthorized actions. The use of `subprocess.run` with a list of arguments for `ffmpeg` commands in `src/video_skill_extractor/clips.py` and `src/video_skill_extractor/frames.py` mitigates common shell injection vulnerabilities, as arguments are passed directly to the executable rather than interpreted by a shell. No signs of data exfiltration, persistence, or other malicious behaviors were found.
能力评估
Purpose & Capability
Name/description (convert narrated videos into steps and enriched outputs) aligns with the code and CLI commands. The required binaries (uv, ffmpeg, python3) are reasonable for a Python CLI that uses ffmpeg and the uv packaging/runtime. The included scripts, docker-compose, and model-bootstrapping are appropriate for a self-hosted model-backed pipeline.
Instruction Scope
SKILL.md and the CLI instruct the agent/operator to run transcription, chunking, extraction, frame sampling, enrichment, and markdown rendering. The instructions direct the tool to call configured model provider endpoints and to read and base64-encode image files (frames) and include them in model requests — expected for VLM-based enrichment but important to note: large binary image payloads will be sent to whatever provider URL is configured. SKILL.md contains a minor contradiction (claims 'no repo clone required' while showing commands that assume a local repo).
Install Mechanism
The registry lists no automated install spec (instruction-only). The bundle nevertheless contains many source files and helper scripts. This is not itself dangerous, but the provided scripts (scripts/bootstrap_models.sh) will download large model binaries from Hugging Face via the HF CLI and docker-compose references GHCR images — both reasonable for a local/self-hosted setup but require trust in those sources and will write substantial data to disk.
Credentials
The skill does not require any environment variables by default (config.example.json uses api_key_env:null). The code supports provider API keys if configured, which is appropriate for calling model endpoints. There are no unrelated credentials requested in the manifest. Note: if you set provider.api_key_env to point to an env var, that env var will be used to authenticate requests to the configured model endpoints — so only set keys for providers you trust.
Persistence & Privilege
always:false and no install spec means the skill does not request forced persistent inclusion or elevated platform privileges. It performs normal file I/O (reads/writes JSONL, writes frames/clips, runs ffmpeg subprocesses) and spawns subprocesses; this is expected behavior for a CLI tool of this scope.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install video-skill
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /video-skill 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v0.1.2
fix: don't mention github
v0.1.1
fix: requirements
v0.1.0
Initial public release
元数据
Slug video-skill
版本 0.1.2
许可证
累计安装 2
当前安装数 2
历史版本数 3
常见问题

Video Skill 是什么?

Run the video-skill pipeline to convert narrated videos into structured step data and enriched timeline-ready outputs. Use when a user asks to process a vide... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 360 次。

如何安装 Video Skill?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install video-skill」即可一键安装,无需额外配置。

Video Skill 是免费的吗?

是的,Video Skill 完全免费(开源免费),可自由下载、安装和使用。

Video Skill 支持哪些平台?

Video Skill 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 Video Skill?

由 Michael Gold(@michaelgold)开发并维护,当前版本 v0.1.2。

💬 留言讨论