← 返回 Skills 市场
795
总下载
0
收藏
3
当前安装
2
版本数
在 OpenClaw 中安装
/install video-understand
功能描述
Analyze and understand video content using AI. Upload local files, YouTube URLs, or HTTP video URLs for detailed analysis, Q&A, and timestamped breakdowns.
安全使用建议
Before installing or using this skill, consider the following:
- Metadata mismatch: the registry claims no required env vars or binaries, but the docs require GEMINI_API_KEY or MOONSHOT_API_KEY, Node.js/npm, and optionally yt-dlp. Treat that as a red flag and verify the package source.
- Verify the npm package: inspect the video-understand package on npm/GitHub (source code, maintainer, recent publishes) before running npm install -g; prefer installing from a source you trust.
- Credential safety: the skill stores API keys (or reads env vars) and will upload video content to third‑party providers. Only provide keys for providers you trust and avoid uploading sensitive video content.
- yt-dlp dependency: if you plan to use Kimi with YouTube, the skill relies on an external downloader (yt-dlp) which is not declared in the registry metadata—install that from an official package source and be cautious when running downloads.
- Local files and cache: it creates ~/.video-understand/config.json and uploads cache; review and remove cached files if they contain sensitive material.
- If you need higher assurance: request the skill's source code or the npm package tarball to review what it does locally (especially any code that would upload files or persist keys). If you cannot verify the package source, treat the skill as untrusted.
If you plan to proceed, limit the API key scope (if supported), avoid uploading sensitive videos, and inspect the installed CLI's code before giving it credentials.
功能分析
Type: OpenClaw Skill
Name: video-understand
Version: 1.0.1
The skill is classified as suspicious due to its reliance on external command execution and network requests, which introduce potential vulnerabilities. Specifically, the `SKILL.md` and `rules/install.md` files indicate that the Kimi provider downloads YouTube videos using `yt-dlp` and other HTTP videos via `fetch`. While `yt-dlp` is a legitimate tool, passing user-controlled URLs to an external command like `yt-dlp` without robust sanitization could lead to shell injection vulnerabilities. Although the `SKILL.md` includes a commendable warning against prompt injection from video content, the underlying mechanism of invoking external tools with potentially untrusted input remains a risk.
能力评估
Purpose & Capability
The skill's stated purpose (analyzing videos via Gemini and Kimi) legitimately requires provider API keys and may need yt-dlp for YouTube downloads. However, the registry metadata lists no required env vars or binaries while the SKILL.md and install.md explicitly reference GEMINI_API_KEY, MOONSHOT_API_KEY, Node.js/npm, and yt-dlp. That discrepancy is an inconsistency between what the skill says it needs and what the registry claims.
Instruction Scope
SKILL.md stays on-topic (upload local files or URLs, analyze, ask follow-ups, list/delete uploads) and explicitly warns that third‑party video content is untrusted. It documents caching (~/.video-understand) and provider behavior. Nothing in the instructions attempts to read unrelated system files or exfiltrate secrets, but it does instruct uploads of potentially sensitive video content to external providers (privacy risk) and to run or rely on external tools (yt-dlp) that the registry did not declare as required.
Install Mechanism
There is no formal install spec in the registry (instruction-only), but rules/install.md directs users to install an npm package globally (npm install -g video-understand) and requires Node.js 18+. Installing an unvetted npm package has inherent risk—verify the package on npm and check its source. The install instructions for yt-dlp point to system package managers (winget/brew/apt/uv), which is expected for that tool but again is not declared in the registry metadata.
Credentials
The skill uses GEMINI_API_KEY and MOONSHOT_API_KEY (and suggests storing keys in ~/.video-understand/config.json), which are proportional to its function. The concern is that the registry declared no required env vars while the docs require API keys and may save them to disk — the metadata and the runtime instructions are out of sync, which could mislead users into granting credentials without realizing it.
Persistence & Privilege
The skill does not request elevated platform privileges and is not always-enabled. It stores config and upload caches under ~/.video-understand and may retain uploaded files (Kimi: persists until deleted; Gemini: ~48h). This is expected behavior but has privacy implications — users should be aware files are uploaded and cached locally and remotely.
如何使用
- 确保已安装 OpenClaw(本地或 Docker 部署)
- 在对话框中输入安装命令:
/install video-understand - 安装完成后,直接呼叫该 Skill 的名称或使用
/video-understand触发 - 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.1
- Added a third-party content warning to the documentation, advising users to treat analysis results from YouTube and arbitrary URLs as untrusted data.
- No functional changes. Documentation update only.
v1.0.0
Initial release of video-understand — a CLI skill that gives AI agents the ability to analyze and understand video content, even with non-multimodal LLMs.
- Supports local files, YouTube URLs, and HTTP video URLs
- Core commands: analyze, ask (follow-up Q&A), upload, list, delete
- Timestamped breakdowns, structured JSON output, and file export
- Deduplicates uploads via file hash — no redundant re-uploads
- Multiple providers: Google Gemini and Moonshot AI (Kimi)
- Supports MP4, MOV, WebM, AVI, and more
元数据
常见问题
video-understand 是什么?
Analyze and understand video content using AI. Upload local files, YouTube URLs, or HTTP video URLs for detailed analysis, Q&A, and timestamped breakdowns. 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 795 次。
如何安装 video-understand?
在 OpenClaw 或 Claude Code 对话框中运行命令「/install video-understand」即可一键安装,无需额外配置。
video-understand 是免费的吗?
是的,video-understand 完全免费(开源免费),可自由下载、安装和使用。
video-understand 支持哪些平台?
video-understand 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。
谁开发了 video-understand?
由 sifr42(@sifr42)开发并维护,当前版本 v1.0.1。
推荐 Skills