← 返回 Skills 市场

uni-vision-engine

Name: uni-vision-engine
Author: jiahuamld

作者 jiahuamld · GitHub ↗ · v1.0.2 · MIT-0

cross-platform ⚠ suspicious

430

总下载

当前安装

版本数

在 OpenClaw 中安装

/install uni-vision-engine

功能描述

Automated high-quality video generation (text-to-video, image-to-video) via a local jimeng-api Docker service. Features native OpenClaw image interception, a...

安全使用建议

Before installing or enabling this skill, consider the following: - Privacy: The SKILL.md tells the agent to automatically 'intercept' image payloads from the chat; ensure you want the agent to extract and upload user-provided images (including any sensitive content) to a model service. Confirm user consent flows. - Embedded credential: The script contains a hard-coded session token. Treat this as a secret — it may grant access to the Jimeng service. Either remove it or replace it with a configuration mechanism (and do not publish secrets). - Runtime install: The script runs 'npm install' via shell at execution time. If you prefer deterministic installs, pre-install dependencies in a controlled environment rather than allowing runtime network installs. - Local service trust: The script talks to localhost:5100 (jimeng-api Docker). Verify that this local service is under your control and does not forward images or logs externally. If you don't run a local jimeng-api, the script will fail or may reveal that embedded session token is intended to reach a non-local endpoint (investigate first). - Principle of least privilege: If you cannot audit the docker image/service and you do not trust the embedded token, do not enable the automatic interception behavior; require explicit user consent or manual upload instead. If you want help: I can suggest concrete edits to the SKILL.md and scripts to remove the hard-coded token, replace the runtime npm install with an explicit installation step, and make image interception explicit and consent-driven.

功能分析

Type: OpenClaw Skill Name: uni-vision-engine Version: 1.0.2 The skill contains a hardcoded session token in `scripts/generate.js` and uses `execSync` to dynamically install the `form-data` NPM package at runtime, which is a risky practice. While the behavior aligns with the stated purpose of interfacing with a local Docker-based video generation API (localhost:5100), the inclusion of hardcoded credentials and the use of shell execution for dependency management are significant security vulnerabilities.

能力评估

ℹ Purpose & Capability

The skill's name and description match the included code: it expects a local jimeng-api (localhost:5100) and performs image→video/text→video requests. However, the bundled script contains a hard-coded session token (a cached credential) and a default model/port; the SKILL.md says a valid sessionid is required, but the code will silently use the embedded session if you don't supply one. Embedding a credential inside the code is unexpected given the manifest lists no required credentials and is disproportionate to the stated install-free instruction-only approach.

⚠ Instruction Scope

SKILL.md explicitly instructs the agent to 'MUST' intercept image payloads from chat context (base64 or cache path), save them locally (/tmp/target.jpg), and submit them automatically. That gives the agent broad discretion to read chat internals and extract binary content. While this is necessary for image-to-video functionality, the instructions are imperative and broad (could be applied to any image in chat) and therefore increase risk of unintended data access/exfiltration. The instructions also require monitoring Docker logs to retrieve results — no explicit safeguards or user consent checks are specified.

⚠ Install Mechanism

There is no declared install spec, but scripts/generate.js performs a dynamic runtime 'npm install form-data --no-save' via child_process.execSync when run. This is a network operation that installs code from the npm registry at execution time and writes to disk (node_modules). Dynamic, implicit installs via execSync are higher-risk than a declared install step and are surprising for an 'instruction-only' skill.

⚠ Credentials

The skill declares no required environment variables or credentials, but the script contains a hard-coded sessionToken string (b79fcda2...). Embedded credentials are effectively secret material and are disproportionate/unexpected. The script also accepts a --session override, but shipping a usable session inside the code can be abused or leak access. The code sends user-provided images to localhost:5100 only (no obvious external endpoints), but you should verify the local jimeng-api doesn't forward data externally.

ℹ Persistence & Privilege

The skill does not request 'always: true' and does not modify other skills or system-wide configuration. It will, however, create node_modules at runtime when dynamically installing 'form-data' and writes temp image files to /tmp. Those behaviors are relatively limited but worth noting as they create on-disk artifacts and perform network installs the first time they're invoked.

如何使用

确保已安装 OpenClaw（本地或 Docker 部署）
在对话框中输入安装命令：/install uni-vision-engine
安装完成后，直接呼叫该 Skill 的名称或使用 /uni-vision-engine 触发
根据 Skill 的参数说明提供必要输入，即可获得结构化输出

版本历史

v1.0.2

- Updated documentation and description for clearer English instructions and broader accessibility. - Removed the UI server component (`ui/server.js`), signaling a shift to fully headless, automation-focused operation. - Skill no longer references integrated web UI; focus is now on native chat-based and script-driven image/video generation. - Clarified workflow for automated image interception, strict file handling, and moderation/error responses. - CLI instructions and usage examples updated for consistency with the latest workflow.

v1.0.1

Uni Vision Engine v1.0.1 Changelog - Added ui/server.js, introducing a web-based UI server component to the project. - Enhanced description: now highlights a built-in visual web upload tool and native image extraction from chat, supporting direct image-to-video generation from chat interfaces. - Updated SKILL.md with clearer best practice guidance for handling image messages and using filesystem-based uploads for high-quality video generation. - Added detailed instructions for dealing with domestic content review mechanisms and error codes.

v1.0.0

Uni Vision Engine 1.0.0 - 首次发布，支持通过本地 jimeng-api 自动生成无水印高质量视频（文生视频、图生视频）。 - 集成自动积分检测，确保账户积分充足后再生成视频。 - 完成后自动写入历史记录，并在本地 HTML 控制台可视化管理所有任务。 - 支持通过 sessionid 完全控制视频生成流程。

元数据

Slug uni-vision-engine

版本 1.0.2

许可证 MIT-0

累计安装 1

当前安装数 1

历史版本数 3

常见问题

uni-vision-engine 是什么？

Automated high-quality video generation (text-to-video, image-to-video) via a local jimeng-api Docker service. Features native OpenClaw image interception, a... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件，目前累计下载 430 次。

如何安装 uni-vision-engine？

在 OpenClaw 或 Claude Code 对话框中运行命令「/install uni-vision-engine」即可一键安装，无需额外配置。

uni-vision-engine 是免费的吗？

是的，uni-vision-engine 完全免费，采用 MIT-0 许可证，可自由下载、安装和使用。

uni-vision-engine 支持哪些平台？

uni-vision-engine 跨平台运行，可在任意部署了 OpenClaw / Claude Code 的环境中使用（cross-platform）。

谁开发了 uni-vision-engine？

由 jiahuamld（@jiahuamld）开发并维护，当前版本 v1.0.2。