Media Generation

Name: Media Generation
Author: lijiazhen0623

By ljz · GitHub · v2.2.0 · MIT-0

cross-platform ⚠ suspicious

Total downloads: 333 · Current installs: 1 · Versions: 4

Install in OpenClaw

/install media-generation

Description

Generate images, edit existing images, create short videos, run inpainting/outpainting and object-focused edits, use reference images as provider inputs, bat...

Security recommendations

This skill appears to implement the advertised media generation and editing capabilities, but it expects (and will read and use) an OpenClaw configuration file and provider API keys or env vars even though the package metadata does not declare them. Before installing:

1) Inspect ~/.openclaw/openclaw.json (or $OPENCLAW_CONFIG) to see which provider keys would be used; do not provide high-privilege credentials unnecessarily.
2) Be aware that the scripts perform outbound HTTP requests and download whatever URLs the provider returns (including internal or unexpected endpoints). Consider running the skill in a network-restricted sandbox to limit egress.
3) If you plan to use the skill, create a least-privilege provider account (an API key limited to media endpoints) and confirm the provider's URLs and behavior.
4) If the registry entry should declare required env vars or config paths, ask the publisher to add them. The lack of declared credentials is an inconsistency and increases risk if you install without inspecting your config.
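The first check above, inspecting the config, could be scripted roughly as follows. This is a minimal sketch and not part of the skill: it assumes the file is plain JSON and that credentials sit under keys named baseUrl and apiKey (the names this page mentions). The helper names config_path, redact, and find_credentials are hypothetical. Because the exact layout of openclaw.json is not documented here, the sketch walks the whole document instead of assuming a fixed schema.

```python
import os
from pathlib import Path


def config_path() -> Path:
    """Return $OPENCLAW_CONFIG if set, otherwise ~/.openclaw/openclaw.json."""
    override = os.environ.get("OPENCLAW_CONFIG")
    if override:
        return Path(override).expanduser()
    return Path.home() / ".openclaw" / "openclaw.json"


def redact(secret: str) -> str:
    """Mask all but the last four characters of a secret."""
    if len(secret) <= 4:
        return "****"
    return "*" * (len(secret) - 4) + secret[-4:]


def find_credentials(node, path=""):
    """Yield (path, baseUrl, redacted apiKey) for every dict holding either key.

    Assumption: credentials are stored under the literal keys "baseUrl" and
    "apiKey"; the surrounding structure is not assumed.
    """
    if isinstance(node, dict):
        if "apiKey" in node or "baseUrl" in node:
            key = node.get("apiKey")
            masked = redact(key) if isinstance(key, str) else None
            yield (path or "<root>", node.get("baseUrl"), masked)
        for name, child in node.items():
            child_path = f"{path}.{name}" if path else name
            yield from find_credentials(child, child_path)
    elif isinstance(node, list):
        for index, child in enumerate(node):
            yield from find_credentials(child, f"{path}[{index}]")
```

To audit a real config, load the file returned by config_path() with json.loads and pass the result to find_credentials. Each yielded tuple shows where a credential lives and which base URL it is paired with, with the key masked so the output is safe to paste into a review.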

Functional analysis

Type: OpenClaw Skill
Name: media-generation
Version: 2.2.0

The media-generation skill bundle is a legitimate and well-structured set of tools for AI-driven image and video creation and editing. It features robust scripts for API interaction, asynchronous job polling (generate_video.py), and complex image manipulations like inpainting and outpainting (mask_inpaint.py, outpaint_image.py). The bundle follows secure coding practices by using list-based subprocess calls to prevent shell injection and handles sensitive provider credentials via the standard OpenClaw configuration mechanism (media_request_common.py). No indicators of malicious intent, such as data exfiltration, unauthorized remote execution, or harmful prompt injection, were detected across the code or documentation.
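For readers unfamiliar with the list-based subprocess pattern mentioned above, here is a small illustration. It is a generic sketch, not code taken from the bundle: the pass_through helper and the inline child program are hypothetical stand-ins for the skill's own scripts. The point is that each list element reaches the child process as one argument, so shell metacharacters in user-supplied text are never interpreted.

```python
import subprocess
import sys

# Tiny child program that writes back its first argument unchanged.
CHILD = "import sys; sys.stdout.write(sys.argv[1])"


def pass_through(value: str) -> str:
    """Send `value` to a child Python process as a single argv element."""
    # No shell is involved: the list is handed to the OS as-is, so
    # characters such as ; $ ` & have no special meaning.
    argv = [sys.executable, "-c", CHILD, value]
    done = subprocess.run(argv, capture_output=True, text=True, check=True)
    return done.stdout
```

By contrast, building one command string and running it with shell=True would let a prompt or file name containing "; rm ..." execute as a second command.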

Capability assessment

⚠ Purpose & Capability

The skill's stated purpose (image and video generation and editing) matches the included scripts. However, the implementation expects an OpenClaw config that defines providers (baseUrl and apiKey) and honours env vars such as OPENCLAW_MEDIA_PROVIDER, OPENCLAW_MEDIA_IMAGE_MODEL, OPENCLAW_MEDIA_EDIT_MODEL, OPENCLAW_MEDIA_VIDEO_MODEL, and OPENCLAW_CONFIG (see references/model-capabilities.md and multiple scripts). The registry metadata declares no required env vars or config paths. That is inconsistent: anyone using this skill needs provider credentials and config, yet the package does not advertise that requirement.
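Before running the skill, it can help to see which of these variables are already set in the current shell. The snippet below is a hedged sketch: the variable names come from the assessment above, while the media_env_overrides helper is hypothetical and says nothing about how the skill itself prioritises env vars against the config file.

```python
import os

# Names taken from the capability assessment; the list may not be exhaustive.
MEDIA_ENV_VARS = (
    "OPENCLAW_CONFIG",
    "OPENCLAW_MEDIA_PROVIDER",
    "OPENCLAW_MEDIA_IMAGE_MODEL",
    "OPENCLAW_MEDIA_EDIT_MODEL",
    "OPENCLAW_MEDIA_VIDEO_MODEL",
)


def media_env_overrides(environ=None):
    """Return the listed variables that are set to a non-empty value."""
    env = os.environ if environ is None else environ
    return {name: env[name] for name in MEDIA_ENV_VARS if env.get(name)}
```

Calling media_env_overrides() with no arguments inspects the real environment; passing a dict makes the check easy to exercise in isolation.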

⚠ Instruction Scope

SKILL.md instructs the agent to run bundled scripts that (a) read local images and masks, (b) read a config file to obtain the provider base URL and API key, (c) send requests to arbitrary provider endpoints, (d) poll status endpoints, and (e) download returned media (including arbitrary URLs, data: URLs, HTML snippets, or b64_json). The scripts also create temp files (tmp/images, tmp/videos, mask dirs) and may write temporary JSON files. These actions are appropriate for media generation, but SKILL.md gives broad discretion (prompt pass-through, use of provider-returned URLs), so the skill can fetch arbitrary remote resources and will read the local provider config. The instructions therefore go beyond a purely local helper: they involve network I/O and credential usage that the metadata does not declare.
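One way to narrow the download risk described above is to vet each provider-returned URL before fetching it. The following is a sketch of that idea, not something the skill is known to do: the function names are hypothetical, and the policy (http/https only, every resolved address must be globally routable) is an assumption that may be too strict or too loose for a given provider. It applies only to URLs fetched over the network; data: URLs and b64_json payloads are decoded locally and would need separate handling.

```python
import ipaddress
import socket
from urllib.parse import urlsplit


def resolve_host(host: str) -> list[str]:
    """Resolve a hostname to the IP addresses it currently maps to."""
    return [info[4][0] for info in socket.getaddrinfo(host, None)]


def is_safe_download_url(url: str, resolver=resolve_host) -> bool:
    """Accept only http(s) URLs whose host resolves to public addresses."""
    parts = urlsplit(url)
    if parts.scheme not in ("http", "https") or not parts.hostname:
        return False
    try:
        addresses = resolver(parts.hostname)
    except OSError:
        # Unresolvable hosts are treated as unsafe.
        return False
    if not addresses:
        return False
    # Strip any IPv6 zone suffix ("%eth0") before parsing the address.
    return all(
        ipaddress.ip_address(address.split("%")[0]).is_global
        for address in addresses
    )
```

A check like this rejects loopback, private-range, and link-local targets such as cloud metadata endpoints. It does not defend against a host whose DNS answer changes between the check and the download, so a network-restricted sandbox remains the stronger control.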

ℹ Install Mechanism

There is no install spec (no external downloads or package installs), which reduces supply-chain risk. However, the bundle includes many executable Python scripts that are written to disk with the skill and invoked at runtime. The lack of an install step is coherent with the repo layout but does not remove runtime risk, because the scripts perform network requests and file I/O.

⚠ Credentials

The code expects and uses provider credentials and config, but the skill metadata lists no required environment variables or config paths. The defaults referenced in the documentation and code include ~/.openclaw/openclaw.json or $OPENCLAW_CONFIG, plus the env vars OPENCLAW_MEDIA_PROVIDER and OPENCLAW_MEDIA_*_MODEL. A provider apiKey and baseUrl are effectively required to call real providers. Not declaring these credentials is an inconsistency and a safety concern: the skill will attempt to read API keys from the user's config and then use them to make outbound calls and downloads.

✓ Persistence & Privilege

The skill does not request always:true and does not modify other skills or system-wide settings. It writes outputs and temporary masks under local directories (tmp/images, tmp/videos, mask dir) and uses subprocess to call bundled scripts, which is expected behavior for a script-based media helper.

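How to use

Make sure OpenClaw is installed (locally or via Docker).
Enter the install command in the chat box: /install media-generation
Once installed, call the skill by name or trigger it with /media-generation
Provide the inputs described in the skill's parameter documentation to get structured output.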

Version history

v2.2.0

Refine SKILL.md structure, clean publish artifacts, standardize reference-image helper entrypoint, and validate/package the skill for cleaner distribution.

v2.1.0

Polish reference-media workflow, rename the main reference helper to reference_media.py with backward-compatible wrapper support, add nsfw image and preset video options, expand smoke coverage, and refresh public examples/docs.

v2.0.0

Refocus the skill into a function-first media tool. Remove prompt expansion, style routing, retry/prompt heuristics, and prompt-oriented references. Keep only transport, provider mapping, polling, retrieval, and reference-image compatibility workflows.

v1.0.0

Initial release: image generation, image editing, mask inpainting, outpainting, reference-image consistency workflows, short video generation, object-focused edits, batch media workflows, robust media fetching, portability notes, and bundled smoke tests.

Metadata

Slug media-generation

Version 2.2.0

License MIT-0

Total installs 1

Current installs 1

Versions 4

FAQ

What is Media Generation?

Generate images, edit existing images, create short videos, run inpainting/outpainting and object-focused edits, use reference images as provider inputs, bat... It is an AI agent skill plugin for Claude Code / OpenClaw, with 333 total downloads so far.

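How do I install Media Generation?

Run the command "/install media-generation" in the OpenClaw or Claude Code chat box to install it in one step. The install itself needs no extra setup, but as the capability assessment notes, the scripts read a provider baseUrl and apiKey from the OpenClaw config, so a provider must be configured before the skill can call a real provider.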

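Is Media Generation free?

Yes. The skill is released under the MIT-0 license and is free to download, install, and use.

Which platforms does Media Generation support?

Media Generation is cross-platform and can be used in any environment where OpenClaw / Claude Code is deployed.

Who develops Media Generation?

It is developed and maintained by ljz (@lijiazhen0623); the current version is v2.2.0.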