← Back to Skills Marketplace

PPIO Multimodal Skill

Name: PPIO Multimodal Skill
Author: ximasadila

by bbear · GitHub ↗ · v0.2.1 · MIT-0

cross-platform ⚠ suspicious

304

Downloads

Stars

Active Installs

Versions

Install in OpenClaw

/install ppio-multimodal

Description

使用 PPIO 执行多模态任务：文生图、图生图、文生视频、图生视频、TTS、STT。适用于：生成图片、生成视频、文字转语音、语音识别。

Usage Guidance

This skill looks like a normal PPIO client, but be cautious before using it with real API keys. Do not paste your API key into chat messages — the SKILL.md explicitly asks the agent to look for 'sk_' tokens in user messages, which could leak credentials into conversation logs or to third parties. Prefer configuring the key via a local config file (~/.ppio/config.json) or a protected environment variable (PPIO_API_KEY). Note the skill manifest did not declare these env/config requirements — ask the author to document required env vars and config paths. If you decide to use it, create an API key with minimal scope and billing limits you can tolerate, avoid sharing keys in messages, and rotate/delete the key after testing.

Capability Analysis

Type: OpenClaw Skill Name: ppio-multimodal Version: 0.2.1 The skill bundle provides a legitimate integration for the PPIO multimodal API, allowing an AI agent to perform text-to-image, video generation, and speech tasks. It follows standard practices for API key management (environment variables and a local config file at ~/.ppio/config.json) and includes detailed instructions for user feedback and error handling. No evidence of data exfiltration, malicious command execution, or harmful prompt injection was found.

Capability Assessment

✓ Purpose & Capability

The skill claims to call PPIO multimodal endpoints and all templates, endpoints, and examples point to https://api.ppio.com and models consistent with that purpose. Needing an API key is expected for this functionality.

⚠ Instruction Scope

The SKILL.md explicitly instructs the agent to check user messages for API keys (sk_ prefix) and to read ~/.ppio/config.json or PPIO_API_KEY. Asking the agent to parse user messages for keys and to accept keys pasted directly in chat increases risk of accidental credential exposure or exfiltration. The instruction to always send a progress message prior to calling the API is benign but prescriptive.

✓ Install Mechanism

Instruction-only skill with no install spec or code to download. This is lowest-risk in terms of disk writes or third-party package installation.

⚠ Credentials

The manifest lists no required env vars or config paths, but SKILL.md relies on PPIO_API_KEY and ~/.ppio/config.json and accepts keys pasted in messages — a mismatch. While the requested secrets are appropriate for the stated purpose (an API key), the skill encourages insecure user behavior (pasting keys into chat) and the manifest does not declare these expected environment/config requirements.

✓ Persistence & Privilege

always:false and default autonomous invocation are used. The skill doesn't request persistent system-wide changes or modify other skills' configs.

How to Use

Make sure OpenClaw is installed (local or Docker)
Run the install command in chat: /install ppio-multimodal
After installation, invoke the skill by name or use /ppio-multimodal
Provide required inputs per the skill's parameter spec and get structured output

Version History

v0.2.1

ppio-multimodal 0.2.1 -修正相关显示问题及连接

v0.2.0

- 新增详细的多模态任务支持文档，覆盖文生图、图生图、文生视频、图生视频、TTS、STT等功能，包含配置与调用说明 - 明确API Key的三种配置方式与优先级，并加入未配置时的用户引导模板 - 规范各类任务的进度提示与回复消息，尤其是视频任务的轮询进度展示 - 列明所有主要API端点、模型及对应的curl调用模板 - 增加常见错误码及处理建议，补充获取Key与定价等资源链接

Metadata

Slug ppio-multimodal

Version 0.2.1

License MIT-0

All-time Installs 1

Active Installs 1

Total Versions 2

Frequently Asked Questions

What is PPIO Multimodal Skill?

使用 PPIO 执行多模态任务：文生图、图生图、文生视频、图生视频、TTS、STT。适用于：生成图片、生成视频、文字转语音、语音识别。 It is an AI Agent Skill for Claude Code / OpenClaw, with 304 downloads so far.

How do I install PPIO Multimodal Skill?

Run "/install ppio-multimodal" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is PPIO Multimodal Skill free?

Yes, PPIO Multimodal Skill is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does PPIO Multimodal Skill support?

PPIO Multimodal Skill is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created PPIO Multimodal Skill?

It is built and maintained by bbear (@ximasadila); the current version is v0.2.1.

More Skills