← Back to Skills Marketplace
304
Downloads
1
Stars
1
Active Installs
2
Versions
Install in OpenClaw
/install ppio-multimodal
Description
使用 PPIO 执行多模态任务:文生图、图生图、文生视频、图生视频、TTS、STT。 适用于:生成图片、生成视频、文字转语音、语音识别。
Usage Guidance
This skill looks like a normal PPIO client, but be cautious before using it with real API keys. Do not paste your API key into chat messages — the SKILL.md explicitly asks the agent to look for 'sk_' tokens in user messages, which could leak credentials into conversation logs or to third parties. Prefer configuring the key via a local config file (~/.ppio/config.json) or a protected environment variable (PPIO_API_KEY). Note the skill manifest did not declare these env/config requirements — ask the author to document required env vars and config paths. If you decide to use it, create an API key with minimal scope and billing limits you can tolerate, avoid sharing keys in messages, and rotate/delete the key after testing.
Capability Analysis
Type: OpenClaw Skill
Name: ppio-multimodal
Version: 0.2.1
The skill bundle provides a legitimate integration for the PPIO multimodal API, allowing an AI agent to perform text-to-image, video generation, and speech tasks. It follows standard practices for API key management (environment variables and a local config file at ~/.ppio/config.json) and includes detailed instructions for user feedback and error handling. No evidence of data exfiltration, malicious command execution, or harmful prompt injection was found.
Capability Assessment
Purpose & Capability
The skill claims to call PPIO multimodal endpoints and all templates, endpoints, and examples point to https://api.ppio.com and models consistent with that purpose. Needing an API key is expected for this functionality.
Instruction Scope
The SKILL.md explicitly instructs the agent to check user messages for API keys (sk_ prefix) and to read ~/.ppio/config.json or PPIO_API_KEY. Asking the agent to parse user messages for keys and to accept keys pasted directly in chat increases risk of accidental credential exposure or exfiltration. The instruction to always send a progress message prior to calling the API is benign but prescriptive.
Install Mechanism
Instruction-only skill with no install spec or code to download. This is lowest-risk in terms of disk writes or third-party package installation.
Credentials
The manifest lists no required env vars or config paths, but SKILL.md relies on PPIO_API_KEY and ~/.ppio/config.json and accepts keys pasted in messages — a mismatch. While the requested secrets are appropriate for the stated purpose (an API key), the skill encourages insecure user behavior (pasting keys into chat) and the manifest does not declare these expected environment/config requirements.
Persistence & Privilege
always:false and default autonomous invocation are used. The skill doesn't request persistent system-wide changes or modify other skills' configs.
How to Use
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat:
/install ppio-multimodal - After installation, invoke the skill by name or use
/ppio-multimodal - Provide required inputs per the skill's parameter spec and get structured output
Version History
v0.2.1
ppio-multimodal 0.2.1
-修正相关显示问题及连接
v0.2.0
- 新增详细的多模态任务支持文档,覆盖文生图、图生图、文生视频、图生视频、TTS、STT等功能,包含配置与调用说明
- 明确API Key的三种配置方式与优先级,并加入未配置时的用户引导模板
- 规范各类任务的进度提示与回复消息,尤其是视频任务的轮询进度展示
- 列明所有主要API端点、模型及对应的curl调用模板
- 增加常见错误码及处理建议,补充获取Key与定价等资源链接
Metadata
Frequently Asked Questions
What is PPIO Multimodal Skill?
使用 PPIO 执行多模态任务:文生图、图生图、文生视频、图生视频、TTS、STT。 适用于:生成图片、生成视频、文字转语音、语音识别。 It is an AI Agent Skill for Claude Code / OpenClaw, with 304 downloads so far.
How do I install PPIO Multimodal Skill?
Run "/install ppio-multimodal" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.
Is PPIO Multimodal Skill free?
Yes, PPIO Multimodal Skill is completely free, licensed under MIT-0. You can download, install and use it at no cost.
Which platforms does PPIO Multimodal Skill support?
PPIO Multimodal Skill is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).
Who created PPIO Multimodal Skill?
It is built and maintained by bbear (@ximasadila); the current version is v0.2.1.
More Skills