← Back to Skills Marketplace
SiliconFlow 多模态服务,支持图片生成(FLUX/Qwen)、视频生成(Wan)、TTS语音合成、ASR语音识别。使用代金券支付。
by
Sheldon.li
· GitHub ↗
· v1.0.0
552
Downloads
3
Stars
4
Active Installs
1
Versions
Install in OpenClaw
/install siliconflow-media
Description
SiliconFlow 多模态服务,支持图片生成(FLUX/Qwen)、视频生成(Wan)、TTS语音合成、ASR语音识别。使用代金券支付。
Usage Guidance
This skill is internally consistent, but before installing: (1) confirm you trust the siliconflow.cn service and the API key you provide (the key gives the service access to use your voucher balance and process uploaded files); (2) be aware that any audio/images you pass will be uploaded to SiliconFlow servers; (3) ensure your runtime has the dependencies (requests, pillow) installed; and (4) if you need stricter privacy, avoid sending sensitive audio/images or use a vetted/private model provider instead.
Capability Analysis
Type: OpenClaw Skill
Name: siliconflow-media
Version: 1.0.0
The skill bundle exhibits several vulnerabilities related to file path handling. The `scripts/asr.py` and `scripts/video_gen.py` scripts read user-specified input files (`--audio`, `--image`) and send their content to an external API (`api.siliconflow.cn`). This creates an arbitrary file read vulnerability, potentially leading to data exfiltration if an attacker can prompt the agent to read sensitive files (e.g., `/etc/passwd`, `~/.ssh/id_rsa`). Additionally, `scripts/image_gen.py`, `scripts/tts.py`, and `scripts/video_gen.py` write to user-specified output files (`--filename`). This creates an arbitrary file write vulnerability, potentially leading to persistence or privilege escalation (e.g., writing to `/etc/cron.d/malicious_job`) if the agent is prompted to write to sensitive system paths. These vulnerabilities stem from a lack of input sanitization for file paths, making the skill susceptible to prompt injection attacks against the agent, despite the scripts themselves not exhibiting malicious intent.
Capability Assessment
Purpose & Capability
Name/description (multimodal media) match the included scripts (image, video, TTS, ASR). The single required env var (SILICONFLOW_API_KEY) is the API credential you would expect for a hosted media API. No unrelated binaries, config paths, or extra credentials are requested.
Instruction Scope
SKILL.md instructs running the provided scripts and those scripts only: (1) read user-supplied files (audio/image) when appropriate, (2) POST to https://api.siliconflow.cn endpoints, and (3) download returned media URLs. There are no instructions to read unrelated files, scan system state, or send data to third parties beyond the siliconflow endpoints. Users should note that any files you pass (audio/images) are uploaded to the SiliconFlow service.
Install Mechanism
No install spec (instruction-only) is present; inclusion is low-risk. The repository contains runnable scripts but no automated downloads or external installers. The scripts list dependencies (requests, pillow) in comments — these are normal but may need to be installed in your runtime environment.
Credentials
Only SILICONFLOW_API_KEY is required and is used consistently as a Bearer token when calling the siliconflow API. No other secrets or unrelated environment variables are requested or accessed by the scripts.
Persistence & Privilege
The skill does not request permanent/always-on privileges (always: false). It does not modify other skills or system-wide configs. Autonomous invocation is allowed by default but not combined with other concerning factors here.
How to Use
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat:
/install siliconflow-media - After installation, invoke the skill by name or use
/siliconflow-media - Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
- Initial release of the siliconflow-media skill.
- Supports multi-modal AI services: image generation (FLUX/Qwen), video generation (Wan), TTS (text-to-speech), and ASR (automatic speech recognition).
- Payment via vouchers supported (current balance: 3000+).
- Includes clear command-line usage examples for all features.
- Requires the SILICONFLOW_API_KEY environment variable.
Metadata
Frequently Asked Questions
What is SiliconFlow 多模态服务,支持图片生成(FLUX/Qwen)、视频生成(Wan)、TTS语音合成、ASR语音识别。使用代金券支付。?
SiliconFlow 多模态服务,支持图片生成(FLUX/Qwen)、视频生成(Wan)、TTS语音合成、ASR语音识别。使用代金券支付。 It is an AI Agent Skill for Claude Code / OpenClaw, with 552 downloads so far.
How do I install SiliconFlow 多模态服务,支持图片生成(FLUX/Qwen)、视频生成(Wan)、TTS语音合成、ASR语音识别。使用代金券支付。?
Run "/install siliconflow-media" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.
Is SiliconFlow 多模态服务,支持图片生成(FLUX/Qwen)、视频生成(Wan)、TTS语音合成、ASR语音识别。使用代金券支付。 free?
Yes, SiliconFlow 多模态服务,支持图片生成(FLUX/Qwen)、视频生成(Wan)、TTS语音合成、ASR语音识别。使用代金券支付。 is completely free (open-source). You can download, install and use it at no cost.
Which platforms does SiliconFlow 多模态服务,支持图片生成(FLUX/Qwen)、视频生成(Wan)、TTS语音合成、ASR语音识别。使用代金券支付。 support?
SiliconFlow 多模态服务,支持图片生成(FLUX/Qwen)、视频生成(Wan)、TTS语音合成、ASR语音识别。使用代金券支付。 is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).
Who created SiliconFlow 多模态服务,支持图片生成(FLUX/Qwen)、视频生成(Wan)、TTS语音合成、ASR语音识别。使用代金券支付。?
It is built and maintained by Sheldon.li (@axdlee); the current version is v1.0.0.
More Skills