← Back to Skills Marketplace
perchouli

Qwen Vision

by perchouli · GitHub ↗ · v0.1.0 · MIT-0
cross-platform ✓ Security Clean
949
Downloads
0
Stars
3
Active Installs
1
Versions
Install in OpenClaw
/install qwen-vision
Description
Analyze images and videos using Qwen Vision API (Alibaba Cloud DashScope). Supports image understanding, OCR, visual reasoning.
Usage Guidance
This skill sends your image (base64-encoded) and prompt to Alibaba Cloud DashScope (Qwen Vision) — you'll need a valid API key. Confirm you trust the external service before sending sensitive images. Note a small mismatch: documentation suggests the key can be read from ~/.openclaw/openclaw.json or DASHSCOPE_API_KEY, but the bundled script requires --api-key (it doesn't auto-read those locations). If you want the convenience of env/config lookup, verify or modify the script yourself. Otherwise, review where you store the API key and avoid passing it in shells or logs if you need secrecy. Lastly, since the skill transmits image data externally, ensure this aligns with your privacy/compliance requirements.
Capability Analysis
Type: OpenClaw Skill Name: qwen-vision Version: 0.1.0 The qwen-vision skill is a legitimate tool for analyzing images via the Alibaba Cloud DashScope API. The core logic in `scripts/analyze_image.py` securely handles image encoding and transmits data only to the official API endpoint (dashscope.aliyuncs.com). The documentation in `SKILL.md` and `README.md` correctly describes the tool's purpose and configuration without any signs of prompt injection, data exfiltration, or malicious intent.
Capability Assessment
Purpose & Capability
Name/description match the actual behavior: the included script base64-encodes an image and posts it to DashScope's Qwen Vision endpoint. The only required binary is python3, which is reasonable.
Instruction Scope
SKILL.md instructs running the shipped script and documents possible locations to obtain the API key (including ~/.openclaw/openclaw.json or DASHSCOPE_API_KEY). The runtime script itself, however, requires an --api-key argument and does not read those config paths or environment variables. The instructions do not direct reading unrelated files or exfiltrating system data.
Install Mechanism
No install spec or remote downloads are present; this is an instruction-only skill with a small included script. Nothing is fetched from external/untrusted URLs during install.
Credentials
The skill legitimately requires a DashScope API key. Registry metadata lists no required env vars, but SKILL.md suggests DASHSCOPE_API_KEY or ~/.openclaw config as sources; the script actually requires an explicit --api-key. This is an implementation/documentation mismatch but the credential requested is proportional to the skill's function.
Persistence & Privilege
Skill is not always-enabled, does not request persistent or elevated system privileges, and does not modify other skills or system-wide configuration.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install qwen-vision
  3. After installation, invoke the skill by name or use /qwen-vision
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v0.1.0
- Initial release of qwen-vision skill - Analyze images and videos using Qwen Vision API (Alibaba Cloud DashScope) - Supports image understanding, OCR, and visual reasoning tasks - Easily select between max and plus models - Flexible API key setup via config file or environment variable - Includes prompt examples and usage instructions for CLI integration
Metadata
Slug qwen-vision
Version 0.1.0
License MIT-0
All-time Installs 3
Active Installs 3
Total Versions 1
Frequently Asked Questions

What is Qwen Vision?

Analyze images and videos using Qwen Vision API (Alibaba Cloud DashScope). Supports image understanding, OCR, visual reasoning. It is an AI Agent Skill for Claude Code / OpenClaw, with 949 downloads so far.

How do I install Qwen Vision?

Run "/install qwen-vision" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Qwen Vision free?

Yes, Qwen Vision is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Qwen Vision support?

Qwen Vision is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Qwen Vision?

It is built and maintained by perchouli (@perchouli); the current version is v0.1.0.

💬 Comments