← Back to Skills Marketplace
kawummuwe-stack

Vision Analyzer

by kawummuwe-stack · GitHub ↗ · v1.0.1 · MIT-0
cross-platform ⚠ suspicious
114
Downloads
0
Stars
0
Active Installs
2
Versions
Install in OpenClaw
/install vision-analyzer
Description
Analyze images using Ollama Cloud's Kimi K2.5 vision capabilities. Use when user wants to describe, understand, or get information about an image. Works with...
Usage Guidance
This skill's code matches its description: it reads a local image and posts it (base64) to Ollama Cloud using OLLAMA_API_KEY. Before installing, verify the skill's provenance (source/homepage missing), confirm you are comfortable sending images (potentially sensitive) to Ollama, and only provide an API key with appropriate, limited scope or a throwaway key for testing. Also ask the publisher to update the registry metadata to declare OLLAMA_API_KEY as a required env var so the permissions are transparent.
Capability Assessment
Purpose & Capability
The skill's name, SKILL.md, and script consistently state it uses Ollama Cloud's Kimi K2.5 model to analyze images — that purpose matches the implementation. However, the registry metadata lists no required environment variables while both SKILL.md and the script require OLLAMA_API_KEY; this metadata omission is an incoherence and reduces transparency. The source/homepage are also missing, which makes provenance unclear.
Instruction Scope
Runtime instructions are narrow and consistent: run the included Python script with a local image path and optionally a prompt. The script only reads the supplied image path and the OLLAMA_API_KEY env var, base64-encodes the image, and posts it to https://ollama.com/api/generate. This is within the stated purpose, but it does transmit the full image (and prompt) to a third party — a privacy exposure users should consider.
Install Mechanism
No install spec is provided (instruction-only plus a small Python script). Nothing downloads arbitrary code at install time and no archives or external install URLs are used. The script will run with the existing python runtime; this is low install risk.
Credentials
The script legitimately requires a single API key (OLLAMA_API_KEY) to call Ollama Cloud, which is proportionate to the stated function. However, the registry metadata incorrectly lists no required env vars — a transparency issue. Also, providing that API key grants the skill (and Ollama) access to any image data you send; users should confirm acceptable data handling and token scope before supplying sensitive images or reuse of high-privilege keys.
Persistence & Privilege
The skill is user-invocable, not always-enabled, and does not attempt to change system settings, other skills, or request elevated privileges. It does not persist credentials or modify other configs.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install vision-analyzer
  3. After installation, invoke the skill by name or use /vision-analyzer
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.1
- Added new documentation outlining how to analyze images with Ollama Cloud's Kimi K2.5 vision model - Explained typical use cases, configuration steps, and supported image formats - Provided command-line examples for running image analysis and asking targeted questions - Clarified API key setup and expected output
v1.0.0
Initial release of Vision Analyzer. - Analyze images using Kimi K2.5 multimodal vision capabilities via Ollama Cloud API. - Supports JPG, PNG, GIF, and WebP image formats. - Use for describing, understanding, or extracting information from images, screenshots, or diagrams. - Simple CLI script: provide image path and optional prompt for tailored analysis. - Returns a natural language description of image content.
Metadata
Slug vision-analyzer
Version 1.0.1
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 2
Frequently Asked Questions

What is Vision Analyzer?

Analyze images using Ollama Cloud's Kimi K2.5 vision capabilities. Use when user wants to describe, understand, or get information about an image. Works with... It is an AI Agent Skill for Claude Code / OpenClaw, with 114 downloads so far.

How do I install Vision Analyzer?

Run "/install vision-analyzer" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Vision Analyzer free?

Yes, Vision Analyzer is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Vision Analyzer support?

Vision Analyzer is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Vision Analyzer?

It is built and maintained by kawummuwe-stack (@kawummuwe-stack); the current version is v1.0.1.

💬 Comments