← 返回 Skills 市场

ollama-vision

Name: ollama-vision
Author: lzm2023

作者 LZM2023 · GitHub ↗ · v1.0.0 · MIT-0

cross-platform ✓ 安全检测通过

367

总下载

当前安装

版本数

在 OpenClaw 中安装

/install ollama-vision

功能描述

本地调用 Ollama qwen3-vl:4b 模型自动压缩并分析图片，支持描述、OCR 文字提取和自定义信息抽取。

安全使用建议

This skill appears to do what it claims: compress local images, call a locally running Ollama qwen3-vl:4b model via the local API, and return text results. Before installing or running: 1) Ensure you trust the Ollama build you run locally (the skill posts image data to http://localhost:11434). 2) Be aware that the first use will invoke 'ollama pull qwen3-vl:4b' to download a large model over the network. 3) The included test script reads %USERPROFILE%\.openclaw\media\inbound to find sample images — this is for testing only; review or edit the path if you do not want that directory scanned. 4) The code has a small quality issue: test_skill.py calls analyze_image but the main analysis function is named analyze_image_api (a bug, not malicious). 5) Run check_env.py first and consider running in an isolated environment if you are cautious about large model downloads. If you need higher assurance, verify the Ollama CLI binary and its network behavior independently and review/modify the test script's file-paths before use.

功能分析

Type: OpenClaw Skill Name: ollama-vision Version: 1.0.0 The ollama-vision skill bundle provides a legitimate tool for local image analysis using the Ollama API and the qwen3-vl:4b model. The implementation in analyze_image.py includes robust features like automatic image compression using Pillow and safe subprocess handling for environment checks. The code communicates exclusively with a local service (localhost:11434) and lacks any indicators of data exfiltration, malicious execution, or prompt injection.

能力评估

✓ Purpose & Capability

Name/description (local Ollama vision analysis) match the included code and SKILL.md. The code calls local Ollama, compresses images, and invokes a local API; no unrelated credentials, binaries, or remote endpoints are requested.

ℹ Instruction Scope

SKILL.md and code confine runtime actions to image compression, local Ollama model checks/pulls, and POSTs to localhost:11434. Note: model download (ollama pull) will perform network I/O to fetch ~2–3GB model data. test_skill.py also attempts to read %USERPROFILE%\.openclaw\media\inbound to locate test images — reasonable for local testing but it does access a user directory (only for tests).

✓ Install Mechanism

No install spec; instruction-only skill. The only network activity is via the Ollama CLI (ollama pull) to download the model — expected for a local model-based skill. There are no obscure download URLs or extracted archives in the skill itself.

ℹ Credentials

The skill requests no environment variables or credentials (good). It writes temporary compressed images to the OS temp directory and the test script scans a local inbound media folder for images; this is proportional for its purpose but users should be aware the test script reads a user path.

✓ Persistence & Privilege

Skill does not request 'always: true', does not modify other skills or agent-wide config, and has no special persistence privileges. Autonomous invocation is allowed by default but is not combined with other concerning privileges.

如何使用

确保已安装 OpenClaw（本地或 Docker 部署）
在对话框中输入安装命令：/install ollama-vision
安装完成后，直接呼叫该 Skill 的名称或使用 /ollama-vision 触发
根据 Skill 的参数说明提供必要输入，即可获得结构化输出

版本历史

v1.0.0

使用 Ollama 本地视觉模型进行图像分析，支持 OCR、文字提取、图像描述等功能。默认使用qwen3-vl:4b模型，记得在ollama下载并运行这个模型。具体功能如下：自动压缩：超过 2MB 的图片会自动压缩后再分析多模式分析：describe（描述）、ocr（文字提取）、extract（自定义提取）临时文件清理：压缩产生的临时文件会自动删除质量优先：优先降低 JPEG 质量，必要时缩小尺寸建议：至少要4-6G显存，方便运行模型 Tags: vision, ollama, image-analysis, 本地

元数据

Slug ollama-vision

版本 1.0.0

许可证 MIT-0

累计安装 0

当前安装数 0

历史版本数 1

常见问题

ollama-vision 是什么？

本地调用 Ollama qwen3-vl:4b 模型自动压缩并分析图片，支持描述、OCR 文字提取和自定义信息抽取。它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件，目前累计下载 367 次。

如何安装 ollama-vision？

在 OpenClaw 或 Claude Code 对话框中运行命令「/install ollama-vision」即可一键安装，无需额外配置。

ollama-vision 是免费的吗？

是的，ollama-vision 完全免费，采用 MIT-0 许可证，可自由下载、安装和使用。

ollama-vision 支持哪些平台？

ollama-vision 跨平台运行，可在任意部署了 OpenClaw / Claude Code 的环境中使用（cross-platform）。

谁开发了 ollama-vision？

由 LZM2023（@lzm2023）开发并维护，当前版本 v1.0.0。