Llava Vision

Name: Llava Vision
Author: 447992399

By Jh-server · GitHub · v1.0.1 · MIT-0

Platforms: linux, darwin, win32 · ⚠ suspicious

Install in OpenClaw

/install llava-vision

Description

Call a local llama.cpp server with the LLaVA model to analyze images.

Usage instructions (SKILL.md)

LLaVA Vision Skill

This skill forwards an image to a locally running llama.cpp server that hosts a LLaVA model and returns the model’s text description of the image. It accepts either a local file path or a remote image URL.

Usage

clawhub llava-vision --image /path/to/photo.jpg
# or
clawhub llava-vision --image https://example.com/photo.jpg

The skill uses the built‑in vision_analyze tool, which expects an image file path. If the image cannot be read or the server is unreachable, an error message will be returned.
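
The error behaviour described above could look roughly like the sketch below. It is only an illustration: `describeError` and the message wording are hypothetical and not taken from the skill's source. It assumes failures surface as Node.js errors carrying a `code` (or, for the built-in `fetch`, a `cause.code`), and it uses the 127.0.0.1:8081 address cited in this listing's security review.

```javascript
// Hypothetical helper: map a low-level Node.js error to a readable message.
// The function name and message texts are assumptions for illustration.
function describeError(err, imagePath) {
  // fs errors expose `code` directly; Node's built-in fetch wraps the
  // network error in `cause`, so check both places.
  const code = err.code ?? err.cause?.code;

  if (code === 'ENOENT') return `Image not found: ${imagePath}`;
  if (code === 'EACCES') return `No permission to read image: ${imagePath}`;
  if (code === 'ECONNREFUSED') {
    return 'llama.cpp server is unreachable at 127.0.0.1:8081';
  }
  return `Image analysis failed: ${err.message}`;
}
```

A caller would wrap the file read and the HTTP request in a single `try`/`catch` and return `describeError(err, imagePath)` from the `catch` branch instead of letting the exception propagate.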

Dependencies

Node.js (the skill itself)
A local llama.cpp server with the LLaVA model exposed at the default endpoint.

Example

$ clawhub run llava-vision --image ./cat.png
The image contains a cat sitting on a windowsill, looking out at a sunny garden.

Security recommendations

This skill appears to do what it says: read an image (local path or remote URL), encode it, and POST it to a local llama.cpp/LLaVA server at 127.0.0.1:8081. Before installing or using:

(1) Ensure you trust the local server (it will receive the raw image bytes).
(2) Avoid passing paths or URLs that contain sensitive data (the skill will read local files and fetch arbitrary URLs you give it).
(3) Be aware the code will fetch remote images from any URL you provide; do not pass internal-management URLs you don't want accessed.
(4) Note a minor interoperability issue: index.js uses require('./tool') while tool.js uses ES export syntax (this may fail on some Node setups).

If you need higher assurance, review or run the code locally in an isolated environment and confirm the local llama server is secure and not forwarding model inputs externally.
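
One way to act on the advice about internal URLs is to screen a URL before handing it to the skill. The sketch below is a hypothetical pre-check, not part of the skill: `isPrivateIPv4` and `isSafeImageUrl` are illustrative names. It is deliberately conservative (it rejects every IPv6 literal) and it does not resolve DNS, so a hostname that resolves to an internal address would still pass.

```javascript
const net = require('node:net');

// True when an IPv4 literal is in a loopback, private, link-local,
// or "this network" range.
function isPrivateIPv4(ip) {
  const [a, b] = ip.split('.').map(Number);
  return (
    a === 0 ||
    a === 10 ||
    a === 127 ||
    (a === 172 && b >= 16 && b <= 31) ||
    (a === 192 && b === 168) ||
    (a === 169 && b === 254)
  );
}

// Hypothetical pre-check: accept only http(s) URLs whose host is not
// obviously internal. Hostnames are NOT resolved, so this is a first
// filter rather than a complete SSRF defence.
function isSafeImageUrl(input) {
  let url;
  try {
    url = new URL(input);
  } catch {
    return false; // not a URL at all (e.g. a local path)
  }
  if (url.protocol !== 'http:' && url.protocol !== 'https:') return false;

  const host = url.hostname;
  if (host === 'localhost' || host.endsWith('.localhost')) return false;
  if (net.isIPv4(host)) return !isPrivateIPv4(host);
  if (host.startsWith('[')) return false; // IPv6 literal, rejected wholesale
  return true;
}
```

For example, `isSafeImageUrl('https://example.com/photo.jpg')` passes, while `isSafeImageUrl('http://192.168.1.1/admin')` and `isSafeImageUrl('file:///etc/passwd')` are rejected.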

Functional analysis

Type: OpenClaw Skill
Name: llava-vision
Version: 1.0.1

The skill is designed to analyze images by sending them to a local llama.cpp server (127.0.0.1:8081). The code in `tool.js` and `index.js` correctly implements the described functionality, allowing users to provide either a local file path or a remote URL. It uses standard Node.js modules and lacks any indicators of malicious intent, such as data exfiltration to external domains, obfuscation, or unauthorized command execution.

Capability assessment

✓ Purpose & Capability

The skill's description says it will call a local llama.cpp server hosting LLaVA; tool.js posts an image (as a data: URI) to http://127.0.0.1:8081/v1/chat/completions using model 'llava'. The ability to accept a local file path or remote image URL matches the SKILL.md.
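
The request described above might be assembled along these lines. This is a sketch, not the contents of tool.js: the helper names, the default prompt, and the default MIME type are assumptions, and the body follows the OpenAI-style chat-completions shape (a `text` part plus an `image_url` part), which the actual code may or may not use. Only the endpoint and the model name 'llava' come from the review. It relies on the global `fetch` available in Node.js 18 and later.

```javascript
// Endpoint and model name as reported in the review above.
const ENDPOINT = 'http://127.0.0.1:8081/v1/chat/completions';

// Wrap raw image bytes in a base64 data: URI.
// The default MIME type is an assumption for this sketch.
function toDataUri(bytes, mimeType = 'image/png') {
  return `data:${mimeType};base64,${Buffer.from(bytes).toString('base64')}`;
}

// Build an OpenAI-style chat-completions body with one text part
// and one image part. The default prompt is hypothetical.
function buildRequest(dataUri, prompt = 'Describe this image.') {
  return {
    model: 'llava',
    messages: [
      {
        role: 'user',
        content: [
          { type: 'text', text: prompt },
          { type: 'image_url', image_url: { url: dataUri } },
        ],
      },
    ],
  };
}

// POST the image to the local server and return the model's text reply.
async function describeImage(bytes, mimeType) {
  const res = await fetch(ENDPOINT, {
    method: 'POST',
    headers: { 'Content-Type': 'application/json' },
    body: JSON.stringify(buildRequest(toDataUri(bytes, mimeType))),
  });
  if (!res.ok) throw new Error(`Server returned HTTP ${res.status}`);
  const json = await res.json();
  return json.choices[0].message.content;
}
```

Keeping `buildRequest` separate from the network call makes the payload easy to inspect or log before anything is sent to the server.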

ℹ Instruction Scope

Instructions and code stay within the skill's stated scope: they read a local image file or fetch a remote image URL, base64-encode it, and POST it to the local server. Two practical notes: (1) the skill performs an outbound fetch() request to whatever remote image URL it is given, so it can connect to any URL you pass; and (2) it reads any local path you provide, so passing a sensitive path exposes that file's contents to the local LLaVA server. The skill itself does not forward data to other remote endpoints, but the local server could, so verify that server's behavior and trustworthiness.
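
The two input paths noted above (local file versus remote URL) can be told apart with a small check before loading the bytes. The following is a minimal sketch under stated assumptions: `isRemoteUrl` and `loadImageBytes` are hypothetical names, and the skill's own dispatch logic may differ. Anything that does not parse as an http(s) URL is treated as a local path.

```javascript
const fs = require('node:fs/promises');

// True when the input parses as an http(s) URL; everything else
// (relative paths, absolute paths, file: URLs) counts as a local path.
function isRemoteUrl(input) {
  try {
    const { protocol } = new URL(input);
    return protocol === 'http:' || protocol === 'https:';
  } catch {
    return false; // new URL() throws on plain paths such as ./cat.png
  }
}

// Load image bytes from either source and return them as a Buffer.
async function loadImageBytes(input) {
  if (isRemoteUrl(input)) {
    const res = await fetch(input); // outbound request to the given URL
    if (!res.ok) throw new Error(`Failed to fetch image: HTTP ${res.status}`);
    return Buffer.from(await res.arrayBuffer());
  }
  return fs.readFile(input); // reads whatever local path was supplied
}
```

Both branches hand back the raw bytes unchanged, which is why the caller remains responsible for deciding which paths and URLs are acceptable.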

✓ Install Mechanism

There is no install spec; the package is instruction/code-only (no downloads or external installers). This is low-risk from an install mechanism perspective.

✓ Credentials

The skill requests no environment variables, no credentials, and no config paths. That is proportional to its purpose.

✓ Persistence & Privilege

The skill is not forced-always, is user-invocable, and does not attempt to modify other skills or system-wide config. It does not request elevated persistence privileges.

How to use

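Make sure OpenClaw is installed (locally or via Docker).
Enter the install command in the chat box: /install llava-vision
Once installation completes, invoke the Skill by name or trigger it with /llava-vision.
Provide the required inputs as described in the Skill's parameters to get structured output.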

Version history

v1.0.1

- Skill name changed from "llava-vision" to "llava-vision-local"
- No other functional or usage changes documented

v1.0.0

Initial release of llava-vision skill.
- Sends local image files or remote image URLs to a local llama.cpp server running a LLaVA model for image analysis.
- Returns a text description of the image as provided by the model.
- Supports Linux, macOS, and Windows platforms.
- Requires a locally running llama.cpp server with the LLaVA model and Node.js.

Metadata

Slug llava-vision

Version 1.0.1

License MIT-0

Total installs 1

Current installs 1

Version count 2

FAQ

What is Llava Vision?

Llava Vision calls a local llama.cpp server with the LLaVA model to analyze images. It is an AI Agent Skill plugin for Claude Code / OpenClaw, with 84 total downloads to date.

How do I install Llava Vision?

Run the command "/install llava-vision" in the OpenClaw or Claude Code chat box for a one-step install; no additional configuration is required.

Is Llava Vision free?

Yes. Llava Vision is completely free and released under the MIT-0 license; you can download, install, and use it freely.

Which platforms does Llava Vision support?

Llava Vision is cross-platform and runs in any environment where OpenClaw / Claude Code is deployed (linux, darwin, win32).

Who developed Llava Vision?

Llava Vision is developed and maintained by Jh-server (@447992399); the current version is v1.0.1.