← 返回 Skills 市场

Ollama OCR

Name: Ollama OCR
Author: hongjiahao371-pixel

作者 hongjiahao371-pixel · GitHub ↗ · v1.0.0 · MIT-0

cross-platform ✓ 安全检测通过

283

总下载

当前安装

版本数

在 OpenClaw 中安装

/install ollama-ocr

功能描述

Use Ollama's vision/OCR models to recognize text from images. Supports glm-ocr, llava, moondream, and llama3.2-vision models. Ideal when you need local offli...

安全使用建议

This skill appears to do exactly what it claims: read an image and send it to an Ollama instance for OCR. Before installing/using, verify you run a trusted Ollama instance at the configured host/port (default 172.17.0.2:11434) — if that endpoint is remote or controlled by someone else, your images would be sent off-host. Note the script hardcodes OLLAMA_HOST/PORT; if your Ollama runs elsewhere you must edit the script (or add environment-variable support) to point it to your instance. No credentials are requested by the skill, but be mindful of the images you pass in (they are transmitted to the configured Ollama HTTP endpoint).

功能分析

Type: OpenClaw Skill Name: ollama-ocr Version: 1.0.0 The ollama-ocr skill is a straightforward implementation for local image text recognition using Ollama. The Python script (ollama_ocr.py) uses standard libraries to communicate with a local API endpoint (172.17.0.2) and contains no evidence of data exfiltration, malicious execution, or obfuscation.

能力评估

✓ Purpose & Capability

Name/description (local Ollama OCR) match the code and SKILL.md: the script encodes an image and POSTs it to an Ollama /api/chat endpoint for model-based OCR. No unrelated services, binaries, or credentials are requested.

✓ Instruction Scope

SKILL.md and the Python script limit actions to reading the specified image file and sending it to the Ollama HTTP API; there are no instructions to read other files, environment secrets, system configs, or transmit data to unexpected endpoints. The SKILL.md correctly documents the default endpoint used by the script.

✓ Install Mechanism

This is an instruction-only skill with a small Python script and no install spec. Nothing is downloaded or written to disk by an installer; risk from installation is minimal.

✓ Credentials

The skill requests no environment variables, credentials, or config paths. It relies on an accessible Ollama HTTP endpoint; this is proportional to its purpose. (Note: the endpoint is hardcoded in the script rather than read from an environment variable.)

✓ Persistence & Privilege

The skill does not request persistent/autonomous privileges, does not set always:true, and does not modify other skills or system-wide settings.

如何使用

确保已安装 OpenClaw（本地或 Docker 部署）
在对话框中输入安装命令：/install ollama-ocr
安装完成后，直接呼叫该 Skill 的名称或使用 /ollama-ocr 触发
根据 Skill 的参数说明提供必要输入，即可获得结构化输出

版本历史

v1.0.0

Initial release - local OCR with Ollama vision models

元数据

Slug ollama-ocr

版本 1.0.0

许可证 MIT-0

累计安装 2

当前安装数 2

历史版本数 1

常见问题

Ollama OCR 是什么？

Use Ollama's vision/OCR models to recognize text from images. Supports glm-ocr, llava, moondream, and llama3.2-vision models. Ideal when you need local offli... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件，目前累计下载 283 次。

如何安装 Ollama OCR？

在 OpenClaw 或 Claude Code 对话框中运行命令「/install ollama-ocr」即可一键安装，无需额外配置。

Ollama OCR 是免费的吗？

是的，Ollama OCR 完全免费，采用 MIT-0 许可证，可自由下载、安装和使用。

Ollama OCR 支持哪些平台？

Ollama OCR 跨平台运行，可在任意部署了 OpenClaw / Claude Code 的环境中使用（cross-platform）。

谁开发了 Ollama OCR？

由 hongjiahao371-pixel（@hongjiahao371-pixel）开发并维护，当前版本 v1.0.0。