openocr-skill

Name: openocr-skill
Author: topdu

功能描述

Extract text from images, documents and scanned PDFs using OpenOCR - a lightweight and efficient OCR system with document parsing model requiring only 0.1B parameters, capable of running recognition on personal PCs. Supports text detection, recognition, universal VLM recognition, and document parsing with layout analysis

安全使用建议

This skill appears to do what it claims (local OCR and document parsing). Before installing/using it: (1) confirm you have a Python environment and the required ML runtimes (ONNX/Torch) or be prepared to install them; (2) expect the skill to auto-download model weights — check the OpenOCR project's GitHub and verify the model download URLs and integrity or supply model files yourself to avoid unexpected network fetches; (3) be aware the agent will need permission to read the image/PDF files you give it and to write cache/model files to disk; and (4) if you need higher assurance, ask the publisher for an explicit install spec and the exact model-hosting URLs (or use only local model paths).

功能分析

Type: OpenClaw Skill Name: openocr-skill Version: 0.1.6 The skill bundle is a wrapper for the OpenOCR library, designed for text extraction and document parsing. It explicitly declares the need for `code_execution` and `file_operations` tools, which are used transparently for tasks like processing images/PDFs and saving output files. The `SKILL.md` documentation provides functional instructions and examples, without any evidence of prompt injection attempts, data exfiltration, malicious execution, persistence mechanisms, or obfuscation. While the underlying library's Gradio demos can bind to `0.0.0.0`, this is a documented feature for a demo interface, not indicative of malicious intent within the skill bundle itself.

能力评估

✓ Purpose & Capability

The name/description (OpenOCR OCR, document parsing, VLM recognition) matches the instructions: code examples show initializing OpenOCR for detection/recognition/doc parsing and working with image/pdf paths. There are no unrelated credentials, binaries, or config paths requested.

ℹ Instruction Scope

SKILL.md contains only OCR-related instructions and Python examples. It repeatedly relies on passing image/PDF file paths and on auto-downloading models; it does not instruct accessing unrelated system files or secrets. However, the instructions assume ability to run Python code, read/write files, and fetch model binaries from the network.

ℹ Install Mechanism

This is an instruction-only skill with no install spec. The skill examples expect a Python package ('openocr') and optional ONNX/Torch backends and include 'auto_download' of models. Because no install/source-of-models is declared, model and dependency downloads would be performed dynamically by the runtime — verify where those downloads come from and that you trust the source.

✓ Credentials

The skill requests no environment variables, credentials, or config paths. It only needs file access to user-provided images/PDFs and network access to fetch models (if auto_download is used), which is proportional to OCR functionality.

✓ Persistence & Privilege

The skill does not ask to be always-enabled, does not declare persistence or modify other skills, and requires no long-lived credentials. It does instruct writing/reading model and cache files locally, which is typical for model-based tools.

版本历史

v0.1.6

No file changes detected in this version. - Version and metadata remain unchanged. - No updates or modifications to functionality or documentation.

v0.1.5

- Improved SKILL.md documentation with detailed configuration, usage examples, and command-line instructions. - Expanded task descriptions and parameter options for all supported OpenOCR task types. - Added ready-to-use code snippets for text detection, recognition, end-to-end OCR, universal recognition, and document parsing. - Included new instructions for saving outputs in multiple formats (Markdown, JSON, visualization). - Provided clear guidance for both image and PDF input handling.

元数据

Slug openocr-skill

版本 0.1.6

许可证 —

累计安装 10

当前安装数 10

历史版本数 2

常见问题

openocr-skill 是什么？

Extract text from images, documents and scanned PDFs using OpenOCR - a lightweight and efficient OCR system with document parsing model requiring only 0.1B parameters, capable of running recognition on personal PCs. Supports text detection, recognition, universal VLM recognition, and document parsing with layout analysis. 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件，目前累计下载 1752 次。

如何安装 openocr-skill？

在 OpenClaw 或 Claude Code 对话框中运行命令「/install openocr-skill」即可一键安装，无需额外配置。

openocr-skill 是免费的吗？

是的，openocr-skill 完全免费（开源免费），可自由下载、安装和使用。

openocr-skill 支持哪些平台？

openocr-skill 跨平台运行，可在任意部署了 OpenClaw / Claude Code 的环境中使用（cross-platform）。

谁开发了 openocr-skill？

由 topduke（@topdu）开发并维护，当前版本 v0.1.6。

openocr-skill 是什么？

如何安装 openocr-skill？

openocr-skill 是免费的吗？

openocr-skill 支持哪些平台？

谁开发了 openocr-skill？

💬 留言讨论