openocr-skill

Name: openocr-skill
Author: topdu

Description

Extract text from images, documents, and scanned PDFs using OpenOCR, a lightweight and efficient OCR system whose document parsing model requires only 0.1B parameters and can run recognition on personal PCs. It supports text detection, recognition, universal VLM recognition, and document parsing with layout analysis.

Usage Guidance

This skill appears to do what it claims (local OCR and document parsing). Before installing or using it: (1) confirm that you have a Python environment and the required ML runtimes (ONNX/Torch), or be prepared to install them; (2) expect the skill to auto-download model weights, so check the OpenOCR project's GitHub repository and verify the model download URLs and file integrity, or supply model files yourself to avoid unexpected network fetches; (3) be aware that the agent will need permission to read the image/PDF files you give it and to write cache and model files to disk; and (4) if you need higher assurance, ask the publisher for an explicit install spec and the exact model-hosting URLs, or use only local model paths.
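One way to act on point (2) is to compare a model file's SHA-256 digest against a value you obtained from a source you trust. The following is a minimal sketch using only the Python standard library. The function names, the file path, and the expected digest are hypothetical; this listing does not state that OpenOCR publishes checksums, so treat the expected value as something you must source yourself.

```python
import hashlib
from pathlib import Path


def sha256_of_file(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the hex SHA-256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        # Read fixed-size chunks so large weight files are not loaded at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_model_file(path: Path, expected_sha256: str) -> bool:
    """Return True only if the file exists and its digest matches.

    `expected_sha256` is a hypothetical value taken from a trusted source.
    """
    if not path.is_file():
        return False
    return sha256_of_file(path) == expected_sha256.strip().lower()
```

A caller would run `verify_model_file(Path("path/to/model"), expected)` before pointing the skill at a local model path, and refuse to proceed if it returns `False`.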

Capability Analysis

Type: OpenClaw Skill

Name: openocr-skill

Version: 0.1.6

The skill bundle is a wrapper for the OpenOCR library, designed for text extraction and document parsing. It explicitly declares the need for `code_execution` and `file_operations` tools, which are used transparently for tasks like processing images/PDFs and saving output files. The `SKILL.md` documentation provides functional instructions and examples, without any evidence of prompt injection attempts, data exfiltration, malicious execution, persistence mechanisms, or obfuscation. While the underlying library's Gradio demos can bind to `0.0.0.0`, this is a documented feature for a demo interface, not indicative of malicious intent within the skill bundle itself.
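Binding to `0.0.0.0` means a server listens on every network interface, not only on the local machine. If you want to flag such a setting before launching a demo, a small check like the one below could help. It is a sketch using only the standard `ipaddress` module; the function name is hypothetical and nothing here is part of OpenOCR or Gradio.

```python
import ipaddress


def is_loopback_only(host: str) -> bool:
    """Return True if `host` is reachable only from the local machine."""
    if host == "localhost":
        return True
    try:
        # 127.0.0.0/8 and ::1 are loopback; 0.0.0.0 is "all interfaces".
        return ipaddress.ip_address(host).is_loopback
    except ValueError:
        # Not a literal IP address, so it cannot be confirmed as loopback.
        return False
```

With this check, `127.0.0.1` and `::1` count as local-only, while `0.0.0.0` does not.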

Capability Assessment

✓ Purpose & Capability

The name and description (OpenOCR OCR, document parsing, VLM recognition) match the instructions: the code examples show initializing OpenOCR for detection, recognition, and document parsing, and working with image/PDF paths. No unrelated credentials, binaries, or config paths are requested.

ℹ Instruction Scope

SKILL.md contains only OCR-related instructions and Python examples. It repeatedly relies on passing image/PDF file paths and on auto-downloading models; it does not instruct the agent to access unrelated system files or secrets. However, the instructions assume the ability to run Python code, read and write files, and fetch model binaries from the network.

ℹ Install Mechanism

This is an instruction-only skill with no install spec. Its examples expect a Python package ('openocr') with optional ONNX/Torch backends, and they include 'auto_download' of models. Because neither an install spec nor a model source is declared, models and dependencies would be downloaded dynamically at runtime. Verify where those downloads come from and that you trust the source.
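Since nothing is installed for you, it can help to check up front which of the expected packages are importable. The sketch below uses only `importlib` from the standard library. The module names in `CANDIDATE_MODULES` are assumptions: `openocr` is the package name given above, while `onnxruntime` and `torch` are the usual import names for the ONNX Runtime and PyTorch backends, which this listing does not spell out.

```python
import importlib.util

# Assumed top-level import names; adjust to match your environment.
CANDIDATE_MODULES = ["openocr", "onnxruntime", "torch"]


def find_missing_modules(module_names):
    """Return the top-level module names that cannot be found for import."""
    missing = []
    for name in module_names:
        # find_spec returns None when a top-level module is not installed.
        if importlib.util.find_spec(name) is None:
            missing.append(name)
    return missing
```

Calling `find_missing_modules(CANDIDATE_MODULES)` returns the names you would still need to install from a source you trust; an empty list means all of them were found.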

✓ Credentials

The skill requests no environment variables, credentials, or config paths. It only needs file access to user-provided images/PDFs and network access to fetch models (if auto_download is used), which is proportional to OCR functionality.

✓ Persistence & Privilege

The skill does not ask to be always-enabled, does not declare persistence or modify other skills, and requires no long-lived credentials. It does instruct writing/reading model and cache files locally, which is typical for model-based tools.

Version History

v0.1.6

No file changes detected in this version.

- Version and metadata remain unchanged.
- No updates or modifications to functionality or documentation.

v0.1.5

- Improved SKILL.md documentation with detailed configuration, usage examples, and command-line instructions.
- Expanded task descriptions and parameter options for all supported OpenOCR task types.
- Added ready-to-use code snippets for text detection, recognition, end-to-end OCR, universal recognition, and document parsing.
- Included new instructions for saving outputs in multiple formats (Markdown, JSON, visualization).
- Provided clear guidance for both image and PDF input handling.

Metadata

Slug openocr-skill

Version 0.1.6

License —

All-time Installs 10

Active Installs 10

Total Versions 2

Frequently Asked Questions

What is openocr-skill?

openocr-skill extracts text from images, documents, and scanned PDFs using OpenOCR, a lightweight and efficient OCR system whose document parsing model requires only 0.1B parameters and can run recognition on personal PCs. It supports text detection, recognition, universal VLM recognition, and document parsing with layout analysis. It is an AI Agent Skill for Claude Code / OpenClaw, with 1752 downloads so far.

How do I install openocr-skill?

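Run "/install openocr-skill" in the OpenClaw or Claude Code chat to install it in one step. The skill itself has no install spec, so the Python environment, the 'openocr' package, and the ONNX/Torch backends that its examples expect must already be present or be installed separately.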

Is openocr-skill free?

Yes, openocr-skill is completely free (open-source). You can download, install and use it at no cost.

Which platforms does openocr-skill support?

openocr-skill is cross-platform and runs anywhere OpenClaw or Claude Code is available.

Who created openocr-skill?

It is built and maintained by topduke (@topdu); the current version is v0.1.6.
