← Back to Skills Marketplace
topdu

openocr-skill

by topduke · GitHub ↗ · v0.1.6
cross-platform ✓ Security Clean
1752
Downloads
0
Stars
10
Active Installs
2
Versions
Install in OpenClaw
/install openocr-skill
Description
Extract text from images, documents and scanned PDFs using OpenOCR - a lightweight and efficient OCR system with document parsing model requiring only 0.1B parameters, capable of running recognition on personal PCs. Supports text detection, recognition, universal VLM recognition, and document parsing with layout analysis
Usage Guidance
This skill appears to do what it claims (local OCR and document parsing). Before installing/using it: (1) confirm you have a Python environment and the required ML runtimes (ONNX/Torch) or be prepared to install them; (2) expect the skill to auto-download model weights — check the OpenOCR project's GitHub and verify the model download URLs and integrity or supply model files yourself to avoid unexpected network fetches; (3) be aware the agent will need permission to read the image/PDF files you give it and to write cache/model files to disk; and (4) if you need higher assurance, ask the publisher for an explicit install spec and the exact model-hosting URLs (or use only local model paths).
Capability Analysis
Type: OpenClaw Skill Name: openocr-skill Version: 0.1.6 The skill bundle is a wrapper for the OpenOCR library, designed for text extraction and document parsing. It explicitly declares the need for `code_execution` and `file_operations` tools, which are used transparently for tasks like processing images/PDFs and saving output files. The `SKILL.md` documentation provides functional instructions and examples, without any evidence of prompt injection attempts, data exfiltration, malicious execution, persistence mechanisms, or obfuscation. While the underlying library's Gradio demos can bind to `0.0.0.0`, this is a documented feature for a demo interface, not indicative of malicious intent within the skill bundle itself.
Capability Assessment
Purpose & Capability
The name/description (OpenOCR OCR, document parsing, VLM recognition) matches the instructions: code examples show initializing OpenOCR for detection/recognition/doc parsing and working with image/pdf paths. There are no unrelated credentials, binaries, or config paths requested.
Instruction Scope
SKILL.md contains only OCR-related instructions and Python examples. It repeatedly relies on passing image/PDF file paths and on auto-downloading models; it does not instruct accessing unrelated system files or secrets. However, the instructions assume ability to run Python code, read/write files, and fetch model binaries from the network.
Install Mechanism
This is an instruction-only skill with no install spec. The skill examples expect a Python package ('openocr') and optional ONNX/Torch backends and include 'auto_download' of models. Because no install/source-of-models is declared, model and dependency downloads would be performed dynamically by the runtime — verify where those downloads come from and that you trust the source.
Credentials
The skill requests no environment variables, credentials, or config paths. It only needs file access to user-provided images/PDFs and network access to fetch models (if auto_download is used), which is proportional to OCR functionality.
Persistence & Privilege
The skill does not ask to be always-enabled, does not declare persistence or modify other skills, and requires no long-lived credentials. It does instruct writing/reading model and cache files locally, which is typical for model-based tools.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install openocr-skill
  3. After installation, invoke the skill by name or use /openocr-skill
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v0.1.6
No file changes detected in this version. - Version and metadata remain unchanged. - No updates or modifications to functionality or documentation.
v0.1.5
- Improved SKILL.md documentation with detailed configuration, usage examples, and command-line instructions. - Expanded task descriptions and parameter options for all supported OpenOCR task types. - Added ready-to-use code snippets for text detection, recognition, end-to-end OCR, universal recognition, and document parsing. - Included new instructions for saving outputs in multiple formats (Markdown, JSON, visualization). - Provided clear guidance for both image and PDF input handling.
Metadata
Slug openocr-skill
Version 0.1.6
License
All-time Installs 10
Active Installs 10
Total Versions 2
Frequently Asked Questions

What is openocr-skill?

Extract text from images, documents and scanned PDFs using OpenOCR - a lightweight and efficient OCR system with document parsing model requiring only 0.1B parameters, capable of running recognition on personal PCs. Supports text detection, recognition, universal VLM recognition, and document parsing with layout analysis. It is an AI Agent Skill for Claude Code / OpenClaw, with 1752 downloads so far.

How do I install openocr-skill?

Run "/install openocr-skill" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is openocr-skill free?

Yes, openocr-skill is completely free (open-source). You can download, install and use it at no cost.

Which platforms does openocr-skill support?

openocr-skill is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created openocr-skill?

It is built and maintained by topduke (@topdu); the current version is v0.1.6.

💬 Comments