topdu

opencr-skill

by topduke · GitHub ↗ · v0.1.4
cross-platform ✓ Security Clean
Downloads: 844
Stars: 0
Active Installs: 1
Versions: 1
Install in OpenClaw
/install opencr-skill
Description
Extract text from images, documents and scanned PDFs using OpenOCR - supports text detection, recognition, universal VLM recognition, and document parsing with layout analysis
Usage Guidance
This skill appears to do what it says (OCR/document parsing). Before installing it or letting an agent execute it:
  1. Review the OpenOCR source (the SKILL.md points to a GitHub repo) to ensure you trust the code and the model download sources.
  2. Be aware that the examples imply running Python, installing dependencies (onnx/torch, etc.), and auto-downloading large model files. Run in a sandbox or virtualenv and prefer explicit, verified installs.
  3. Consider disabling auto_download and manually fetching model files from trusted releases, verifying their checksums.
  4. Avoid sending highly sensitive images/documents to unknown network endpoints; confirm whether processing is local or uses remote APIs.
  5. Expect significant CPU/GPU and disk usage for model downloads and inference.
If you want a lower-risk install, ask the skill author for a vetted pip package name, signed releases, and explicit model download URLs with checksums.
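The checksum step in the guidance above can be sketched in a few lines of Python. This is a minimal illustration using only the standard library, not part of the skill or of OpenOCR. The function names `sha256_of` and `verify_model_file` are hypothetical, and the expected digest is assumed to come from a release page you trust.

```python
import hashlib
import hmac
from pathlib import Path


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the hex SHA-256 digest of a file, read in chunks to bound memory use."""
    digest = hashlib.sha256()
    with path.open("rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_model_file(path: Path, expected_sha256: str) -> bool:
    """Compare a file's digest with a checksum published alongside the release.

    Hypothetical helper: the expected value must be obtained separately
    from a source you trust; this function only does the comparison.
    """
    actual = sha256_of(path)
    return hmac.compare_digest(actual, expected_sha256.strip().lower())
```

A caller would refuse to load a manually fetched model file whenever `verify_model_file` returns `False`. Reading in chunks matters here because model files can be large.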
Capability Analysis
Type: OpenClaw Skill
Name: opencr-skill
Version: 0.1.4
The skill bundle is benign. It provides an interface for the OpenOCR library, enabling text extraction and document parsing. The `SKILL.md` clearly outlines its purpose, usage, and installation, declaring necessary tools like `code_execution` and `file_operations` which are consistent with its OCR functionality. There is no evidence of prompt injection, data exfiltration, malicious execution, persistence mechanisms, or obfuscation. The `--share` flag for Gradio demos is a standard feature of Gradio, not indicative of malicious intent within the skill itself.
Capability Assessment
Purpose & Capability
The name, description, and SKILL.md all describe OCR, text detection/recognition, and document parsing using the OpenOCR project. There are no unrelated requested credentials, binaries, or config paths.
Instruction Scope
SKILL.md contains detailed Python usage examples and configuration options and instructs use of OpenOCR features (det/rec/ocr/unirec/doc). It also declares tools like code_execution and file_operations — the agent will be expected to run Python, read/write files, and possibly auto-download model files. The instructions do not ask for unrelated system secrets or unrelated file paths, but they do allow broad actions (installing/using libraries, downloading models, executing code) which are expected for an OCR skill but increase operational exposure.
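One way to contain the package-installation part of that exposure is the virtualenv mentioned in the Usage Guidance. The sketch below uses Python's standard `venv` module. The helper name `create_isolated_env` is hypothetical and nothing here comes from the skill itself. Note that a virtual environment isolates installed packages only; it is not a security sandbox.

```python
import os
import sys
import venv
from pathlib import Path


def create_isolated_env(env_dir: Path, with_pip: bool = True) -> Path:
    """Create a virtual environment and return the path to its Python interpreter.

    Hypothetical helper for illustration. Dependencies installed with the
    returned interpreter stay inside env_dir instead of the system Python.
    """
    venv.create(
        env_dir,
        with_pip=with_pip,
        # Mirror the `python -m venv` default: symlinks everywhere except Windows.
        symlinks=(os.name != "nt"),
    )
    if sys.platform == "win32":
        return env_dir / "Scripts" / "python.exe"
    return env_dir / "bin" / "python"
```

Dependencies would then be installed by invoking the returned interpreter with `-m pip install ...`, after the package names have been reviewed.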
Install Mechanism
There is no explicit install spec (instruction-only), which is lower risk for supply-chain installs. However the SKILL.md implies installing/using a Python package (openocr) and enables auto_download of potentially large models. Those downloads and any extraction/execution are not governed by a provided install spec or checksum verification, so network/model-fetch behavior should be reviewed before allowing execution.
Credentials
The skill requests no environment variables, credentials, or config paths. The settings shown (use_gpu, backend selection, model paths) are proportionate to OCR/modeling tasks and do not request unrelated secrets.
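A pre-flight review of such settings could look like the sketch below. It is an assumption-heavy illustration: `use_gpu` and `auto_download` are names mentioned in this listing, but the `model_path` key, the dict layout, and the function `review_settings` are hypothetical and do not reflect OpenOCR's actual API. Treating an unspecified `auto_download` as enabled is a deliberately conservative choice, not a statement about the library's default.

```python
from pathlib import Path
from typing import Any


def review_settings(settings: dict[str, Any]) -> list[str]:
    """Return human-readable warnings for a hypothetical OCR settings dict."""
    warnings: list[str] = []

    # Assume the worst case (downloads enabled) when the key is absent.
    if settings.get("auto_download", True):
        warnings.append("auto_download is enabled: models may be fetched unverified")

    # `model_path` is a hypothetical key standing in for a local model location.
    model_path = settings.get("model_path")
    if model_path is not None and not Path(model_path).is_file():
        warnings.append(f"model_path does not exist: {model_path}")

    if settings.get("use_gpu", False):
        warnings.append("use_gpu is enabled: expect GPU memory use during inference")

    return warnings
```

An empty list means none of these checks fired; it does not mean the configuration is safe.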
Persistence & Privilege
The skill is not always-enabled and does not request elevated platform privileges. Autonomous invocation is permitted by default (normal). The skill does not declare modifications to other skills or system-wide settings.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install opencr-skill
  3. After installation, invoke the skill by name or use /opencr-skill
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v0.1.4
- Added comprehensive documentation to SKILL.md, detailing usage, configuration, and supported tasks.
- Clarified the skill's capabilities, including text extraction, document parsing, and universal recognition using OpenOCR.
- Included practical code examples and command-line usage for all supported workflows.
- Expanded task descriptions for text detection, recognition, end-to-end OCR, universal recognition, and document parsing.
- Outlined configuration options and best practices for performance and customization.
Metadata
Slug opencr-skill
Version 0.1.4
License
All-time Installs 1
Active Installs 1
Total Versions 1
Frequently Asked Questions

What is opencr-skill?

Extract text from images, documents and scanned PDFs using OpenOCR - supports text detection, recognition, universal VLM recognition, and document parsing with layout analysis. It is an AI Agent Skill for Claude Code / OpenClaw, with 844 downloads so far.

How do I install opencr-skill?

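Run "/install opencr-skill" in the OpenClaw or Claude Code chat to install the skill in one step. The examples in SKILL.md imply further setup when the skill runs: installing Python dependencies (onnx/torch, etc.) and downloading model files.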

Is opencr-skill free?

Yes, opencr-skill is completely free (open-source). You can download, install and use it at no cost.

Which platforms does opencr-skill support?

opencr-skill is cross-platform and runs anywhere OpenClaw / Claude Code is available.

Who created opencr-skill?

It is built and maintained by topduke (@topdu); the current version is v0.1.4.
