← Back to Skills Marketplace

opencr-skill

Name: opencr-skill
Author: topdu

by topduke · GitHub ↗ · v0.1.4

cross-platform ✓ Security Clean

844

Downloads

Stars

Active Installs

Versions

Install in OpenClaw

/install opencr-skill

Description

Extract text from images, documents and scanned PDFs using OpenOCR - supports text detection, recognition, universal VLM recognition, and document parsing with layout analysis

Usage Guidance

This skill appears to do what it says (OCR/document parsing). Before installing or letting an agent execute it: 1) review the OpenOCR source (the SKILL.md points to a GitHub repo) to ensure you trust the code and model download sources; 2) be aware the examples imply running Python, installing dependencies (onnx/torch, etc.), and auto-downloading large model files — run in a sandbox or virtualenv and prefer explicit, verified installs; 3) consider disabling auto_download and manually fetch/verify model files (checksums) from trusted releases; 4) avoid sending highly sensitive images/documents to unknown network endpoints — confirm whether processing is local or uses remote APIs; 5) expect significant CPU/GPU and disk usage for model downloads and inference. If you want a lower-risk install, ask the skill author for a vetted pip package name, signed releases, and explicit model download URLs with checksums.

Capability Analysis

Type: OpenClaw Skill Name: opencr-skill Version: 0.1.4 The skill bundle is benign. It provides an interface for the OpenOCR library, enabling text extraction and document parsing. The `SKILL.md` clearly outlines its purpose, usage, and installation, declaring necessary tools like `code_execution` and `file_operations` which are consistent with its OCR functionality. There is no evidence of prompt injection, data exfiltration, malicious execution, persistence mechanisms, or obfuscation. The `--share` flag for Gradio demos is a standard feature of Gradio, not indicative of malicious intent within the skill itself.

Capability Assessment

✓ Purpose & Capability

The name, description, and SKILL.md all describe OCR, text detection/recognition, and document parsing using the OpenOCR project. There are no unrelated requested credentials, binaries, or config paths.

ℹ Instruction Scope

SKILL.md contains detailed Python usage examples and configuration options and instructs use of OpenOCR features (det/rec/ocr/unirec/doc). It also declares tools like code_execution and file_operations — the agent will be expected to run Python, read/write files, and possibly auto-download model files. The instructions do not ask for unrelated system secrets or unrelated file paths, but they do allow broad actions (installing/using libraries, downloading models, executing code) which are expected for an OCR skill but increase operational exposure.

ℹ Install Mechanism

There is no explicit install spec (instruction-only), which is lower risk for supply-chain installs. However the SKILL.md implies installing/using a Python package (openocr) and enables auto_download of potentially large models. Those downloads and any extraction/execution are not governed by a provided install spec or checksum verification, so network/model-fetch behavior should be reviewed before allowing execution.

✓ Credentials

The skill requests no environment variables, credentials, or config paths. The settings shown (use_gpu, backend selection, model paths) are proportionate to OCR/modeling tasks and do not request unrelated secrets.

✓ Persistence & Privilege

The skill is not always-enabled and does not request elevated platform privileges. Autonomous invocation is permitted by default (normal). The skill does not declare modifications to other skills or system-wide settings.

How to Use

Make sure OpenClaw is installed (local or Docker)
Run the install command in chat: /install opencr-skill
After installation, invoke the skill by name or use /opencr-skill
Provide required inputs per the skill's parameter spec and get structured output

Version History

v0.1.4

- Added comprehensive documentation to SKILL.md, detailing usage, configuration, and supported tasks. - Clarified the skill's capabilities, including text extraction, document parsing, and universal recognition using OpenOCR. - Included practical code examples and command-line usage for all supported workflows. - Expanded task descriptions for text detection, recognition, end-to-end OCR, universal recognition, and document parsing. - Outlined configuration options and best practices for performance and customization.

Metadata

Slug opencr-skill

Version 0.1.4

License —

All-time Installs 1

Active Installs 1

Total Versions 1

Frequently Asked Questions

What is opencr-skill?

Extract text from images, documents and scanned PDFs using OpenOCR - supports text detection, recognition, universal VLM recognition, and document parsing with layout analysis. It is an AI Agent Skill for Claude Code / OpenClaw, with 844 downloads so far.

How do I install opencr-skill?

Run "/install opencr-skill" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is opencr-skill free?

Yes, opencr-skill is completely free (open-source). You can download, install and use it at no cost.

Which platforms does opencr-skill support?

opencr-skill is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created opencr-skill?

It is built and maintained by topduke (@topdu); the current version is v0.1.4.

More Skills