← Back to Skills Marketplace

OCR with python

Name: OCR with python
Author: roamer-remote

by Roamer 徐 · GitHub ↗ · v1.0.0

cross-platform ⚠ suspicious

1591

Downloads

Stars

Active Installs

Versions

Install in OpenClaw

/install ocr-python

Description

Extract Chinese and English text from images and scanned PDFs, including documents like invoices and contracts, using PaddleOCR in Python.

Usage Guidance

This skill appears to be a straightforward OCR helper. Before installing or running: (1) review and run the script in an isolated environment/virtualenv because pip will install large native packages (paddlepaddle) that may require specific platform/driver support; (2) confirm you trust the source (homepage unknown); (3) avoid feeding sensitive documents to unknown third-party tools if you cannot verify where processing occurs — this script runs locally, but dependencies could change behavior if malicious packages are installed; (4) consider running on a disposable VM or container and inspect temporary files under /tmp if you have multi-user concerns.

Capability Analysis

Type: OpenClaw Skill Name: ocr-python Version: 1.0.0 The skill provides OCR functionality using standard libraries. However, the `scripts/ocr.py` script directly uses user-provided file paths for input (`file`) and output (`output`) without explicit sanitization. This creates a vulnerability where a malicious agent or user could potentially read or overwrite arbitrary files on the system by providing crafted paths (e.g., `/etc/passwd` or `../../sensitive.txt`). While this is a significant security risk (arbitrary file read/write), there is no clear evidence of intentional malicious behavior such as data exfiltration, backdoor installation, or prompt injection against the agent, thus classifying it as suspicious rather than malicious.

Capability Assessment

✓ Purpose & Capability

The name/description (OCR for Chinese/English PDFs and images) matches the provided scripts and README. The code imports PaddleOCR and PyMuPDF (fitz) as described, and implements PDF image extraction and image OCR.

✓ Instruction Scope

SKILL.md and scripts/ocr.py limit actions to extracting images from PDFs, running PaddleOCR on images, printing or writing recognized text, and cleaning up temp files. The instructions do not ask the agent to read unrelated system files, transmit data to external endpoints, or access secrets.

✓ Install Mechanism

There is no automated install spec; SKILL.md advises using pip3 to install paddlepaddle and paddleocr. This is an expected, proportionate dependency installation approach for a Python OCR skill (note: pip will fetch packages from PyPI; paddlepaddle is large and platform-dependent).

✓ Credentials

The skill requests no environment variables, credentials, or config paths. The requested runtime imports (paddleocr, fitz) are appropriate for OCR/PDF extraction and are proportional to the stated purpose.

✓ Persistence & Privilege

Skill does not request persistent system privileges, does not set always:true, and does not modify other skills or system-wide agent settings. It runs as a normal invokable tool without elevated persistence.

How to Use

Make sure OpenClaw is installed (local or Docker)
Run the install command in chat: /install ocr-python
After installation, invoke the skill by name or use /ocr-python
Provide required inputs per the skill's parameter spec and get structured output

Version History

v1.0.0

- Initial release of the OCR skill using PaddleOCR. - Supports text extraction from PDF and image files in both Chinese and English. - Output includes recognized text and confidence scores in JSON format. - Typical use cases covered: scanned PDFs, images, invoices, contracts, and multi-page documents. - Includes quick start guide and dependency installation instructions.

Metadata

Slug ocr-python

Version 1.0.0

License —

All-time Installs 10

Active Installs 10

Total Versions 1

Frequently Asked Questions

What is OCR with python?

Extract Chinese and English text from images and scanned PDFs, including documents like invoices and contracts, using PaddleOCR in Python. It is an AI Agent Skill for Claude Code / OpenClaw, with 1591 downloads so far.

How do I install OCR with python?

Run "/install ocr-python" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is OCR with python free?

Yes, OCR with python is completely free (open-source). You can download, install and use it at no cost.

Which platforms does OCR with python support?

OCR with python is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created OCR with python?

It is built and maintained by Roamer 徐 (@roamer-remote); the current version is v1.0.0.

More Skills