← Back to Skills Marketplace
roamer-remote

OCR with python

by Roamer 徐 · GitHub ↗ · v1.0.0
cross-platform ⚠ suspicious
1591
Downloads
0
Stars
10
Active Installs
1
Versions
Install in OpenClaw
/install ocr-python
Description
Extract Chinese and English text from images and scanned PDFs, including documents like invoices and contracts, using PaddleOCR in Python.
Usage Guidance
This skill appears to be a straightforward OCR helper. Before installing or running: (1) review and run the script in an isolated environment/virtualenv because pip will install large native packages (paddlepaddle) that may require specific platform/driver support; (2) confirm you trust the source (homepage unknown); (3) avoid feeding sensitive documents to unknown third-party tools if you cannot verify where processing occurs — this script runs locally, but dependencies could change behavior if malicious packages are installed; (4) consider running on a disposable VM or container and inspect temporary files under /tmp if you have multi-user concerns.
Capability Analysis
Type: OpenClaw Skill Name: ocr-python Version: 1.0.0 The skill provides OCR functionality using standard libraries. However, the `scripts/ocr.py` script directly uses user-provided file paths for input (`file`) and output (`output`) without explicit sanitization. This creates a vulnerability where a malicious agent or user could potentially read or overwrite arbitrary files on the system by providing crafted paths (e.g., `/etc/passwd` or `../../sensitive.txt`). While this is a significant security risk (arbitrary file read/write), there is no clear evidence of intentional malicious behavior such as data exfiltration, backdoor installation, or prompt injection against the agent, thus classifying it as suspicious rather than malicious.
Capability Assessment
Purpose & Capability
The name/description (OCR for Chinese/English PDFs and images) matches the provided scripts and README. The code imports PaddleOCR and PyMuPDF (fitz) as described, and implements PDF image extraction and image OCR.
Instruction Scope
SKILL.md and scripts/ocr.py limit actions to extracting images from PDFs, running PaddleOCR on images, printing or writing recognized text, and cleaning up temp files. The instructions do not ask the agent to read unrelated system files, transmit data to external endpoints, or access secrets.
Install Mechanism
There is no automated install spec; SKILL.md advises using pip3 to install paddlepaddle and paddleocr. This is an expected, proportionate dependency installation approach for a Python OCR skill (note: pip will fetch packages from PyPI; paddlepaddle is large and platform-dependent).
Credentials
The skill requests no environment variables, credentials, or config paths. The requested runtime imports (paddleocr, fitz) are appropriate for OCR/PDF extraction and are proportional to the stated purpose.
Persistence & Privilege
Skill does not request persistent system privileges, does not set always:true, and does not modify other skills or system-wide agent settings. It runs as a normal invokable tool without elevated persistence.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install ocr-python
  3. After installation, invoke the skill by name or use /ocr-python
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
- Initial release of the OCR skill using PaddleOCR. - Supports text extraction from PDF and image files in both Chinese and English. - Output includes recognized text and confidence scores in JSON format. - Typical use cases covered: scanned PDFs, images, invoices, contracts, and multi-page documents. - Includes quick start guide and dependency installation instructions.
Metadata
Slug ocr-python
Version 1.0.0
License
All-time Installs 10
Active Installs 10
Total Versions 1
Frequently Asked Questions

What is OCR with python?

Extract Chinese and English text from images and scanned PDFs, including documents like invoices and contracts, using PaddleOCR in Python. It is an AI Agent Skill for Claude Code / OpenClaw, with 1591 downloads so far.

How do I install OCR with python?

Run "/install ocr-python" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is OCR with python free?

Yes, OCR with python is completely free (open-source). You can download, install and use it at no cost.

Which platforms does OCR with python support?

OCR with python is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created OCR with python?

It is built and maintained by Roamer 徐 (@roamer-remote); the current version is v1.0.0.

💬 Comments