← 返回 Skills 市场
1591
总下载
0
收藏
10
当前安装
1
版本数
在 OpenClaw 中安装
/install ocr-python
功能描述
Extract Chinese and English text from images and scanned PDFs, including documents like invoices and contracts, using PaddleOCR in Python.
安全使用建议
This skill appears to be a straightforward OCR helper. Before installing or running: (1) review and run the script in an isolated environment/virtualenv because pip will install large native packages (paddlepaddle) that may require specific platform/driver support; (2) confirm you trust the source (homepage unknown); (3) avoid feeding sensitive documents to unknown third-party tools if you cannot verify where processing occurs — this script runs locally, but dependencies could change behavior if malicious packages are installed; (4) consider running on a disposable VM or container and inspect temporary files under /tmp if you have multi-user concerns.
功能分析
Type: OpenClaw Skill
Name: ocr-python
Version: 1.0.0
The skill provides OCR functionality using standard libraries. However, the `scripts/ocr.py` script directly uses user-provided file paths for input (`file`) and output (`output`) without explicit sanitization. This creates a vulnerability where a malicious agent or user could potentially read or overwrite arbitrary files on the system by providing crafted paths (e.g., `/etc/passwd` or `../../sensitive.txt`). While this is a significant security risk (arbitrary file read/write), there is no clear evidence of intentional malicious behavior such as data exfiltration, backdoor installation, or prompt injection against the agent, thus classifying it as suspicious rather than malicious.
能力评估
Purpose & Capability
The name/description (OCR for Chinese/English PDFs and images) matches the provided scripts and README. The code imports PaddleOCR and PyMuPDF (fitz) as described, and implements PDF image extraction and image OCR.
Instruction Scope
SKILL.md and scripts/ocr.py limit actions to extracting images from PDFs, running PaddleOCR on images, printing or writing recognized text, and cleaning up temp files. The instructions do not ask the agent to read unrelated system files, transmit data to external endpoints, or access secrets.
Install Mechanism
There is no automated install spec; SKILL.md advises using pip3 to install paddlepaddle and paddleocr. This is an expected, proportionate dependency installation approach for a Python OCR skill (note: pip will fetch packages from PyPI; paddlepaddle is large and platform-dependent).
Credentials
The skill requests no environment variables, credentials, or config paths. The requested runtime imports (paddleocr, fitz) are appropriate for OCR/PDF extraction and are proportional to the stated purpose.
Persistence & Privilege
Skill does not request persistent system privileges, does not set always:true, and does not modify other skills or system-wide agent settings. It runs as a normal invokable tool without elevated persistence.
如何使用
- 确保已安装 OpenClaw(本地或 Docker 部署)
- 在对话框中输入安装命令:
/install ocr-python - 安装完成后,直接呼叫该 Skill 的名称或使用
/ocr-python触发 - 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.0
- Initial release of the OCR skill using PaddleOCR.
- Supports text extraction from PDF and image files in both Chinese and English.
- Output includes recognized text and confidence scores in JSON format.
- Typical use cases covered: scanned PDFs, images, invoices, contracts, and multi-page documents.
- Includes quick start guide and dependency installation instructions.
元数据
常见问题
OCR with python 是什么?
Extract Chinese and English text from images and scanned PDFs, including documents like invoices and contracts, using PaddleOCR in Python. 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 1591 次。
如何安装 OCR with python?
在 OpenClaw 或 Claude Code 对话框中运行命令「/install ocr-python」即可一键安装,无需额外配置。
OCR with python 是免费的吗?
是的,OCR with python 完全免费(开源免费),可自由下载、安装和使用。
OCR with python 支持哪些平台?
OCR with python 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。
谁开发了 OCR with python?
由 Roamer 徐(@roamer-remote)开发并维护,当前版本 v1.0.0。
推荐 Skills