← 返回 Skills 市场

PaddleOCR

Name: PaddleOCR
Author: cat-xierluo

作者 xierluo · GitHub ↗ · v1.1.1 · MIT-0

cross-platform ✓ 安全检测通过

267

总下载

当前安装

版本数

在 OpenClaw 中安装

/install paddle-ocr

功能描述

面向法律 PDF 与扫描件的 PaddleOCR 结构化解析技能。默认将本地 PDF 或图片转换为 Markdown，并在技能内部保留可追溯 archive 归档。本技能应在用户需要法律 PDF OCR、卷宗 OCR、病历 OCR、证据扫描件转 Markdown、表格识别、公式识别、版面分析、PDF 转 Mark...

安全使用建议

This skill appears to do what it says: it converts PDFs/images to Markdown and keeps a local archive, calling a PaddleOCR layout-parsing API that you must configure. Before using it with sensitive documents: 1) Fix the registry/config mismatch — SKILL.md and lib.py require PADDLEOCR_DOC_PARSING_API_URL and PADDLEOCR_ACCESS_TOKEN (create paddle-ocr/config/.env as described). 2) Verify the API URL points to a service you trust (self-hosted or a trusted provider). The tool will upload full documents (Base64) to that endpoint with the token in an Authorization header. 3) Run smoke_test.py first (try --skip-api-test to check config, then with a non-sensitive sample to confirm endpoint behavior). 4) If privacy is critical, prefer a local/self-hosted PaddleOCR layout-parsing endpoint (or localhost) so data does not leave your environment. 5) Be aware 'uv' will install Python dependencies at runtime (via PyPI); inspect dependencies if your environment has strict policies. Finally, review the repo/homepage and .env.example to confirm the configured endpoint and token handling meet your security/compliance needs.

功能分析

Type: OpenClaw Skill Name: paddle-ocr Version: 1.1.1 The paddle-ocr skill bundle is a well-structured tool designed to convert legal PDFs and images into Markdown using the PaddleOCR API. It includes robust features such as automatic PDF batching (scripts/split_pdf.py), image optimization (scripts/optimize_file.py), and a local archival system for traceability. The code uses standard libraries (httpx, pypdfium2, Pillow) and implements proper shell argument quoting in its JXA wrapper (scripts/convert.js) to prevent injection. Network activity is strictly limited to the user-configured API endpoint and a legitimate Baidu-hosted test image (smoke_test.py), with no evidence of data exfiltration or malicious intent.

能力标签

cryptocan-make-purchasesrequires-oauth-tokenrequires-sensitive-credentials

能力评估

ℹ Purpose & Capability

The skill is advertised as a PaddleOCR-based legal-PDF-to-Markdown converter and the included scripts (convert.py, layout_caller.py, lib.py, etc.) implement that. The environment variables and request/response handling in lib.py align with the stated purpose (PADDLEOCR_DOC_PARSING_API_URL and PADDLEOCR_ACCESS_TOKEN). However, the registry metadata claims 'Required env vars: none' while SKILL.md and lib.py clearly require the two PaddleOCR config variables — an inconsistency that can mislead users during installation/configuration.

ℹ Instruction Scope

Runtime instructions are scoped to: reading a local file or remote URL, calling the configured PaddleOCR layout-parsing API (sending file Base64 or file URL), extracting Markdown and images, and writing an archive under the skill's archive/ folder. This is coherent with the purpose. Important privacy/security implication: the scripts will upload entire documents (Base64 payloads) to whatever API URL you configure, so sensitive legal or medical documents will be transmitted to that endpoint. The scripts do not attempt to read unrelated system files or other credentials.

✓ Install Mechanism

There is no registry install spec (instruction-only), and the repo provides Python scripts. Execution uses 'uv run --script' headers to declare dependencies; the SKILL.md asks the user to install 'uv' if needed. No arbitrary network downloads or packaged installers are defined by the registry. Note: running the scripts will install Python packages (via uv/PyPI) at runtime — standard but worth being aware of.

⚠ Credentials

The skill legitimately requires two environment/config values: PADDLEOCR_DOC_PARSING_API_URL and PADDLEOCR_ACCESS_TOKEN (declared in SKILL.md and enforced by lib.py). These are proportional to the functionality. The concern is the mismatch with the registry metadata that listed no required env vars — that omission could lead users to run the skill without realizing they must configure an API endpoint and token (and thus inadvertently send data to an unintended endpoint). The skill does not request unrelated credentials or broad environment access.

✓ Persistence & Privilege

The skill does not request 'always: true' or other elevated platform privileges. It writes files only to an archive directory under its own skill root and to user-specified output paths. It does not modify other skills or global agent configuration.

如何使用

确保已安装 OpenClaw（本地或 Docker 部署）
在对话框中输入安装命令：/install paddle-ocr
安装完成后，直接呼叫该 Skill 的名称或使用 /paddle-ocr 触发
根据 Skill 的参数说明提供必要输入，即可获得结构化输出

版本历史

v1.1.1

面向法律 PDF 与扫描件的 PaddleOCR 结构化解析，支持表格识别、公式识别、版面分析，保留 archive 归档

元数据

Slug paddle-ocr

版本 1.1.1

许可证 MIT-0

累计安装 1

当前安装数 1

历史版本数 1

常见问题

PaddleOCR 是什么？

面向法律 PDF 与扫描件的 PaddleOCR 结构化解析技能。默认将本地 PDF 或图片转换为 Markdown，并在技能内部保留可追溯 archive 归档。本技能应在用户需要法律 PDF OCR、卷宗 OCR、病历 OCR、证据扫描件转 Markdown、表格识别、公式识别、版面分析、PDF 转 Mark... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件，目前累计下载 267 次。

如何安装 PaddleOCR？

在 OpenClaw 或 Claude Code 对话框中运行命令「/install paddle-ocr」即可一键安装，无需额外配置。

PaddleOCR 是免费的吗？

是的，PaddleOCR 完全免费，采用 MIT-0 许可证，可自由下载、安装和使用。

PaddleOCR 支持哪些平台？

PaddleOCR 跨平台运行，可在任意部署了 OpenClaw / Claude Code 的环境中使用（cross-platform）。

谁开发了 PaddleOCR？

由 xierluo（@cat-xierluo）开发并维护，当前版本 v1.1.1。