Downloads: 267
Stars: 0
Active Installs: 1
Versions: 1
Install in OpenClaw
/install paddle-ocr
Description
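A PaddleOCR structured-parsing skill for legal PDFs and scanned documents. By default it converts local PDFs or images to Markdown and keeps a traceable archive inside the skill. This skill should be used when the user needs legal PDF OCR, case-file OCR, medical-record OCR, conversion of scanned evidence to Markdown, table recognition, formula recognition, layout analysis, PDF to Mark...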
Usage Guidance
This skill appears to do what it says: it converts PDFs and images to Markdown and keeps a local archive, calling a PaddleOCR layout-parsing API that you must configure. Before using it with sensitive documents:
1) Resolve the registry/config mismatch: the registry metadata lists no required environment variables, but SKILL.md and lib.py require PADDLEOCR_DOC_PARSING_API_URL and PADDLEOCR_ACCESS_TOKEN (create paddle-ocr/config/.env as described).
2) Verify that the API URL points to a service you trust (self-hosted or a trusted provider). The tool uploads full documents (Base64) to that endpoint, with the token in an Authorization header.
3) Run smoke_test.py first: try --skip-api-test to check the config, then run it with a non-sensitive sample to confirm endpoint behavior.
4) If privacy is critical, prefer a local or self-hosted PaddleOCR layout-parsing endpoint (for example on localhost) so data does not leave your environment.
5) Be aware that 'uv' installs Python dependencies at runtime (via PyPI); inspect the dependencies if your environment has strict policies.
Finally, review the repo/homepage and .env.example to confirm that the configured endpoint and token handling meet your security and compliance needs.
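The endpoint check in the guidance above can be partly automated. The following is a minimal sketch, not code from the skill: the helper name `is_local_endpoint` is hypothetical, and it only tests whether the configured URL's host is `localhost` or a loopback IP address. It says nothing about whether a remote provider is trustworthy.

```python
import ipaddress
from urllib.parse import urlparse


def is_local_endpoint(api_url: str) -> bool:
    """Return True if the URL's host is 'localhost' or a loopback IP address.

    Hypothetical helper for illustration; it is not part of the skill.
    """
    host = urlparse(api_url).hostname
    if host is None:
        return False
    if host == "localhost":
        return True
    try:
        return ipaddress.ip_address(host).is_loopback
    except ValueError:
        # The host is a DNS name other than 'localhost', so treat it as remote.
        return False
```

A caller could run this against the value of PADDLEOCR_DOC_PARSING_API_URL and warn before sending a sensitive document to a non-local host.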
Capability Analysis
Type: OpenClaw Skill
Name: paddle-ocr
Version: 1.1.1
The paddle-ocr skill bundle is a well-structured tool designed to convert legal PDFs and images into Markdown using the PaddleOCR API. It includes robust features such as automatic PDF batching (scripts/split_pdf.py), image optimization (scripts/optimize_file.py), and a local archival system for traceability. The code uses standard libraries (httpx, pypdfium2, Pillow) and implements proper shell argument quoting in its JXA wrapper (scripts/convert.js) to prevent injection. Network activity is strictly limited to the user-configured API endpoint and a legitimate Baidu-hosted test image (smoke_test.py), with no evidence of data exfiltration or malicious intent.
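To illustrate what PDF batching involves, here is a minimal sketch of splitting a page count into fixed-size batches. It is an assumption about the general technique, not the logic of scripts/split_pdf.py: the function name `page_batches` and the batch size are hypothetical.

```python
def page_batches(total_pages: int, batch_size: int) -> list[range]:
    """Split zero-based page indices into consecutive batches.

    Hypothetical helper; the real batch size and splitting rules
    used by the skill are not stated in this listing.
    """
    if batch_size < 1:
        raise ValueError("batch_size must be at least 1")
    return [
        range(start, min(start + batch_size, total_pages))
        for start in range(0, total_pages, batch_size)
    ]
```

Each range could then be extracted into its own smaller PDF and sent to the API separately, which keeps individual request payloads bounded.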
Capability Assessment
Purpose & Capability
The skill is advertised as a PaddleOCR-based legal-PDF-to-Markdown converter and the included scripts (convert.py, layout_caller.py, lib.py, etc.) implement that. The environment variables and request/response handling in lib.py align with the stated purpose (PADDLEOCR_DOC_PARSING_API_URL and PADDLEOCR_ACCESS_TOKEN). However, the registry metadata claims 'Required env vars: none' while SKILL.md and lib.py clearly require the two PaddleOCR config variables — an inconsistency that can mislead users during installation/configuration.
Instruction Scope
Runtime instructions are scoped to: reading a local file or remote URL, calling the configured PaddleOCR layout-parsing API (sending file Base64 or file URL), extracting Markdown and images, and writing an archive under the skill's archive/ folder. This is coherent with the purpose. Important privacy/security implication: the scripts will upload entire documents (Base64 payloads) to whatever API URL you configure, so sensitive legal or medical documents will be transmitted to that endpoint. The scripts do not attempt to read unrelated system files or other credentials.
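The privacy implication is easier to see in code. The sketch below builds, but does not send, a request of the kind described: a remote URL is passed through as-is, while a local file is read in full and Base64-encoded. The function name `build_request`, the `file` field name, and the `token` prefix in the Authorization header are assumptions made for illustration; check lib.py for the actual request shape.

```python
import base64
from pathlib import Path


def build_request(source: str, token: str) -> tuple[dict, dict]:
    """Return (headers, payload) for a layout-parsing call without sending it.

    The header scheme and payload field name are assumed, not confirmed.
    """
    headers = {
        "Authorization": f"token {token}",  # assumed scheme
        "Content-Type": "application/json",
    }
    if source.startswith(("http://", "https://")):
        # Remote input: only the URL is sent.
        file_value = source
    else:
        # Local input: the entire file content is embedded in the payload.
        file_value = base64.b64encode(Path(source).read_bytes()).decode("ascii")
    return headers, {"file": file_value}
```

Base64 is an encoding, not encryption, so whoever operates the endpoint can recover the full document from the payload.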
Install Mechanism
There is no registry install spec (instruction-only), and the repo provides Python scripts. Execution uses 'uv run --script' headers to declare dependencies; the SKILL.md asks the user to install 'uv' if needed. No arbitrary network downloads or packaged installers are defined by the registry. Note: running the scripts will install Python packages (via uv/PyPI) at runtime — standard but worth being aware of.
Credentials
The skill legitimately requires two environment/config values: PADDLEOCR_DOC_PARSING_API_URL and PADDLEOCR_ACCESS_TOKEN (declared in SKILL.md and enforced by lib.py). These are proportional to the functionality. The concern is the mismatch with the registry metadata that listed no required env vars — that omission could lead users to run the skill without realizing they must configure an API endpoint and token (and thus inadvertently send data to an unintended endpoint). The skill does not request unrelated credentials or broad environment access.
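Because the registry metadata does not list these variables, a pre-flight check can catch a missing configuration before any document is processed. This is a minimal sketch, assuming the two values are visible as environment variables (for example after loading paddle-ocr/config/.env). The helper name `missing_config` is hypothetical and is not the check lib.py performs.

```python
import os

# Variable names as given in SKILL.md and lib.py.
REQUIRED_VARS = ("PADDLEOCR_DOC_PARSING_API_URL", "PADDLEOCR_ACCESS_TOKEN")


def missing_config(env=None) -> list[str]:
    """Return the required variable names that are unset or blank.

    Hypothetical helper; pass a dict to check something other than os.environ.
    """
    env = os.environ if env is None else env
    return [name for name in REQUIRED_VARS if not env.get(name, "").strip()]
```

An empty result means both values are present; it does not verify that the URL or token is valid.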
Persistence & Privilege
The skill does not request 'always: true' or other elevated platform privileges. It writes files only to an archive directory under its own skill root and to user-specified output paths. It does not modify other skills or global agent configuration.
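A reviewer who wants to confirm the archive-write claim could test candidate paths with a containment check like the one below. This is a sketch of a generic technique, not code from the skill, and the helper name `is_inside` is hypothetical. It applies only to archive writes, since user-specified output paths may legitimately lie elsewhere.

```python
from pathlib import Path


def is_inside(root: str, candidate: str) -> bool:
    """Return True if candidate resolves to root or to a path beneath it.

    Hypothetical helper; resolving first neutralizes '..' segments.
    """
    root_path = Path(root).resolve()
    candidate_path = Path(candidate).resolve()
    return candidate_path == root_path or root_path in candidate_path.parents
```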
How to Use
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat: /install paddle-ocr
- After installation, invoke the skill by name or use /paddle-ocr
- Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.1.1
PaddleOCR structured parsing for legal PDFs and scanned documents, with support for table recognition, formula recognition, and layout analysis; keeps a local archive.
Frequently Asked Questions
What is PaddleOCR?
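A PaddleOCR structured-parsing skill for legal PDFs and scanned documents. By default it converts local PDFs or images to Markdown and keeps a traceable archive inside the skill. This skill should be used when the user needs legal PDF OCR, case-file OCR, medical-record OCR, conversion of scanned evidence to Markdown, table recognition, formula recognition, layout analysis, PDF to Mark... It is an AI Agent Skill for Claude Code / OpenClaw, with 267 downloads so far.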
How do I install PaddleOCR?
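Run "/install paddle-ocr" in the OpenClaw or Claude Code chat to install it. Before first use, set PADDLEOCR_DOC_PARSING_API_URL and PADDLEOCR_ACCESS_TOKEN in paddle-ocr/config/.env, since SKILL.md and lib.py require both.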
Is PaddleOCR free?
Yes, PaddleOCR is completely free, licensed under MIT-0. You can download, install and use it at no cost.
Which platforms does PaddleOCR support?
PaddleOCR is cross-platform and runs anywhere OpenClaw / Claude Code is available.
Who created PaddleOCR?
It is built and maintained by xierluo (@cat-xierluo); the current version is v1.1.1.