← 返回 Skills 市场

PaddleOCR Document Parsing

Name: PaddleOCR Document Parsing
Author: bobholamovic

作者 Lin Manhui · GitHub ↗ · v3.0.0 · MIT-0

cross-platform ✓ 安全检测通过

10267

总下载

当前安装

版本数

在 OpenClaw 中安装

/install paddleocr-doc-parsing

功能描述

Use this skill to extract structured Markdown/JSON from PDFs and document images—tables with cell-level precision, formulas as LaTeX, figures, seals, charts,...

安全使用建议

Install only if you trust the PaddleOCR CLI package and are comfortable using a hosted OCR API with a PaddleOCR access token. Do not process confidential, regulated, financial, legal, or customer documents unless you have confirmed the data-sharing, retention, and compliance terms are acceptable.

能力评估

✓ Purpose & Capability

The stated purpose is extracting structured Markdown/JSON from PDFs and document images, and the artifact only contains PaddleOCR CLI instructions for that task.

ℹ Instruction Scope

Commands are user-directed and scoped to explicit URLs, local file paths, page ranges, and output locations; the skill should more plainly warn that local files submitted with `paddleocr api --file_path` may be sent to a remote OCR service.

ℹ Install Mechanism

Installation declares the `paddleocr` package via `uv`, the `paddleocr` binary, and `PADDLEOCR_ACCESS_TOKEN`; no helper scripts or hidden executable artifacts are included.

ℹ Credentials

Use of an API token and cloud OCR is proportionate for hosted document parsing, but the target documents may include invoices, financial reports, and other sensitive content.

✓ Persistence & Privilege

There is no evidence of background persistence, privilege escalation, broad filesystem indexing, credential harvesting, destructive actions, or automatic execution.

如何使用

确保已安装 OpenClaw（本地或 Docker 部署）
在对话框中输入安装命令：/install paddleocr-doc-parsing
安装完成后，直接呼叫该 Skill 的名称或使用 /paddleocr-doc-parsing 触发
根据 Skill 的参数说明提供必要输入，即可获得结构化输出

版本历史

v3.0.0

- Significant update: migrated from custom scripts and detailed workflow to usage of the official PaddleOCR CLI. - Removed all helper scripts and schema/reference files. - Updated instructions for document parsing using the new paddleocr api command-line interface. - Simplified configuration: only requires PADDLEOCR_ACCESS_TOKEN and paddleocr CLI. - Added quick-start usage examples with key CLI options and new output format. - Clarified error handling and preprocessing recommendations.

v2.0.16

- Switched installation and dependency management to inline PEP 723 requirements, eliminating the separate requirements.txt files. - Updated usage instructions: scripts are now executed via uv (Python 3.9+ required), replacing previous pip/python guidance. - Clarified compatibility: now requires Python 3.9+, uv, and internet access. - Added SPDX license field (Apache-2.0). - Cleaned up documentation by removing references to installation via requirements files and updating all example commands to use uv.

v2.0.15

- Replaced the main document parsing script: removed scripts/vl_caller.py and added scripts/layout_caller.py. - All usage and documentation instructions now reference layout_caller.py instead of vl_caller.py. - No other workflow or API changes; skill functionality and interface remain the same.

v2.0.14

No file changes detected in this version. - No user-facing or implementation changes. - Documentation, API, and skill behavior remain unchanged.

v2.0.13

No changes detected in this version. - Version bumped to 2.0.13 with no modifications to files or documentation. - Behavior, usage, and installation remain unchanged.

v2.0.12

- Documentation streamlined for clarity and usability—workflow, troubleshooting, and schema details are now easier to find. - Output fields renamed to match actual script results (e.g., layoutParsingResults) and old field names removed. - More concise and focused instructions: Removed redundant warnings, emphasized complete user-facing output, and added timing guidance for large files. - Detailed first-time setup, error handling, and output parsing sections improved for newcomers and troubleshooting. - Usage examples refreshed; file persistence and `--stdout` handling clarified.

v2.0.11

- Dependency installation has been streamlined: requirements.txt and requirements-optimize.txt are now located at the skill's root, not inside scripts/. - Updated installation instructions to reflect new requirements file locations. - Unused scripts/requirements*.txt files have been removed for clarity. - No changes to skill functionality or API usage.

v2.0.10

No user-facing changes in this version. - No updates detected in files or documentation. - Functionality, usage instructions, and requirements remain unchanged.

v2.0.9

No functional changes detected in this version. - Updated the skill description with more concise, keyword-rich, and bilingual (Chinese/English) trigger terms for improved discovery and routing. - Added an explicit note in the documentation directing routing/discovery logic to use the `description` field for trigger keywords. - No code or behavioral changes; skill usage and workflow remain the same.

v2.0.8

No user-facing changes in this version. - Version bump; no file or documentation changes detected.

v2.0.7

- Added installation instructions for required Python dependencies in the usage guide. - Included optional installation step for document optimization and PDF splitting support. - No changes to API, features, or workflow.

v2.0.6

- Removed the configuration script (scripts/configure.py) from the project. - Users are now expected to set environment variables using standard host application methods. - Documentation updated to clarify secure configuration and that configure.py is no longer used or required. - No functional changes to document parsing workflow.

v2.0.5

### paddleocr-doc-parsing 2.0.5 Changelog - Updated the description for improved clarity and conciseness. - The configuration instructions now assume environment variables are typically pre-configured; only notify about configuration issues if an error occurs during parsing. - Standardized error messages and clarified the configuration workflow for better user guidance. - Minor formatting and wording improvements throughout documentation.

v2.0.4

No user-visible changes in this version. - No file changes detected; documentation, functionality, and usage remain the same.

v2.0.3

- Added metadata section to SKILL.md, specifying required environment variables, dependencies, homepage, and emoji. - No changes to functionality or core documentation content. - Version bump for manifest metadata improvement and compatibility.

v2.0.2

No user-visible changes in this version. - Content and workflow remain unchanged. - No modifications or updates were detected in the skill files.

v2.0.1

- Added Openclaw-compatible metadata, including required environment variables and binaries. - Declared environment variables needed: PADDLEOCR_DOC_PARSING_API_URL, PADDLEOCR_ACCESS_TOKEN, PADDLEOCR_DOC_PARSING_TIMEOUT. - Added metadata section with homepage link and primary environment variable. - No changes to any code or documentation content outside of the metadata addition.

v2.0.0

Version 2.0.0 – Major update: Migrated from bash to Python, improved modular parsing, and enforced strict usage rules. - Replaced legacy bash script (paddleocr_parse.sh) with a Python-based orchestration (e.g., vl_caller.py). - Added configurable scripts for document parsing, PDF splitting, configuration, and optimization. - Introduced a clear API usage policy: only use script-based API calls; no direct parsing or fallback methods allowed. - Enhanced documentation with new setup, workflow, usage constraints, and error-handling guidelines. - Provided schema references and structured output for easier downstream processing.

v1.0.3

- No changes detected in this version; all features and documentation remain the same as the previous release.

v1.0.2

- No changes detected in this release.

元数据

Slug paddleocr-doc-parsing

版本 3.0.0

许可证 MIT-0

累计安装 345

当前安装数 78

历史版本数 22

常见问题

PaddleOCR Document Parsing 是什么？

Use this skill to extract structured Markdown/JSON from PDFs and document images—tables with cell-level precision, formulas as LaTeX, figures, seals, charts,... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件，目前累计下载 10267 次。

如何安装 PaddleOCR Document Parsing？

在 OpenClaw 或 Claude Code 对话框中运行命令「/install paddleocr-doc-parsing」即可一键安装，无需额外配置。

PaddleOCR Document Parsing 是免费的吗？

是的，PaddleOCR Document Parsing 完全免费，采用 MIT-0 许可证，可自由下载、安装和使用。

PaddleOCR Document Parsing 支持哪些平台？

PaddleOCR Document Parsing 跨平台运行，可在任意部署了 OpenClaw / Claude Code 的环境中使用（cross-platform）。

谁开发了 PaddleOCR Document Parsing？

由 Lin Manhui（@bobholamovic）开发并维护，当前版本 v3.0.0。