← 返回 Skills 市场

PaddleOCR Document Parsing V2

Name: PaddleOCR Document Parsing V2
Author: hiotec

作者 hioTEC · GitHub ↗ · v1.0.4

darwinlinux ⚠ suspicious

1076

总下载

当前安装

版本数

在 OpenClaw 中安装

/install paddleocr-doc-parsing-v2

功能描述

Parse documents using PaddleOCR's API. Supports both sync and async modes for images and PDFs.

安全使用建议

This skill appears to be what it claims: a client that uploads a local file (base64-encoded) or references a URL and sends it to whatever PADDLEOCR_API_URL or PADDLEOCR_JOB_URL you configure. Before installing, verify the endpoint URL you set is an official/trusted PaddleOCR endpoint (the skill will send your document contents and token to that URL). Ensure the Python 'requests' package is available if you plan to use async mode. Be aware of the differing Authorization header formats used by sync vs async paths and confirm which your endpoint expects. Run the tool in an isolated environment if you will upload sensitive documents, and consider rotating the API token if you share it with third-party services. If you need the skill to manage dependencies automatically or want stricter metadata, request the author add an explicit dependency declaration for Python packages and harmonize the env metadata with the documented optional variables.

功能分析

Type: OpenClaw Skill Name: paddleocr-doc-parsing-v2 Version: 1.0.4 The skill is classified as suspicious due to a critical shell injection vulnerability found in `scripts/paddleocr_parse.sh`. The script uses `cat "$input_file"` to read local files for base64 encoding. If the `input_file` argument is controlled by an attacker (e.g., via prompt injection against the agent), a crafted string like `"; malicious_command; #.jpg"` could lead to arbitrary command execution. While the Python script (`scripts/paddleocr_parse.py`) handles file paths securely, the shell script's vulnerability poses a significant risk. The skill's core functionality of interacting with the PaddleOCR API, including reading local files and fetching remote URLs, is otherwise aligned with its stated purpose.

能力评估

✓ Purpose & Capability

Name/description, required binaries (curl, base64, jq, python3), env vars (access token and API URL), and included scripts all match a document-parsing client for an external PaddleOCR-like API. No unrelated credentials or tools are requested.

ℹ Instruction Scope

SKILL.md and the scripts limit their actions to encoding or referencing the specified input file/URL and calling the configured API endpoints. They do not attempt to read other system files or exfiltrate unrelated data. Notes: the Python script imports the 'requests' library but SKILL.md and metadata do not declare an installation step or dependency for that package; if not present, async mode will fail. Also the instructions refer to optional PADDLEOCR_JOB_URL and PADDLEOCR_MODEL for async mode — these are documented in SKILL.md but the top-level metadata only lists the two primary env vars, which is a minor mismatch.

✓ Install Mechanism

This is instruction-only (no installer). The skill includes two executable scripts in the bundle and does not download or execute external installers. No high-risk install URLs or archive extraction are present.

ℹ Credentials

Requested env vars (PADDLEOCR_ACCESS_TOKEN, PADDLEOCR_API_URL, and optionally PADDLEOCR_JOB_URL/PADDLEOCR_MODEL) are proportional to the stated purpose. Minor inconsistency: async operation requires PADDLEOCR_JOB_URL but that variable is optional in metadata; also the scripts use different Authorization header formats ('Authorization: token <token>' for sync, 'Authorization: bearer <token>' for async) which may cause confusion depending on the backend's expected scheme.

✓ Persistence & Privilege

The skill does not request permanent or privileged presence (always: false). It does not modify other skills or system-wide settings. Normal autonomous invocation is allowed by platform defaults but not elevated here.

如何使用

确保已安装 OpenClaw（本地或 Docker 部署）
在对话框中输入安装命令：/install paddleocr-doc-parsing-v2
安装完成后，直接呼叫该 Skill 的名称或使用 /paddleocr-doc-parsing-v2 触发
根据 Skill 的参数说明提供必要输入，即可获得结构化输出

版本历史

v1.0.4

Security fix: remove hardcoded endpoints, require user to set PADDLEOCR_API_URL

v1.0.3

Security fix: remove suspicious timestamps, add async mode support

元数据

Slug paddleocr-doc-parsing-v2

版本 1.0.4

许可证 —

累计安装 3

当前安装数 3

历史版本数 2

常见问题

PaddleOCR Document Parsing V2 是什么？

Parse documents using PaddleOCR's API. Supports both sync and async modes for images and PDFs. 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件，目前累计下载 1076 次。

如何安装 PaddleOCR Document Parsing V2？

在 OpenClaw 或 Claude Code 对话框中运行命令「/install paddleocr-doc-parsing-v2」即可一键安装，无需额外配置。

PaddleOCR Document Parsing V2 是免费的吗？

是的，PaddleOCR Document Parsing V2 完全免费（开源免费），可自由下载、安装和使用。

PaddleOCR Document Parsing V2 支持哪些平台？

PaddleOCR Document Parsing V2 跨平台运行，可在任意部署了 OpenClaw / Claude Code 的环境中使用（darwin, linux）。

谁开发了 PaddleOCR Document Parsing V2？

由 hioTEC（@hiotec）开发并维护，当前版本 v1.0.4。