← Back to Skills Marketplace

PaddleOCR Document Parsing V2

Name: PaddleOCR Document Parsing V2
Author: hiotec

by hioTEC · GitHub ↗ · v1.0.4

darwinlinux ⚠ suspicious

1076

Downloads

Stars

Active Installs

Versions

Install in OpenClaw

/install paddleocr-doc-parsing-v2

Description

Parse documents using PaddleOCR's API. Supports both sync and async modes for images and PDFs.

Usage Guidance

This skill appears to be what it claims: a client that uploads a local file (base64-encoded) or references a URL and sends it to whatever PADDLEOCR_API_URL or PADDLEOCR_JOB_URL you configure. Before installing, verify the endpoint URL you set is an official/trusted PaddleOCR endpoint (the skill will send your document contents and token to that URL). Ensure the Python 'requests' package is available if you plan to use async mode. Be aware of the differing Authorization header formats used by sync vs async paths and confirm which your endpoint expects. Run the tool in an isolated environment if you will upload sensitive documents, and consider rotating the API token if you share it with third-party services. If you need the skill to manage dependencies automatically or want stricter metadata, request the author add an explicit dependency declaration for Python packages and harmonize the env metadata with the documented optional variables.

Capability Analysis

Type: OpenClaw Skill Name: paddleocr-doc-parsing-v2 Version: 1.0.4 The skill is classified as suspicious due to a critical shell injection vulnerability found in `scripts/paddleocr_parse.sh`. The script uses `cat "$input_file"` to read local files for base64 encoding. If the `input_file` argument is controlled by an attacker (e.g., via prompt injection against the agent), a crafted string like `"; malicious_command; #.jpg"` could lead to arbitrary command execution. While the Python script (`scripts/paddleocr_parse.py`) handles file paths securely, the shell script's vulnerability poses a significant risk. The skill's core functionality of interacting with the PaddleOCR API, including reading local files and fetching remote URLs, is otherwise aligned with its stated purpose.

Capability Assessment

✓ Purpose & Capability

Name/description, required binaries (curl, base64, jq, python3), env vars (access token and API URL), and included scripts all match a document-parsing client for an external PaddleOCR-like API. No unrelated credentials or tools are requested.

ℹ Instruction Scope

SKILL.md and the scripts limit their actions to encoding or referencing the specified input file/URL and calling the configured API endpoints. They do not attempt to read other system files or exfiltrate unrelated data. Notes: the Python script imports the 'requests' library but SKILL.md and metadata do not declare an installation step or dependency for that package; if not present, async mode will fail. Also the instructions refer to optional PADDLEOCR_JOB_URL and PADDLEOCR_MODEL for async mode — these are documented in SKILL.md but the top-level metadata only lists the two primary env vars, which is a minor mismatch.

✓ Install Mechanism

This is instruction-only (no installer). The skill includes two executable scripts in the bundle and does not download or execute external installers. No high-risk install URLs or archive extraction are present.

ℹ Credentials

Requested env vars (PADDLEOCR_ACCESS_TOKEN, PADDLEOCR_API_URL, and optionally PADDLEOCR_JOB_URL/PADDLEOCR_MODEL) are proportional to the stated purpose. Minor inconsistency: async operation requires PADDLEOCR_JOB_URL but that variable is optional in metadata; also the scripts use different Authorization header formats ('Authorization: token <token>' for sync, 'Authorization: bearer <token>' for async) which may cause confusion depending on the backend's expected scheme.

✓ Persistence & Privilege

The skill does not request permanent or privileged presence (always: false). It does not modify other skills or system-wide settings. Normal autonomous invocation is allowed by platform defaults but not elevated here.

How to Use

Make sure OpenClaw is installed (local or Docker)
Run the install command in chat: /install paddleocr-doc-parsing-v2
After installation, invoke the skill by name or use /paddleocr-doc-parsing-v2
Provide required inputs per the skill's parameter spec and get structured output

Version History

v1.0.4

Security fix: remove hardcoded endpoints, require user to set PADDLEOCR_API_URL

v1.0.3

Security fix: remove suspicious timestamps, add async mode support

Metadata

Slug paddleocr-doc-parsing-v2

Version 1.0.4

License —

All-time Installs 3

Active Installs 3

Total Versions 2

Frequently Asked Questions

What is PaddleOCR Document Parsing V2?

Parse documents using PaddleOCR's API. Supports both sync and async modes for images and PDFs. It is an AI Agent Skill for Claude Code / OpenClaw, with 1076 downloads so far.

How do I install PaddleOCR Document Parsing V2?

Run "/install paddleocr-doc-parsing-v2" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is PaddleOCR Document Parsing V2 free?

Yes, PaddleOCR Document Parsing V2 is completely free (open-source). You can download, install and use it at no cost.

Which platforms does PaddleOCR Document Parsing V2 support?

PaddleOCR Document Parsing V2 is cross-platform and runs anywhere OpenClaw / Claude Code is available (darwin, linux).

Who created PaddleOCR Document Parsing V2?

It is built and maintained by hioTEC (@hiotec); the current version is v1.0.4.

More Skills