← Back to Skills Marketplace
1076
Downloads
4
Stars
3
Active Installs
2
Versions
Install in OpenClaw
/install paddleocr-doc-parsing-v2
Description
Parse documents using PaddleOCR's API. Supports both sync and async modes for images and PDFs.
Usage Guidance
This skill appears to be what it claims: a client that uploads a local file (base64-encoded) or references a URL and sends it to whatever PADDLEOCR_API_URL or PADDLEOCR_JOB_URL you configure. Before installing, verify the endpoint URL you set is an official/trusted PaddleOCR endpoint (the skill will send your document contents and token to that URL). Ensure the Python 'requests' package is available if you plan to use async mode. Be aware of the differing Authorization header formats used by sync vs async paths and confirm which your endpoint expects. Run the tool in an isolated environment if you will upload sensitive documents, and consider rotating the API token if you share it with third-party services. If you need the skill to manage dependencies automatically or want stricter metadata, request the author add an explicit dependency declaration for Python packages and harmonize the env metadata with the documented optional variables.
Capability Analysis
Type: OpenClaw Skill
Name: paddleocr-doc-parsing-v2
Version: 1.0.4
The skill is classified as suspicious due to a critical shell injection vulnerability found in `scripts/paddleocr_parse.sh`. The script uses `cat "$input_file"` to read local files for base64 encoding. If the `input_file` argument is controlled by an attacker (e.g., via prompt injection against the agent), a crafted string like `"; malicious_command; #.jpg"` could lead to arbitrary command execution. While the Python script (`scripts/paddleocr_parse.py`) handles file paths securely, the shell script's vulnerability poses a significant risk. The skill's core functionality of interacting with the PaddleOCR API, including reading local files and fetching remote URLs, is otherwise aligned with its stated purpose.
Capability Assessment
Purpose & Capability
Name/description, required binaries (curl, base64, jq, python3), env vars (access token and API URL), and included scripts all match a document-parsing client for an external PaddleOCR-like API. No unrelated credentials or tools are requested.
Instruction Scope
SKILL.md and the scripts limit their actions to encoding or referencing the specified input file/URL and calling the configured API endpoints. They do not attempt to read other system files or exfiltrate unrelated data. Notes: the Python script imports the 'requests' library but SKILL.md and metadata do not declare an installation step or dependency for that package; if not present, async mode will fail. Also the instructions refer to optional PADDLEOCR_JOB_URL and PADDLEOCR_MODEL for async mode — these are documented in SKILL.md but the top-level metadata only lists the two primary env vars, which is a minor mismatch.
Install Mechanism
This is instruction-only (no installer). The skill includes two executable scripts in the bundle and does not download or execute external installers. No high-risk install URLs or archive extraction are present.
Credentials
Requested env vars (PADDLEOCR_ACCESS_TOKEN, PADDLEOCR_API_URL, and optionally PADDLEOCR_JOB_URL/PADDLEOCR_MODEL) are proportional to the stated purpose. Minor inconsistency: async operation requires PADDLEOCR_JOB_URL but that variable is optional in metadata; also the scripts use different Authorization header formats ('Authorization: token <token>' for sync, 'Authorization: bearer <token>' for async) which may cause confusion depending on the backend's expected scheme.
Persistence & Privilege
The skill does not request permanent or privileged presence (always: false). It does not modify other skills or system-wide settings. Normal autonomous invocation is allowed by platform defaults but not elevated here.
How to Use
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat:
/install paddleocr-doc-parsing-v2 - After installation, invoke the skill by name or use
/paddleocr-doc-parsing-v2 - Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.4
Security fix: remove hardcoded endpoints, require user to set PADDLEOCR_API_URL
v1.0.3
Security fix: remove suspicious timestamps, add async mode support
Metadata
Frequently Asked Questions
What is PaddleOCR Document Parsing V2?
Parse documents using PaddleOCR's API. Supports both sync and async modes for images and PDFs. It is an AI Agent Skill for Claude Code / OpenClaw, with 1076 downloads so far.
How do I install PaddleOCR Document Parsing V2?
Run "/install paddleocr-doc-parsing-v2" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.
Is PaddleOCR Document Parsing V2 free?
Yes, PaddleOCR Document Parsing V2 is completely free (open-source). You can download, install and use it at no cost.
Which platforms does PaddleOCR Document Parsing V2 support?
PaddleOCR Document Parsing V2 is cross-platform and runs anywhere OpenClaw / Claude Code is available (darwin, linux).
Who created PaddleOCR Document Parsing V2?
It is built and maintained by hioTEC (@hiotec); the current version is v1.0.4.
More Skills