← Back to Skills Marketplace
100
Downloads
0
Stars
0
Active Installs
1
Versions
Install in OpenClaw
/install opendataloader-pdf-wuxie
Description
PDF parsing tool for AI/RAG. Convert PDF to Markdown, JSON, HTML with layout preservation, bounding boxes, and image extraction. Use when you need to extract...
Usage Guidance
This skill appears to do what it says (convert PDFs locally) but there are a few red flags to check before installing: 1) Verify the pip package source — confirm the exact PyPI package and/or GitHub repo matches the SKILL.md homepage and is from a trusted maintainer. 2) Inspect the package contents (and bundled JAR) before installing, or install into an isolated environment/container to observe behavior. 3) Ask the maintainer or check docs what '--hybrid' (docling-fast) does and whether it calls remote services or requires API keys — if it does, confirm what endpoints and credentials are used. 4) Because pipx runs code at install time, avoid installing on sensitive/production hosts until you've validated the package. If you want, provide the actual pip package name or the source repo and I can check it for further inconsistencies.
Capability Analysis
Type: OpenClaw Skill
Name: opendataloader-pdf-wuxie
Version: 1.0.0
The skill bundle provides a PDF parsing utility (opendataloader-pdf) designed for AI and RAG workflows, supporting conversion to Markdown, JSON, and HTML. The SKILL.md documentation and the references/test_script.sh file contain standard usage instructions and testing logic consistent with the tool's stated purpose, with no evidence of data exfiltration, malicious execution, or prompt injection.
Capability Assessment
Purpose & Capability
Name/description, CLI examples, and included test script all align with a PDF parsing CLI. The SKILL.md states the package is installed via pipx and bundles a PDFBox JAR — that is coherent for a Java-based PDF tool. Minor inconsistency: registry metadata lists no homepage/source while SKILL.md includes a GitHub homepage URL, which should be verified.
Instruction Scope
Runtime instructions only run a local CLI (opendataloader-pdf) against local PDF files and write output to local directories. They do not instruct reading unrelated system files or environment variables. Ambiguity: the '--hybrid' / 'Hybrid AI mode: docling-fast' option could imply contacting an external AI service or model; SKILL.md gives no details about network calls, remote endpoints, or required API keys.
Install Mechanism
No install spec in the registry; SKILL.md recommends 'pipx install opendataloader-pdf'. Installing from PyPI/pipx runs package install scripts which may execute code at install time and will pull the package from wherever it's published. The SKILL.md claims a GitHub homepage, but the registry shows source unknown — confirm the exact pip package name and origin before installing.
Credentials
The skill declares no required environment variables, no secrets, and no config paths. That is proportionate for a local PDF parsing CLI. However, the hybrid/AI option appears under-specified: if it uses a remote service it would typically require API credentials (none are declared), so verify whether additional credentials are needed at runtime.
Persistence & Privilege
The skill does not request persistent/autostart privileges (always:false). It is user-invocable and allows autonomous invocation (default) which is normal. It does not declare modifications to other skills or system-wide settings.
How to Use
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat:
/install opendataloader-pdf-wuxie - After installation, invoke the skill by name or use
/opendataloader-pdf-wuxie - Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
Initial release: PDF to Markdown/JSON/HTML with layout preservation
Metadata
Frequently Asked Questions
What is OpenDataLoader PDF Parser (乌贼版)?
PDF parsing tool for AI/RAG. Convert PDF to Markdown, JSON, HTML with layout preservation, bounding boxes, and image extraction. Use when you need to extract... It is an AI Agent Skill for Claude Code / OpenClaw, with 100 downloads so far.
How do I install OpenDataLoader PDF Parser (乌贼版)?
Run "/install opendataloader-pdf-wuxie" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.
Is OpenDataLoader PDF Parser (乌贼版) free?
Yes, OpenDataLoader PDF Parser (乌贼版) is completely free, licensed under MIT-0. You can download, install and use it at no cost.
Which platforms does OpenDataLoader PDF Parser (乌贼版) support?
OpenDataLoader PDF Parser (乌贼版) is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).
Who created OpenDataLoader PDF Parser (乌贼版)?
It is built and maintained by wtjjacobj (@wtjjacobj); the current version is v1.0.0.
More Skills