
WiseDiag MedOCR


by wisediag · GitHub ↗ · v1.0.26 · MIT-0

cross-platform ✓ Security Clean

618 Downloads

Install in OpenClaw

/install wisediag-medocr

Description

PDF & Image OCR — Convert a single PDF or image to Markdown via WiseDiag cloud API, with high-accuracy text extraction, table recognition, and multi-column l...

Usage Guidance

This skill appears to do what it says: it uploads one file to WiseDiag's cloud OCR API and saves the returned Markdown locally. Before installing or using it, consider:

1) Do NOT upload sensitive documents: the SKILL.md explicitly warns against IDs, financial, medical, biometric, credentials, or minor data.
2) The script requires WISEDIAG_API_KEY; the package metadata omitted this, so be sure to set the environment variable rather than assume none are needed.
3) The script calls https://openapi.wisediag.com; verify that domain and WiseDiag's privacy/data-retention policy meet your requirements.
4) If you must process sensitive files, use an offline OCR alternative.
5) As a best practice, set the API key in a secure location (avoid committing it to dotfiles or repos).

Finally, if you want extra assurance, review network traffic or run the script in an isolated environment (VM/container) while testing with non-sensitive files.
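Because the registry metadata does not declare the key, a pre-flight check can catch a missing variable before any file is touched. The following is a minimal sketch, not code from the skill: the helper name `require_api_key` is hypothetical, and only the variable name `WISEDIAG_API_KEY` comes from the listing above.

```python
import os
from typing import Mapping, Optional

ENV_VAR = "WISEDIAG_API_KEY"  # variable name as reported for SKILL.md and the script


def require_api_key(env: Optional[Mapping[str, str]] = None) -> str:
    """Return the API key from the environment, or fail with a clear message.

    Hypothetical helper for illustration; the skill's own script may
    perform this check differently.
    """
    env = os.environ if env is None else env
    key = env.get(ENV_VAR, "").strip()
    if not key:
        raise RuntimeError(f"{ENV_VAR} is not set; export it before running the skill")
    return key
```

Passing the environment as a parameter keeps the check easy to exercise without changing the real process environment.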

Capability Analysis

Type: OpenClaw Skill
Name: wisediag-medocr
Version: 1.0.26

The skill provides OCR functionality by uploading PDF or image files to the WiseDiag cloud API (openapi.wisediag.com) for conversion to Markdown. The implementation in scripts/wiseocr.py is transparent, using standard libraries like requests and pypdf, and includes clear privacy warnings in SKILL.md and README.md regarding the handling of sensitive data.

Capability Assessment

ℹ Purpose & Capability

The skill name/description (WiseDiag OCR to Markdown) aligns with its behavior: the script uploads a single PDF/image to WiseDiag's OCR endpoint, receives a JSON result, and writes Markdown to ~/.openclaw/workspace/WiseOCR. Note: the top-level registry metadata in the package summary listed no required env vars, but the SKILL.md and the script both require WISEDIAG_API_KEY — this metadata omission is an inconsistency to be aware of.

✓ Instruction Scope

SKILL.md instructs the user to set WISEDIAG_API_KEY and run the provided Python script. The script only reads the input file, optionally inspects PDF page count, uploads the file to https://openapi.wisediag.com/v1/ocr/pdf, and saves returned markdown locally. It does not access other system files, other credentials, or unusual endpoints. The privacy warning in SKILL.md accurately reflects the upload behavior.
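The data flow described above (read one file, upload it, save the returned Markdown) can be sketched as follows. This is an illustration under stated assumptions, not the contents of scripts/wiseocr.py: `ocr_to_markdown` and the `upload` callable are hypothetical, the real request format and JSON response shape are not shown in this listing, and the `<stem>.md` output name is assumed. The extension list is taken from the v1.0.22 release notes, plus PDF.

```python
from pathlib import Path
from typing import Callable

# Extensions from the v1.0.22 release notes, plus PDF.
SUPPORTED = {".pdf", ".jpg", ".jpeg", ".png", ".webp", ".gif", ".bmp", ".tiff"}


def ocr_to_markdown(
    input_path: Path,
    upload: Callable[[str, bytes], str],
    out_dir: Path,
) -> Path:
    """Read one file, hand it to `upload`, and save the Markdown it returns.

    `upload` is a hypothetical stand-in for the real HTTP call: it receives
    the file name and bytes and returns Markdown text.
    """
    if input_path.suffix.lower() not in SUPPORTED:
        raise ValueError(f"unsupported file type: {input_path.suffix}")
    data = input_path.read_bytes()            # the only file that is read
    markdown = upload(input_path.name, data)  # the only network step
    out_dir.mkdir(parents=True, exist_ok=True)
    out_file = out_dir / (input_path.stem + ".md")  # assumed naming scheme
    out_file.write_text(markdown, encoding="utf-8")
    return out_file
```

Injecting `upload` as a parameter means the flow can be tried with a stub that returns fixed Markdown, so nothing leaves the machine during testing.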

✓ Install Mechanism

There is no custom install step; the project includes a requirements.txt (requests, pypdf) and recommends pip install -r requirements.txt. No remote arbitrary installers, URL downloads, or packaged binaries are present. The code file is plain Python and straightforward to inspect.

ℹ Credentials

The only runtime secret required is WISEDIAG_API_KEY, which is reasonable and proportional for a cloud OCR service. However, the registry-level 'Requirements' field (provided earlier in the package metadata) did not list this env var while SKILL.md and the script both require it — this mismatch could confuse users or automated tooling.

✓ Persistence & Privilege

The skill runs on demand (always:false) and is user-invocable. It writes output only to a subdirectory under the user's home (~/.openclaw/workspace/WiseOCR). It does not modify other skills, system-wide settings, or request permanent elevated privileges.
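One way to enforce the "writes only under ~/.openclaw/workspace/WiseOCR" property is to resolve every output path and reject anything that lands outside that directory. The sketch below is an assumption about how such a guard could look, not a description of the skill's code; `safe_output_path` is a hypothetical name, and `Path.is_relative_to` requires Python 3.9 or later.

```python
from pathlib import Path

# Output directory as reported in the listing above.
OUTPUT_ROOT = Path.home() / ".openclaw" / "workspace" / "WiseOCR"


def safe_output_path(name: str, root: Path = OUTPUT_ROOT) -> Path:
    """Resolve `name` under `root`, rejecting anything that escapes it."""
    root = root.resolve()
    candidate = (root / name).resolve()
    if not candidate.is_relative_to(root):  # Python 3.9+
        raise ValueError(f"refusing to write outside {root}: {name}")
    return candidate
```

Resolving both paths first means `..` segments and absolute names are judged by where they actually point rather than by how they are spelled.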

How to Use

Make sure OpenClaw is installed (local or Docker).
Run the install command in chat: /install wisediag-medocr
Set the WISEDIAG_API_KEY environment variable, which SKILL.md and the script both require.
After installation, invoke the skill by name or use /wisediag-medocr
Provide a single PDF or image file; the returned Markdown is saved under ~/.openclaw/workspace/WiseOCR.

Version History

v1.0.26

- Skill renamed from wisediag-medocr to wiseocr.
- Expanded and clarified privacy and sensitive data warnings, including a detailed table and lists of prohibited data types.
- Updated API key acquisition link for clarity.
- Description updated for improved accuracy and readability; no functional changes to usage or features.

v1.0.25

- Updated SKILL.md metadata format: moved env_vars block outside credentials for consistency.
- No functional or feature changes in this release.

v1.0.24

- No functional changes in this version.
- Internal structure of SKILL.md updated: the "credentials" block was moved outside the "registry" section.
- All guides, features, and usage instructions remain unchanged.

v1.0.23

- No changes detected in this version.

v1.0.22

- Added support for image OCR: now accepts single image files (jpg, jpeg, png, webp, gif, bmp, tiff) in addition to PDFs.
- Updated usage instructions to reflect support for both PDF and image input.
- Revised description for broader file compatibility (PDF & Image OCR).
- Clarified input argument details to specify that only a single file is supported.

v1.0.21

- No changes detected in this version.
- All features, documentation, and privacy guidance remain the same.

v1.0.20

Version 1.0.20 of wisediag-medocr
- No file changes detected; documentation and functionality remain unchanged.
- Privacy warnings, usage instructions, and API details are consistent with the previous release.

v1.0.19

- Skill renamed from "wiseocr" to "wisediag-medocr".
- Shortened and clarified the description for general usage and feature highlights.
- Updated API key link to a new URL format.
- Added instruction to never call the API or HTTP endpoints directly; use only the provided script.
- Removed specific API endpoint details from documentation for better abstraction and safety.
- General wording and formatting improvements for clarity.

v1.0.18

- Renamed the skill from wisediag-medocr to wiseocr.
- Updated internal references to use the new skill name.
- No changes to core functionality or arguments.

v1.0.17

- No changes detected in this version; documentation and functionality remain the same.

v1.0.16

- Updated the skill description to clarify credential requirements and cloud upload behavior.
- Documentation now specifies the API endpoint used for OCR processing.
- Instructions emphasize using the provided script exclusively for authentication and file handling.
- Improved clarity around data privacy and file handling processes.

v1.0.15

Version 1.0.15 of wisediag-medocr
- No file changes detected in this release.
- Functionality and documentation remain unchanged from the previous version.

v1.0.14

- Shortened and clarified the skill description for easier understanding.
- Added explicit usage instructions: "Upload a PDF and say Use WiseOCR to OCR this."
- Provided a direct API key registration link with improved formatting.
- Updated guidance to specifically instruct users never to call APIs or HTTP endpoints directly; scripts must be used.
- Removed extra tracking parameters from URLs for cleaner presentation.

v1.0.13

No changes detected in this version.
- No file changes were found between versions.
- Functionality, documentation, and configuration remain the same.

v1.0.12

- Updated the API key registration link to include UTM tracking parameters for improved referral tracking.
- No other changes detected.

v1.0.11

- Updated documentation in SKILL.md and README.md for clarity, including clearer instructions and privacy warnings.
- Service endpoint information and documentation link added to SKILL.md.
- No changes to core functionality or API usage.

v1.0.10

No changes detected in this version.
- No file modifications, updates, or new features were introduced.
- All documentation and configuration remain the same as in the previous release.

v1.0.9

## v1.0.9: Security & Metadata Hardening

### 🔐 Credential Declaration (IMPORTANT)

This skill requires **one** environment variable to function:

- `WISEDIAG_API_KEY`: Required. Used to authenticate PDF uploads to the WiseDiag OCR API.

No other credentials, tokens, or secrets are needed or accessed.

The `registry.credentials` block in SKILL.md explicitly declares:

- `required: true`
- `env_vars: [WISEDIAG_API_KEY]`

If the platform registry metadata does not reflect this requirement, it is a sync/publishing issue; the source of truth is SKILL.md.

### 🛡️ Security Improvements

- **Removed `--service-url` CLI parameter:** The API endpoint is now hardcoded to `https://openapi.wisediag.com`. The script can no longer be redirected to arbitrary endpoints, eliminating the risk of unintended data exfiltration.
- **Added prominent Privacy Warning:** A ⚠️ Privacy Warning section is now the first thing users see in SKILL.md, clearly stating that files are uploaded to WiseDiag's cloud servers.
- **Explicit data flow documentation:** The Data Privacy section now lists exactly what happens to uploaded files in a numbered sequence.

### 📋 What This Skill Does (and Does NOT Do)

✅ Reads WISEDIAG_API_KEY from environment (single credential)
✅ Uploads user-provided PDF to https://openapi.wisediag.com/v1/ocr/pdf
✅ Saves returned Markdown to ~/.openclaw/workspace/WiseOCR/
✅ Only depends on `requests` (via requirements.txt)
❌ Does NOT read any other environment variables or credentials
❌ Does NOT access files outside the user-provided input PDF
❌ Does NOT allow overriding the API endpoint (hardcoded)
❌ Does NOT request elevated privileges or persistent access
❌ Does NOT modify other skills or system configurations

v1.0.8

- Updated API key setup instructions to use a new, shortened link for obtaining your key.
- Strengthened warnings to never call the OCR API directly; emphasized usage of the provided script only.
- No changes to code or functionality. Documentation update only.

v1.0.7

- Updated API key setup instructions to use the direct URL (removed redirect link).
- Clarified usage instructions: explicitly state that files must be processed with the provided script, not direct API calls.
- No functional or code changes in this release.

Metadata

Slug wisediag-medocr

Version 1.0.26

License MIT-0

All-time Installs 0

Active Installs 0

Total Versions 27

Frequently Asked Questions

What is WiseDiag MedOCR?

PDF & Image OCR — Convert a single PDF or image to Markdown via WiseDiag cloud API, with high-accuracy text extraction, table recognition, and multi-column l... It is an AI Agent Skill for Claude Code / OpenClaw, with 618 downloads so far.

How do I install WiseDiag MedOCR?

Run "/install wisediag-medocr" in the OpenClaw or Claude Code chat to install it in one step. Before first use, set the WISEDIAG_API_KEY environment variable, which SKILL.md and the script both require.

Is WiseDiag MedOCR free?

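The skill package is licensed under MIT-0, so you can download, install, and use it at no cost. It depends on WiseDiag's cloud API and a WISEDIAG_API_KEY; this listing does not say whether API usage is billed, so check WiseDiag's terms.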

Which platforms does WiseDiag MedOCR support?

WiseDiag MedOCR is cross-platform and runs anywhere OpenClaw / Claude Code is available.

Who created WiseDiag MedOCR?

It is built and maintained by wisediag (@wisediag); the current version is v1.0.26.
