← Back to Skills Marketplace
wisediag

WiseDiag MedOCR

by wisediag · GitHub ↗ · v1.0.26 · MIT-0
cross-platform ✓ Security Clean
618
Downloads
1
Stars
0
Active Installs
27
Versions
Install in OpenClaw
/install wisediag-medocr
Description
PDF & Image OCR — Convert a single PDF or image to Markdown via WiseDiag cloud API, with high-accuracy text extraction, table recognition, and multi-column l...
Usage Guidance
This skill appears to do what it says: it uploads one file to WiseDiag's cloud OCR API and saves the returned Markdown locally. Before installing or using it, consider: 1) Do NOT upload sensitive documents — the SKILL.md explicitly warns against IDs, financial, medical, biometric, credentials, or minor data. 2) The script requires WISEDIAG_API_KEY; the package metadata omitted this, so be sure to set the environment variable rather than assume none are needed. 3) The script calls https://openapi.wisediag.com; verify that domain and WiseDiag's privacy/data-retention policy meet your requirements. 4) If you must process sensitive files, use an offline OCR alternative. 5) As a best practice, set the API key in a secure location (avoid committing it to dotfiles or repos). Finally, if you want extra assurance, review network traffic or run the script in an isolated environment (VM/container) while testing with non-sensitive files.
Capability Analysis
Type: OpenClaw Skill Name: wisediag-medocr Version: 1.0.26 The skill provides OCR functionality by uploading PDF or image files to the WiseDiag cloud API (openapi.wisediag.com) for conversion to Markdown. The implementation in scripts/wiseocr.py is transparent, using standard libraries like requests and pypdf, and includes clear privacy warnings in SKILL.md and README.md regarding the handling of sensitive data.
Capability Assessment
Purpose & Capability
The skill name/description (WiseDiag OCR to Markdown) aligns with its behavior: the script uploads a single PDF/image to WiseDiag's OCR endpoint, receives a JSON result, and writes Markdown to ~/.openclaw/workspace/WiseOCR. Note: the top-level registry metadata in the package summary listed no required env vars, but the SKILL.md and the script both require WISEDIAG_API_KEY — this metadata omission is an inconsistency to be aware of.
Instruction Scope
SKILL.md instructs the user to set WISEDIAG_API_KEY and run the provided Python script. The script only reads the input file, optionally inspects PDF page count, uploads the file to https://openapi.wisediag.com/v1/ocr/pdf, and saves returned markdown locally. It does not access other system files, other credentials, or unusual endpoints. The privacy warning in SKILL.md accurately reflects the upload behavior.
Install Mechanism
There is no custom install step; the project includes a requirements.txt (requests, pypdf) and recommends pip install -r requirements.txt. No remote arbitrary installers, URL downloads, or packaged binaries are present. The code file is plain Python and straightforward to inspect.
Credentials
The only runtime secret required is WISEDIAG_API_KEY, which is reasonable and proportional for a cloud OCR service. However, the registry-level 'Requirements' field (provided earlier in the package metadata) did not list this env var while SKILL.md and the script both require it — this mismatch could confuse users or automated tooling.
Persistence & Privilege
The skill runs on demand (always:false) and is user-invocable. It writes output only to a subdirectory under the user's home (~/.openclaw/workspace/WiseOCR). It does not modify other skills, system-wide settings, or request permanent elevated privileges.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install wisediag-medocr
  3. After installation, invoke the skill by name or use /wisediag-medocr
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.26
- Skill renamed from wisediag-medocr to wiseocr. - Expanded and clarified privacy and sensitive data warnings, including a detailed table and lists of prohibited data types. - Updated API key acquisition link for clarity. - Description updated for improved accuracy and readability; no functional changes to usage or features.
v1.0.25
- Updated SKILL.md metadata format: moved env_vars block outside credentials for consistency. - No functional or feature changes in this release.
v1.0.24
- No functional changes in this version. - Internal structure of SKILL.md updated: the "credentials" block was moved outside the "registry" section. - All guides, features, and usage instructions remain unchanged.
v1.0.23
- No changes detected in this version.
v1.0.22
- Added support for image OCR: now accepts single image files (jpg, jpeg, png, webp, gif, bmp, tiff) in addition to PDFs. - Updated usage instructions to reflect support for both PDF and image input. - Revised description for broader file compatibility (PDF & Image OCR). - Clarified input argument details to specify that only a single file is supported.
v1.0.21
- No changes detected in this version. - All features, documentation, and privacy guidance remain the same.
v1.0.20
Version 1.0.20 of wisediag-medocr - No file changes detected; documentation and functionality remain unchanged. - Privacy warnings, usage instructions, and API details are consistent with the previous release.
v1.0.19
- Skill renamed from "wiseocr" to "wisediag-medocr" - Shortened and clarified the description for general usage and feature highlights - Updated API key link to a new URL format - Added instruction to never call the API or HTTP endpoints directly; use only the provided script - Removed specific API endpoint details from documentation for better abstraction and safety - General wording and formatting improvements for clarity
v1.0.18
- Renamed the skill from wisediag-medocr to wiseocr. - Updated internal references to use the new skill name. - No changes to core functionality or arguments.
v1.0.17
- No changes detected in this version; documentation and functionality remain the same.
v1.0.16
- Updated the skill description to clarify credential requirements and cloud upload behavior. - Documentation now specifies the API endpoint used for OCR processing. - Instructions emphasize using the provided script exclusively for authentication and file handling. - Improved clarity around data privacy and file handling processes.
v1.0.15
Version 1.0.15 of wisediag-medocr - No file changes detected in this release. - Functionality and documentation remain unchanged from the previous version.
v1.0.14
- Shortened and clarified the skill description for easier understanding. - Added explicit usage instructions: "Upload a PDF and say Use WiseOCR to OCR this." - Provided a direct API key registration link with improved formatting. - Updated guidance to specifically instruct users never to call APIs or HTTP endpoints directly—scripts must be used. - Removed extra tracking parameters from URLs for cleaner presentation.
v1.0.13
No changes detected in this version. - No file changes were found between versions. - Functionality, documentation, and configuration remain the same.
v1.0.12
- Updated the API key registration link to include UTM tracking parameters for improved referral tracking. - No other changes detected.
v1.0.11
- Updated documentation in SKILL.md and README.md for clarity, including clearer instructions and privacy warnings. - Service endpoint information and documentation link added to SKILL.md. - No changes to core functionality or API usage.
v1.0.10
No changes detected in this version. - No file modifications, updates, or new features were introduced. - All documentation and configuration remain the same as in the previous release.
v1.0.9
## v1.0.9 — Security & Metadata Hardening ### 🔐 Credential Declaration (IMPORTANT) This skill requires **one** environment variable to function: - `WISEDIAG_API_KEY` — Required. Used to authenticate PDF uploads to the WiseDiag OCR API. No other credentials, tokens, or secrets are needed or accessed. The `registry.credentials` block in SKILL.md explicitly declares: - `required: true` - `env_vars: [WISEDIAG_API_KEY]` If the platform registry metadata does not reflect this requirement, it is a sync/publishing issue — the source of truth is SKILL.md. ### 🛡️ Security Improvements - **Removed `--service-url` CLI parameter:** The API endpoint is now hardcoded to `https://openapi.wisediag.com`. The script can no longer be redirected to arbitrary endpoints, eliminating the risk of unintended data exfiltration. - **Added prominent Privacy Warning:** A ⚠️ Privacy Warning section is now the first thing users see in SKILL.md, clearly stating that files are uploaded to WiseDiag's cloud servers. - **Explicit data flow documentation:** The Data Privacy section now lists exactly what happens to uploaded files in a numbered sequence. ### 📋 What This Skill Does (and Does NOT Do) ✅ Reads WISEDIAG_API_KEY from environment (single credential) ✅ Uploads user-provided PDF to https://openapi.wisediag.com/v1/ocr/pdf ✅ Saves returned Markdown to ~/.openclaw/workspace/WiseOCR/ ✅ Only depends on `requests` (via requirements.txt) ❌ Does NOT read any other environment variables or credentials ❌ Does NOT access files outside the user-provided input PDF ❌ Does NOT allow overriding the API endpoint (hardcoded) ❌ Does NOT request elevated privileges or persistent access ❌ Does NOT modify other skills or system configurations
v1.0.8
- Updated API key setup instructions to use a new, shortened link for obtaining your key. - Strengthened warnings to never call the OCR API directly; emphasized usage of the provided script only. - No changes to code or functionality. Documentation update only.
v1.0.7
- Updated API key setup instructions to use the direct URL (removed redirect link). - Clarified usage instructions: explicitly state that files must be processed with the provided script, not direct API calls. - No functional or code changes in this release.
Metadata
Slug wisediag-medocr
Version 1.0.26
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 27
Frequently Asked Questions

What is WiseDiag MedOCR?

PDF & Image OCR — Convert a single PDF or image to Markdown via WiseDiag cloud API, with high-accuracy text extraction, table recognition, and multi-column l... It is an AI Agent Skill for Claude Code / OpenClaw, with 618 downloads so far.

How do I install WiseDiag MedOCR?

Run "/install wisediag-medocr" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is WiseDiag MedOCR free?

Yes, WiseDiag MedOCR is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does WiseDiag MedOCR support?

WiseDiag MedOCR is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created WiseDiag MedOCR?

It is built and maintained by wisediag (@wisediag); the current version is v1.0.26.

💬 Comments