
WiseDiag MedOCR

Author: wisediag · GitHub · v1.0.26 · MIT-0
Cross-platform ✓ Security check passed
Total downloads: 618
Favorites: 1
Current installs: 0
Versions: 27
Install in OpenClaw
/install wisediag-medocr
Description
PDF & Image OCR — Convert a single PDF or image to Markdown via WiseDiag cloud API, with high-accuracy text extraction, table recognition, and multi-column l...
Security recommendations
This skill appears to do what it says: it uploads one file to WiseDiag's cloud OCR API and saves the returned Markdown locally. Before installing or using it, consider:
1) Do NOT upload sensitive documents. The SKILL.md explicitly warns against uploading IDs, financial, medical, or biometric data, credentials, or minors' data.
2) The script requires WISEDIAG_API_KEY. The package metadata omitted this, so be sure to set the environment variable rather than assume none are needed.
3) The script calls https://openapi.wisediag.com. Verify that this domain and WiseDiag's privacy and data-retention policy meet your requirements.
4) If you must process sensitive files, use an offline OCR alternative.
5) As a best practice, store the API key in a secure location (avoid committing it to dotfiles or repos).
Finally, if you want extra assurance, review network traffic or run the script in an isolated environment (VM/container) while testing with non-sensitive files.
Functional analysis
Type: OpenClaw Skill
Name: wisediag-medocr
Version: 1.0.26
The skill provides OCR functionality by uploading PDF or image files to the WiseDiag cloud API (openapi.wisediag.com) for conversion to Markdown. The implementation in scripts/wiseocr.py is transparent, using standard libraries like requests and pypdf, and includes clear privacy warnings in SKILL.md and README.md regarding the handling of sensitive data.
Capability assessment
Purpose & Capability
The skill name/description (WiseDiag OCR to Markdown) aligns with its behavior: the script uploads a single PDF/image to WiseDiag's OCR endpoint, receives a JSON result, and writes Markdown to ~/.openclaw/workspace/WiseOCR. Note: the top-level registry metadata in the package summary listed no required env vars, but the SKILL.md and the script both require WISEDIAG_API_KEY — this metadata omission is an inconsistency to be aware of.
Instruction Scope
SKILL.md instructs the user to set WISEDIAG_API_KEY and run the provided Python script. The script only reads the input file, optionally inspects PDF page count, uploads the file to https://openapi.wisediag.com/v1/ocr/pdf, and saves returned markdown locally. It does not access other system files, other credentials, or unusual endpoints. The privacy warning in SKILL.md accurately reflects the upload behavior.
Install Mechanism
There is no custom install step; the project includes a requirements.txt (requests, pypdf) and recommends pip install -r requirements.txt. No remote arbitrary installers, URL downloads, or packaged binaries are present. The code file is plain Python and straightforward to inspect.
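Before running the script, it can help to confirm that the two packages this listing names in requirements.txt are importable. The following is a minimal sketch using only the standard library; the helper name `missing_packages` is hypothetical and is not part of the skill.

```python
import importlib.util


def missing_packages(names):
    """Return the top-level package names that cannot be imported here."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    # requests and pypdf are the packages the listing says requirements.txt contains.
    missing = missing_packages(["requests", "pypdf"])
    if missing:
        print("Missing packages:", ", ".join(missing))
        print("Install them with: pip install -r requirements.txt")
    else:
        print("All listed dependencies are importable.")
```

The check only looks up top-level module names, so it does not import or execute the packages themselves.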
Credentials
The only runtime secret required is WISEDIAG_API_KEY, which is reasonable and proportional for a cloud OCR service. However, the registry-level 'Requirements' field (provided earlier in the package metadata) did not list this env var while SKILL.md and the script both require it — this mismatch could confuse users or automated tooling.
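Because the registry metadata does not list the variable, a small pre-flight check can catch a missing key before any file is processed. This is a sketch only: the function name `get_api_key` and the error message are hypothetical, and the way the actual script reads the variable is not shown on this page.

```python
import os


def get_api_key(env=None):
    """Return WISEDIAG_API_KEY from `env`, or raise if it is unset or blank."""
    env = os.environ if env is None else env
    key = env.get("WISEDIAG_API_KEY", "").strip()
    if not key:
        raise RuntimeError(
            "WISEDIAG_API_KEY is not set; export it before running the skill."
        )
    return key


if __name__ == "__main__":
    try:
        get_api_key()
        print("WISEDIAG_API_KEY is set.")
    except RuntimeError as exc:
        print(exc)
```

Passing a plain dictionary as `env` makes the check easy to exercise without touching the real environment.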
Persistence & Privilege
The skill runs on demand (always:false) and is user-invocable. It writes output only to a subdirectory under the user's home (~/.openclaw/workspace/WiseOCR). It does not modify other skills, system-wide settings, or request permanent elevated privileges.
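To see what the skill has written, the output directory named above can be inspected directly. The sketch below assumes the saved Markdown files use a `.md` extension, which this page does not confirm; the helper names `output_dir` and `list_markdown` are hypothetical.

```python
from pathlib import Path


def output_dir(home=None):
    """Build the WiseOCR output directory path under the given home directory."""
    home = Path.home() if home is None else Path(home)
    return home / ".openclaw" / "workspace" / "WiseOCR"


def list_markdown(home=None):
    """Return sorted names of *.md files in the output directory (assumed extension)."""
    directory = output_dir(home)
    if not directory.is_dir():
        return []
    return sorted(p.name for p in directory.glob("*.md"))


if __name__ == "__main__":
    print("Output directory:", output_dir())
    for name in list_markdown():
        print(" ", name)
```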
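How to use
  1. Make sure OpenClaw is installed (locally or via Docker).
  2. Enter the install command in the chat box: /install wisediag-medocr
  3. Once installation completes, invoke the Skill by name or trigger it with /wisediag-medocr
  4. Provide the inputs described in the Skill's parameter documentation to get structured output.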
Version history
v1.0.26
- Skill renamed from wisediag-medocr to wiseocr.
- Expanded and clarified privacy and sensitive data warnings, including a detailed table and lists of prohibited data types.
- Updated API key acquisition link for clarity.
- Description updated for improved accuracy and readability; no functional changes to usage or features.
v1.0.25
- Updated SKILL.md metadata format: moved env_vars block outside credentials for consistency.
- No functional or feature changes in this release.
v1.0.24
- No functional changes in this version.
- Internal structure of SKILL.md updated: the "credentials" block was moved outside the "registry" section.
- All guides, features, and usage instructions remain unchanged.
v1.0.23
- No changes detected in this version.
v1.0.22
- Added support for image OCR: now accepts single image files (jpg, jpeg, png, webp, gif, bmp, tiff) in addition to PDFs.
- Updated usage instructions to reflect support for both PDF and image input.
- Revised description for broader file compatibility (PDF & Image OCR).
- Clarified input argument details to specify that only a single file is supported.
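The accepted file types listed in this entry can be checked locally before anything is uploaded. This is a minimal sketch, not the script's own validation: the helper name `is_supported` is hypothetical, and treating extensions case-insensitively is an assumption.

```python
from pathlib import Path

# Extensions taken from the v1.0.22 notes, plus PDF.
SUPPORTED_EXTENSIONS = {
    ".pdf", ".jpg", ".jpeg", ".png", ".webp", ".gif", ".bmp", ".tiff",
}


def is_supported(path):
    """Return True if the file extension is one the release notes list."""
    return Path(path).suffix.lower() in SUPPORTED_EXTENSIONS


if __name__ == "__main__":
    for candidate in ["scan.pdf", "photo.JPEG", "notes.docx"]:
        status = "ok" if is_supported(candidate) else "unsupported"
        print(candidate, "->", status)
```

Only the extensions named in the release notes are included, so a variant such as `.tif` is rejected by this sketch.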
v1.0.21
- No changes detected in this version.
- All features, documentation, and privacy guidance remain the same.
v1.0.20
Version 1.0.20 of wisediag-medocr
- No file changes detected; documentation and functionality remain unchanged.
- Privacy warnings, usage instructions, and API details are consistent with the previous release.
v1.0.19
- Skill renamed from "wiseocr" to "wisediag-medocr"
- Shortened and clarified the description for general usage and feature highlights
- Updated API key link to a new URL format
- Added instruction to never call the API or HTTP endpoints directly; use only the provided script
- Removed specific API endpoint details from documentation for better abstraction and safety
- General wording and formatting improvements for clarity
v1.0.18
- Renamed the skill from wisediag-medocr to wiseocr.
- Updated internal references to use the new skill name.
- No changes to core functionality or arguments.
v1.0.17
- No changes detected in this version; documentation and functionality remain the same.
v1.0.16
- Updated the skill description to clarify credential requirements and cloud upload behavior.
- Documentation now specifies the API endpoint used for OCR processing.
- Instructions emphasize using the provided script exclusively for authentication and file handling.
- Improved clarity around data privacy and file handling processes.
v1.0.15
Version 1.0.15 of wisediag-medocr
- No file changes detected in this release.
- Functionality and documentation remain unchanged from the previous version.
v1.0.14
- Shortened and clarified the skill description for easier understanding.
- Added explicit usage instructions: "Upload a PDF and say Use WiseOCR to OCR this."
- Provided a direct API key registration link with improved formatting.
- Updated guidance to specifically instruct users never to call APIs or HTTP endpoints directly; scripts must be used.
- Removed extra tracking parameters from URLs for cleaner presentation.
v1.0.13
No changes detected in this version.
- No file changes were found between versions.
- Functionality, documentation, and configuration remain the same.
v1.0.12
- Updated the API key registration link to include UTM tracking parameters for improved referral tracking.
- No other changes detected.
v1.0.11
- Updated documentation in SKILL.md and README.md for clarity, including clearer instructions and privacy warnings.
- Service endpoint information and documentation link added to SKILL.md.
- No changes to core functionality or API usage.
v1.0.10
No changes detected in this version.
- No file modifications, updates, or new features were introduced.
- All documentation and configuration remain the same as in the previous release.
v1.0.9
## v1.0.9: Security & Metadata Hardening

### 🔐 Credential Declaration (IMPORTANT)

This skill requires **one** environment variable to function:

- `WISEDIAG_API_KEY`: Required. Used to authenticate PDF uploads to the WiseDiag OCR API.

No other credentials, tokens, or secrets are needed or accessed.

The `registry.credentials` block in SKILL.md explicitly declares:

- `required: true`
- `env_vars: [WISEDIAG_API_KEY]`

If the platform registry metadata does not reflect this requirement, it is a sync/publishing issue; the source of truth is SKILL.md.

### 🛡️ Security Improvements

- **Removed `--service-url` CLI parameter:** The API endpoint is now hardcoded to `https://openapi.wisediag.com`. The script can no longer be redirected to arbitrary endpoints, eliminating the risk of unintended data exfiltration.
- **Added prominent Privacy Warning:** A ⚠️ Privacy Warning section is now the first thing users see in SKILL.md, clearly stating that files are uploaded to WiseDiag's cloud servers.
- **Explicit data flow documentation:** The Data Privacy section now lists exactly what happens to uploaded files in a numbered sequence.

### 📋 What This Skill Does (and Does NOT Do)

✅ Reads WISEDIAG_API_KEY from environment (single credential)
✅ Uploads user-provided PDF to https://openapi.wisediag.com/v1/ocr/pdf
✅ Saves returned Markdown to ~/.openclaw/workspace/WiseOCR/
✅ Only depends on `requests` (via requirements.txt)
❌ Does NOT read any other environment variables or credentials
❌ Does NOT access files outside the user-provided input PDF
❌ Does NOT allow overriding the API endpoint (hardcoded)
❌ Does NOT request elevated privileges or persistent access
❌ Does NOT modify other skills or system configurations
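When reviewing captured network traffic, one way to confirm that requests go only to the hardcoded endpoint is to compare each observed URL's scheme and host against the documented value. The following is a sketch under that assumption; the helper name `is_expected_endpoint` is hypothetical and not part of the skill.

```python
from urllib.parse import urlsplit

# Host documented in the release notes as the hardcoded API endpoint.
EXPECTED_HOST = "openapi.wisediag.com"


def is_expected_endpoint(url):
    """Return True only for HTTPS URLs whose host is exactly EXPECTED_HOST."""
    parts = urlsplit(url)
    return parts.scheme == "https" and parts.hostname == EXPECTED_HOST


if __name__ == "__main__":
    observed = [
        "https://openapi.wisediag.com/v1/ocr/pdf",
        "https://openapi.wisediag.com.example.net/v1/ocr/pdf",
    ]
    for url in observed:
        print(url, "->", "expected" if is_expected_endpoint(url) else "UNEXPECTED")
```

Comparing the parsed hostname, rather than searching for the domain as a substring, avoids being fooled by lookalike hosts or by the domain appearing in a query string.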
v1.0.8
- Updated API key setup instructions to use a new, shortened link for obtaining your key.
- Strengthened warnings to never call the OCR API directly; emphasized usage of the provided script only.
- No changes to code or functionality. Documentation update only.
v1.0.7
- Updated API key setup instructions to use the direct URL (removed redirect link).
- Clarified usage instructions: explicitly state that files must be processed with the provided script, not direct API calls.
- No functional or code changes in this release.
Metadata
Slug: wisediag-medocr
Version: 1.0.26
License: MIT-0
Total installs: 0
Current installs: 0
Versions: 27
FAQ

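What is WiseDiag MedOCR?

PDF & Image OCR: Convert a single PDF or image to Markdown via WiseDiag cloud API, with high-accuracy text extraction, table recognition, and multi-column l... It is an AI Agent Skill plugin for Claude Code / OpenClaw, with 618 total downloads to date.

How do I install WiseDiag MedOCR?

Run the command "/install wisediag-medocr" in the OpenClaw or Claude Code chat box to install it in one step. There is no custom install step, but the WISEDIAG_API_KEY environment variable must be set before the skill can be used.

Is WiseDiag MedOCR free?

Yes. WiseDiag MedOCR is free and is released under the MIT-0 license, so it can be freely downloaded, installed, and used.

Which platforms does WiseDiag MedOCR support?

WiseDiag MedOCR is cross-platform and can be used in any environment where OpenClaw / Claude Code is deployed.

Who develops WiseDiag MedOCR?

It is developed and maintained by wisediag (@wisediag). The current version is v1.0.26.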
