← 返回 Skills 市场

WiseOCR

Name: WiseOCR
Author: wisediag

作者 wisediag · GitHub ↗ · v1.0.28 · MIT-0

cross-platform ✓ 安全检测通过

944

总下载

当前安装

版本数

在 OpenClaw 中安装

/install wiseocr

功能描述

PDF & Image OCR — Convert a single PDF or image to Markdown via WiseDiag cloud API, with high-accuracy text extraction, table recognition, and multi-column l...

安全使用建议

This skill appears to be what it says: a client that uploads a single PDF/image to WiseDiag and saves the returned Markdown locally. Before installing, consider the following: 1) The SKILL.md/README and script require WISEDIAG_API_KEY — set this only if you trust the WiseDiag service; the top-level registry metadata incorrectly omitted this requirement, so expect to provide the env var. 2) Do NOT upload sensitive/confidential documents — the skill uploads files to https://openapi.wisediag.com for processing. 3) Inspect the small script yourself (it's included) if you want to verify there are no extra network endpoints or telemetry; it uses only requests.post to the declared endpoint. 4) If you need offline processing for privacy, use a local OCR tool instead. 5) If you plan automated use, consider limiting the API key permissions on the WiseDiag console (if supported) and rotate keys regularly.

功能分析

Type: OpenClaw Skill Name: wiseocr Version: 1.0.28 The WiseOCR skill is a legitimate tool for converting PDFs and images to Markdown using the WiseDiag cloud API. The core script (scripts/wiseocr.py) performs standard file handling and HTTP requests to the documented endpoint (openapi.wisediag.com) without any signs of obfuscation, persistence, or unauthorized data access. Both SKILL.md and README.md include prominent privacy warnings regarding the cloud-based nature of the service and advise against processing sensitive personal information.

能力评估

✓ Purpose & Capability

Name, description, README, SKILL.md and the script all consistently implement a single-file PDF/image OCR that uploads to WiseDiag and returns Markdown. The code performs file validation, page counting, upload, and writing output to ~/.openclaw/workspace/WiseOCR — all proportional to an OCR client.

✓ Instruction Scope

SKILL.md instructs the user to set WISEDIAG_API_KEY and to run the provided script; the script only reads the provided input file, uses pypdf to count pages (for PDFs), uploads the file to the declared API endpoint, and writes back a markdown file. There are no instructions to read unrelated system files or exfiltrate other secrets. The README and SKILL.md explicitly warn about privacy and uploading sensitive documents.

✓ Install Mechanism

This is instruction-only with a small Python script and requirements.txt; there is no install spec that downloads arbitrary archives or runs code from unknown URLs. The only external dependency is requests and pypdf via pip — expected for a Python CLI.

ℹ Credentials

The script and docs require a single environment variable WISEDIAG_API_KEY (reasonable for a cloud OCR service). However, the registry metadata at the top of the provided summary lists 'Required env vars: none' and 'Primary credential: none', which contradicts SKILL.md/README and the script (which exits if WISEDIAG_API_KEY is unset). This appears to be a packaging/metadata inconsistency and should be corrected but does not change the proportionality of the credential.

✓ Persistence & Privilege

The skill is not marked always:true and does not modify other skills or system-wide agent settings. It writes output to a workspace directory under the user's home (~/.openclaw/workspace/WiseOCR) which is normal for a CLI tool. Autonomous invocation is allowed by default (expected) but that is not combined with broad unexpected privileges.

如何使用

确保已安装 OpenClaw（本地或 Docker 部署）
在对话框中输入安装命令：/install wiseocr
安装完成后，直接呼叫该 Skill 的名称或使用 /wiseocr 触发
根据 Skill 的参数说明提供必要输入，即可获得结构化输出

版本历史

v1.0.28

Version 1.0.28 of WiseOCR - No file changes detected in this release. - Functionality and documentation remain unchanged.

v1.0.27

- Updated the privacy warning section: Medical & health information is no longer explicitly listed as restricted data. - Documentation now omits references to medical records, diagnoses, prescriptions, and health insurance information in privacy and sensitive information warnings. - All guidance and usage instructions remain unchanged.

v1.0.26

wiseocr 1.0.26 - No code or documentation changes detected in this version. - Functionality and usage instructions remain the same.

v1.0.25

- Updated privacy warning: medical/health information examples removed from sensitive data list. - Clarified sensitive categories in warnings to match latest recommendations. - No functional or API changes; documentation update only.

v1.0.24

wiseocr 1.0.24 - No code or documentation changes detected in this release. - Functionality and privacy warnings remain unchanged.

v1.0.23

- Removed references to "Medical & health information" from the sensitive information warnings. - Updated privacy tables and lists to exclude medical-related examples. - No functional or code changes; documentation update only.

v1.0.22

- Updated API key management link in documentation to use the correct WiseDiag Console address. - No code or functional changes; documentation clarity improvement only.

v1.0.21

- Skill name updated from "wisediag-medocr" to "wiseocr" for consistency. - Description revised and slightly shortened for clarity; core OCR features remain unchanged. - No code changes or feature updates; documentation only. - All usage instructions, privacy warnings, and installation steps remain the same.

v1.0.20

No user-visible changes in this release. - No file changes detected since the previous version.

v1.0.19

- Added a detailed warning section specifying categories of personal sensitive information not to be uploaded (e.g., identity, financial, medical, biometric, credentials, minors' info). - Reinforced privacy guidance to recommend local/offline OCR for documents with any sensitive or regulated data. - Clarified existing privacy instructions with explicit examples and a warning table for better user understanding.

v1.0.18

No user-facing changes in this release. - No file changes or updates detected between versions. - Functionality and documentation remain the same.

v1.0.17

- Added support for image OCR (jpg, jpeg, png, webp, gif, bmp, tiff) alongside PDF. - Clarified usage: now processes a single PDF or image file per request. - Updated instructions for file types and usage examples. - Refined privacy notice and installation/setup instructions. - Minor rewording and structure improvements for clarity.

v1.0.16

No user-facing changes detected in this version. - No file or documentation updates present. - Functionality and usage remain the same as the previous release.

v1.0.15

- Improved privacy documentation: clarified data privacy and file handling steps in SKILL.md. - Updated API key setup instructions and registration link for easier access. - Streamlined usage instructions, emphasizing use of the provided script for processing. - Shortened and clarified the skill description to clearly state cloud upload and required credentials.

v1.0.14

- No changes detected in this version. - Functionality, documentation, and configuration remain unchanged.

v1.0.13

No changes were detected in this version. - No file changes or updates were made for version 1.0.13.

v1.0.12

- Added a prominent privacy warning about file uploads to WiseDiag's cloud servers. - Updated description to highlight support for table recognition, multi-column layouts, and improved text extraction. - Clarified that files should not be used if containing sensitive or confidential information unless the user trusts WiseDiag. - No code changes; documentation and safety guidance improvements only.

v1.0.11

- Clarified that PDF OCR is performed via WiseDiag's cloud API, with files uploaded for processing. - Added a new Data Privacy & Cost section explaining API usage fees and privacy implications. - Advised users to avoid uploading confidential documents unless they trust WiseDiag, and to consider offline alternatives for sensitive files. - Updated description to mention potential API costs and file upload requirement.

v1.0.10

Version 1.0.10 Changelog - Updated name and author details for consistency with upstream project. - Simplified and clarified usage instructions and API key setup. - Modified default output directory to `~/.openclaw/workspace/WiseOCR`. - Streamlined privacy notice and removed duplicate warnings. - Improved documentation layout and reduced repetition.

v1.0.9

- Updated SKILL.md to remove Chinese examples and keywords, using only English filenames and terms for usage instructions and examples. - Generalized usage documentation (e.g., example filenames like "medical_report" instead of "体检报告"). - No changes were made to the code or functionality.

元数据

Slug wiseocr

版本 1.0.28

许可证 MIT-0

累计安装 1

当前安装数 1

历史版本数 28

常见问题

WiseOCR 是什么？

PDF & Image OCR — Convert a single PDF or image to Markdown via WiseDiag cloud API, with high-accuracy text extraction, table recognition, and multi-column l... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件，目前累计下载 944 次。

如何安装 WiseOCR？

在 OpenClaw 或 Claude Code 对话框中运行命令「/install wiseocr」即可一键安装，无需额外配置。

WiseOCR 是免费的吗？

是的，WiseOCR 完全免费，采用 MIT-0 许可证，可自由下载、安装和使用。

WiseOCR 支持哪些平台？

WiseOCR 跨平台运行，可在任意部署了 OpenClaw / Claude Code 的环境中使用（cross-platform）。

谁开发了 WiseOCR？

由 wisediag（@wisediag）开发并维护，当前版本 v1.0.28。