← 返回 Skills 市场

General OCR Struct

Name: General OCR Struct
Author: 9penny

作者 JY · GitHub ↗ · v0.1.0 · MIT-0

cross-platform ✓ 安全检测通过

384

总下载

当前安装

版本数

在 OpenClaw 中安装

/install general-ocr-struct

功能描述

Offline OCR extracting and structuring Chinese/English screenshot text into raw or cleaned rows and fields for receipts, tables, and statements.

使用说明 (SKILL.md)

General OCR Struct

Use this skill to separate OCR recognition from downstream content整理.

Workflow

Run the local OCR script on the image first.
Return the raw OCR text before making business interpretations when accuracy matters.
If the image is a transaction-detail screenshot, run structuring mode to group rows into fields.
Mark uncertain fields explicitly as 待确认; do not guess missing content.
Only after the user confirms recognition quality, use the result for tables, summaries, or documents.

Commands

Raw OCR

python3 scripts/general_ocr.py raw /path/to/image.jpg

Structured transaction extraction

python3 scripts/general_ocr.py transactions /path/to/image.jpg

JSON output

python3 scripts/general_ocr.py transactions /path/to/image.jpg --json

Output rules

Prefer showing the recognition result first, then the cleaned structure.
Preserve source wording where possible.
For uncertain content, use 待确认 instead of inferring.
Adapt the structure to the source image type. For statement-like screenshots, common fields are: card_last4, date, time, currency, merchant, amount.

Notes

This skill uses RapidOCR locally.
First install may need Python packages; after setup it runs offline.
If OCR quality is weak, request a higher-resolution original screenshot before doing deeper整理.

安全使用建议

This skill appears coherent and runs OCR locally, but you should: (1) confirm rapidocr_onnxruntime is installed from a trusted source (pip/official release) because the script will import and execute that package locally; (2) verify your host's RapidOCR runtime does not auto-download models or call the network if you require strictly offline operation; (3) only run the script on images you are comfortable processing (it will read the image file you pass); and (4) test it in a controlled environment before using with sensitive financial or personal documents to validate the heuristics and ensure no unexpected behavior.

功能分析

Type: OpenClaw Skill Name: general-ocr-struct Version: 0.1.0 The skill provides local OCR and transaction data structuring using the RapidOCR library. Analysis of scripts/general_ocr.py shows safe path handling via pathlib and purely heuristic-based data processing (regex) without any network activity, credential harvesting, or suspicious execution patterns. The instructions in SKILL.md and references.md are strictly aligned with the stated purpose of image-to-text conversion and do not contain any malicious prompt injection or exfiltration commands.

能力评估

✓ Purpose & Capability

Name/description (offline OCR + structuring) match the included script and SKILL.md. The script runs RapidOCR locally, extracts lines, and heuristically structures transaction-like rows—behavior aligns with the stated purpose.

✓ Instruction Scope

SKILL.md instructs only local usage of the provided Python script on user-supplied image paths, returning OCR text or structured transactions. The instructions do not ask the agent to read unrelated files, environment variables, or send data externally.

ℹ Install Mechanism

This is an instruction-only skill (no install spec). The Python script depends on the third‑party package rapidocr_onnxruntime; SKILL.md mentions installing Python packages but the registry entry does not provide an automated install step. This is low risk but requires the host to have a trusted RapidOCR runtime installed.

✓ Credentials

No environment variables, credentials, or config paths are requested. The script only reads the image path provided by the caller. No disproportionate access is requested.

✓ Persistence & Privilege

The skill is not always-enabled and does not modify other skills or system settings. It operates only when invoked and has no elevated persistence requirements.

如何使用

确保已安装 OpenClaw（本地或 Docker 部署）
在对话框中输入安装命令：/install general-ocr-struct
安装完成后，直接呼叫该 Skill 的名称或使用 /general-ocr-struct 触发
根据 Skill 的参数说明提供必要输入，即可获得结构化输出

版本历史

v0.1.0

Initial release: local RapidOCR-based OCR with raw extraction and transaction-style structuring workflow.

元数据

Slug general-ocr-struct

版本 0.1.0

许可证 MIT-0

累计安装 0

当前安装数 0

历史版本数 1

常见问题