Total downloads: 384
Favorites: 0
Current installs: 0
Versions: 1
Install in OpenClaw
/install general-ocr-struct
Description
Offline OCR that extracts Chinese/English text from screenshots and structures it into raw or cleaned rows and fields for receipts, tables, and statements.
Usage (SKILL.md)
General OCR Struct
Use this skill to separate OCR recognition from downstream content cleanup.
Workflow
- Run the local OCR script on the image first.
- Return the raw OCR text before making business interpretations when accuracy matters.
- If the image is a transaction-detail screenshot, run structuring mode to group rows into fields.
- Mark uncertain fields explicitly as 待确认 ("to be confirmed"); do not guess missing content.
- Only after the user confirms recognition quality, use the result for tables, summaries, or documents.
Commands
Raw OCR
python3 scripts/general_ocr.py raw /path/to/image.jpg
Structured transaction extraction
python3 scripts/general_ocr.py transactions /path/to/image.jpg
JSON output
python3 scripts/general_ocr.py transactions /path/to/image.jpg --json
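When another program drives the script rather than a person at a shell, the three invocations above could be wrapped roughly as follows. This is only a sketch: the helper names are hypothetical, and the shape of the `--json` output is not documented here, so the parser simply assumes the script prints a single JSON document to stdout.

```python
import json
import subprocess

SCRIPT = "scripts/general_ocr.py"  # path as given in the commands above


def raw_command(image: str) -> list[str]:
    """argv for the documented raw OCR invocation."""
    return ["python3", SCRIPT, "raw", image]


def transactions_command(image: str, as_json: bool = False) -> list[str]:
    """argv for the documented transactions invocation, optionally with --json."""
    cmd = ["python3", SCRIPT, "transactions", image]
    if as_json:
        cmd.append("--json")
    return cmd


def run_transactions_json(image: str):
    """Run structuring mode and parse its stdout.

    Assumption: with --json the script prints one JSON document to stdout.
    """
    proc = subprocess.run(
        transactions_command(image, as_json=True),
        capture_output=True,
        text=True,
        check=True,  # raise if the script exits with a non-zero status
    )
    return json.loads(proc.stdout)
```

Passing the arguments as a list avoids shell quoting problems with image paths that contain spaces.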
Output rules
- Prefer showing the recognition result first, then the cleaned structure.
- Preserve source wording where possible.
- For uncertain content, use 待确认 instead of inferring.
- Adapt the structure to the source image type. For statement-like screenshots, common fields are: card_last4, date, time, currency, merchant, amount.
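One way to picture these rules is a row type in which every field defaults to the 待确认 marker, so anything OCR did not clearly supply stays flagged instead of being guessed. The sketch below is illustrative only: `StatementRow`, `make_row`, and `pending_fields` are hypothetical names, not part of the skill's script.

```python
from dataclasses import asdict, dataclass

PENDING = "待确认"  # marker for fields that still need user confirmation


@dataclass
class StatementRow:
    """Common fields for statement-like screenshots; all default to PENDING."""
    card_last4: str = PENDING
    date: str = PENDING
    time: str = PENDING
    currency: str = PENDING
    merchant: str = PENDING
    amount: str = PENDING


def make_row(**fields: str) -> StatementRow:
    """Build a row, leaving empty or missing values as PENDING."""
    known = {k: v for k, v in fields.items() if v not in (None, "")}
    return StatementRow(**known)


def pending_fields(row: StatementRow) -> list[str]:
    """Names of the fields that are still marked PENDING."""
    return [name for name, value in asdict(row).items() if value == PENDING]
```

Values are kept as strings so that the source wording (for example an amount as printed) is preserved rather than normalized.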
Notes
- This skill uses RapidOCR locally.
- First install may need Python packages; after setup it runs offline.
- If OCR quality is weak, request a higher-resolution original screenshot before doing deeper cleanup.
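A rough way to decide whether OCR quality is weak is to look at the per-line confidence scores that RapidOCR reports. The sketch below assumes the `rapidocr_onnxruntime` call convention in which `RapidOCR()(image)` returns `(result, elapse)` and each result item is `[box, text, score]`; check this against the installed version. The 0.8 threshold is an arbitrary illustration, and nothing here is taken from the skill's own script.

```python
def recognize(image_path: str) -> list[tuple[str, float]]:
    """Return (text, score) pairs for each recognized line.

    Assumption: rapidocr_onnxruntime returns (result, elapse), where result
    is None or a list of [box, text, score] items.
    """
    from rapidocr_onnxruntime import RapidOCR  # imported lazily; third-party

    engine = RapidOCR()
    result, _elapsed = engine(image_path)
    return [(text, float(score)) for _box, text, score in (result or [])]


def low_confidence_lines(
    lines: list[tuple[str, float]], threshold: float = 0.8
) -> list[str]:
    """Texts whose score falls below the (hypothetical) threshold."""
    return [text for text, score in lines if score < threshold]
```

If `low_confidence_lines(recognize(path))` returns many entries, that is a reasonable cue to ask for a higher-resolution original before structuring anything.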
Security recommendations
This skill appears coherent and runs OCR locally, but you should: (1) confirm rapidocr_onnxruntime is installed from a trusted source (pip/official release) because the script will import and execute that package locally; (2) verify your host's RapidOCR runtime does not auto-download models or call the network if you require strictly offline operation; (3) only run the script on images you are comfortable processing (it will read the image file you pass); and (4) test it in a controlled environment before using with sensitive financial or personal documents to validate the heuristics and ensure no unexpected behavior.
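For point (1), a quick first check is to see whether the package is importable and where it is installed, without actually importing it. The following is a minimal sketch using only the standard library; `describe_package` is a hypothetical helper, and the version lookup may return None when the distribution name differs from the import name.

```python
import importlib.util
from importlib import metadata


def describe_package(name: str) -> dict:
    """Report whether a top-level package is installed, without importing it."""
    spec = importlib.util.find_spec(name)
    if spec is None:
        return {"installed": False, "version": None, "location": None}
    try:
        version = metadata.version(name)
    except metadata.PackageNotFoundError:
        # Not installed as a distribution under this exact name.
        version = None
    return {"installed": True, "version": version, "location": spec.origin}


if __name__ == "__main__":
    print(describe_package("rapidocr_onnxruntime"))
```

The reported location shows which environment the package comes from, but it does not prove the package is trustworthy; the install source still has to be verified separately.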
Functional analysis
Type: OpenClaw Skill
Name: general-ocr-struct
Version: 0.1.0
The skill provides local OCR and transaction data structuring using the RapidOCR library. Analysis of scripts/general_ocr.py shows safe path handling via pathlib and purely heuristic-based data processing (regex) without any network activity, credential harvesting, or suspicious execution patterns. The instructions in SKILL.md and references.md are strictly aligned with the stated purpose of image-to-text conversion and do not contain any malicious prompt injection or exfiltration commands.
Capability assessment
Purpose & Capability
Name/description (offline OCR + structuring) match the included script and SKILL.md. The script runs RapidOCR locally, extracts lines, and heuristically structures transaction-like rows—behavior aligns with the stated purpose.
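To give a sense of what heuristic, regex-based structuring of a transaction-like row can look like, here is a small sketch. It is not the logic in scripts/general_ocr.py: the patterns, the currency list, and the single-line input format are all assumptions made for illustration, and card_last4 is left out for brevity.

```python
import re

PENDING = "待确认"  # marker for fields that could not be recognized

# Hypothetical patterns, tried in order; each match is removed from the line.
PATTERNS = [
    ("date", re.compile(r"\d{4}[-/.]\d{1,2}[-/.]\d{1,2}")),
    ("time", re.compile(r"\d{1,2}:\d{2}(?::\d{2})?")),
    ("currency", re.compile(r"(?<![A-Za-z])(?:CNY|USD|EUR|HKD|JPY)(?![A-Za-z])")),
    ("amount", re.compile(r"-?\d+(?:,\d{3})*\.\d{2}")),
]


def structure_line(line: str) -> dict[str, str]:
    """Split one OCR line into fields; unmatched fields become PENDING."""
    remainder = line
    row: dict[str, str] = {}
    for name, pattern in PATTERNS:
        match = pattern.search(remainder)
        if match:
            row[name] = match.group(0)
            # Cut the matched text out so later patterns cannot reuse it.
            remainder = remainder[: match.start()] + " " + remainder[match.end():]
        else:
            row[name] = PENDING
    # Whatever text is left over is treated as the merchant name.
    merchant = " ".join(remainder.split())
    row["merchant"] = merchant or PENDING
    return row
```

Removing each match before trying the next pattern keeps, for example, a dotted date from being misread as an amount.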
Instruction Scope
SKILL.md instructs only local usage of the provided Python script on user-supplied image paths, returning OCR text or structured transactions. The instructions do not ask the agent to read unrelated files, environment variables, or send data externally.
Install Mechanism
This is an instruction-only skill (no install spec). The Python script depends on the third‑party package rapidocr_onnxruntime; SKILL.md mentions installing Python packages but the registry entry does not provide an automated install step. This is low risk but requires the host to have a trusted RapidOCR runtime installed.
Credentials
No environment variables, credentials, or config paths are requested. The script only reads the image path provided by the caller. No disproportionate access is requested.
Persistence & Privilege
The skill is not always-enabled and does not modify other skills or system settings. It operates only when invoked and has no elevated persistence requirements.
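How to use
- Make sure OpenClaw is installed (locally or via Docker).
- Enter the install command in the chat box: /install general-ocr-struct
- Once installed, call the Skill by name or trigger it with /general-ocr-struct
- Provide the inputs described in the Skill's parameter notes to get structured output.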
Version history
v0.1.0
Initial release: local RapidOCR-based OCR with raw extraction and transaction-style structuring workflow.
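FAQ
What is General OCR Struct?
Offline OCR that extracts Chinese/English text from screenshots and structures it into raw or cleaned rows and fields for receipts, tables, and statements. It is an AI Agent Skill plugin for Claude Code / OpenClaw, with 384 downloads to date.
How do I install General OCR Struct?
Run the command /install general-ocr-struct in the OpenClaw or Claude Code chat box to install it in one step. The Skill itself needs no extra configuration, but the host must have the rapidocr_onnxruntime Python package installed.
Is General OCR Struct free?
Yes. General OCR Struct is completely free under the MIT-0 license and can be freely downloaded, installed, and used.
Which platforms does General OCR Struct support?
General OCR Struct is cross-platform and runs in any environment where OpenClaw / Claude Code is deployed.
Who developed General OCR Struct?
It is developed and maintained by JY (@9penny); the current version is v0.1.0.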