← 返回 Skills 市场
1752
总下载
0
收藏
10
当前安装
2
版本数
在 OpenClaw 中安装
/install openocr-skill
功能描述
Extract text from images, documents and scanned PDFs using OpenOCR - a lightweight and efficient OCR system with document parsing model requiring only 0.1B parameters, capable of running recognition on personal PCs. Supports text detection, recognition, universal VLM recognition, and document parsing with layout analysis
安全使用建议
This skill appears to do what it claims (local OCR and document parsing). Before installing/using it: (1) confirm you have a Python environment and the required ML runtimes (ONNX/Torch) or be prepared to install them; (2) expect the skill to auto-download model weights — check the OpenOCR project's GitHub and verify the model download URLs and integrity or supply model files yourself to avoid unexpected network fetches; (3) be aware the agent will need permission to read the image/PDF files you give it and to write cache/model files to disk; and (4) if you need higher assurance, ask the publisher for an explicit install spec and the exact model-hosting URLs (or use only local model paths).
功能分析
Type: OpenClaw Skill
Name: openocr-skill
Version: 0.1.6
The skill bundle is a wrapper for the OpenOCR library, designed for text extraction and document parsing. It explicitly declares the need for `code_execution` and `file_operations` tools, which are used transparently for tasks like processing images/PDFs and saving output files. The `SKILL.md` documentation provides functional instructions and examples, without any evidence of prompt injection attempts, data exfiltration, malicious execution, persistence mechanisms, or obfuscation. While the underlying library's Gradio demos can bind to `0.0.0.0`, this is a documented feature for a demo interface, not indicative of malicious intent within the skill bundle itself.
能力评估
Purpose & Capability
The name/description (OpenOCR OCR, document parsing, VLM recognition) matches the instructions: code examples show initializing OpenOCR for detection/recognition/doc parsing and working with image/pdf paths. There are no unrelated credentials, binaries, or config paths requested.
Instruction Scope
SKILL.md contains only OCR-related instructions and Python examples. It repeatedly relies on passing image/PDF file paths and on auto-downloading models; it does not instruct accessing unrelated system files or secrets. However, the instructions assume ability to run Python code, read/write files, and fetch model binaries from the network.
Install Mechanism
This is an instruction-only skill with no install spec. The skill examples expect a Python package ('openocr') and optional ONNX/Torch backends and include 'auto_download' of models. Because no install/source-of-models is declared, model and dependency downloads would be performed dynamically by the runtime — verify where those downloads come from and that you trust the source.
Credentials
The skill requests no environment variables, credentials, or config paths. It only needs file access to user-provided images/PDFs and network access to fetch models (if auto_download is used), which is proportional to OCR functionality.
Persistence & Privilege
The skill does not ask to be always-enabled, does not declare persistence or modify other skills, and requires no long-lived credentials. It does instruct writing/reading model and cache files locally, which is typical for model-based tools.
如何使用
- 确保已安装 OpenClaw(本地或 Docker 部署)
- 在对话框中输入安装命令:
/install openocr-skill - 安装完成后,直接呼叫该 Skill 的名称或使用
/openocr-skill触发 - 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v0.1.6
No file changes detected in this version.
- Version and metadata remain unchanged.
- No updates or modifications to functionality or documentation.
v0.1.5
- Improved SKILL.md documentation with detailed configuration, usage examples, and command-line instructions.
- Expanded task descriptions and parameter options for all supported OpenOCR task types.
- Added ready-to-use code snippets for text detection, recognition, end-to-end OCR, universal recognition, and document parsing.
- Included new instructions for saving outputs in multiple formats (Markdown, JSON, visualization).
- Provided clear guidance for both image and PDF input handling.
元数据
常见问题
openocr-skill 是什么?
Extract text from images, documents and scanned PDFs using OpenOCR - a lightweight and efficient OCR system with document parsing model requiring only 0.1B parameters, capable of running recognition on personal PCs. Supports text detection, recognition, universal VLM recognition, and document parsing with layout analysis. 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 1752 次。
如何安装 openocr-skill?
在 OpenClaw 或 Claude Code 对话框中运行命令「/install openocr-skill」即可一键安装,无需额外配置。
openocr-skill 是免费的吗?
是的,openocr-skill 完全免费(开源免费),可自由下载、安装和使用。
openocr-skill 支持哪些平台?
openocr-skill 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。
谁开发了 openocr-skill?
由 topduke(@topdu)开发并维护,当前版本 v0.1.6。
推荐 Skills