← 返回 Skills 市场

Vision Recognition Ocr

Name: Vision Recognition Ocr
Author: wangziiiiii

作者 wangziiiiii · GitHub ↗ · v1.0.1 · MIT-0

cross-platform ⚠ suspicious

943

总下载

当前安装

版本数

在 OpenClaw 中安装

/install vision-recognition-ocr

功能描述

Vehicle/animal/plant recognition plus OCR for screenshots, photos, invoices, and tables. Use when users ask 识别车型/看图识别/提取文字/OCR. Supports local path, URL, and...

安全使用建议

This package will send whatever image you provide to Baidu's cloud OCR/classify endpoints and requires Baidu credentials. Before installing: (1) be aware the registry metadata omitted the required env vars — supply BAIDU_BCE_BEARER_TOKEN or API Key+Secret as documented in SKILL.md; (2) do not send sensitive images (personal documents, IDs, private photos) unless you trust Baidu and your account; (3) consider creating a limited/monitored Baidu account and API keys for this skill and rotate keys if needed; (4) if you want to be extra cautious, review the included scripts locally (they are short and readable) and run them in an isolated environment; (5) if the missing metadata concerns you, contact the skill publisher or avoid installing until the metadata matches the implementation.

功能分析

Type: OpenClaw Skill Name: vision-recognition-ocr Version: 1.0.1 The skill bundle is a legitimate integration for Baidu's Vision and OCR APIs, providing tools for image classification (animals, cars, plants) and text extraction. The scripts (e.g., `_baidu_image_classify.py`, `ocr_general_basic.py`) correctly handle authentication via environment variables and interact with official Baidu endpoints (aip.baidubce.com). No evidence of malicious behavior, data exfiltration, or prompt injection was found.

能力评估

ℹ Purpose & Capability

Name/description (vision recognition + OCR) match the code and SKILL.md: the scripts call Baidu image-classify and OCR endpoints and accept local path/URL/base64 images. However the registry metadata lists no required environment variables or credentials while the implementation clearly expects Baidu API credentials — an incoherence between metadata and actual capability.

✓ Instruction Scope

SKILL.md instructions and the Python scripts are scoped to classification and OCR tasks. They accept image_path/url/base64 and build requests to Baidu APIs; they do not attempt to read unrelated system files or call unexpected external endpoints beyond Baidu.

✓ Install Mechanism

No external install or remote downloads are used; the package contains local Python scripts. No extract-from-URL or third-party install steps are present. Scripts use the requests library (runtime dependency), which is normal.

⚠ Credentials

The code and SKILL.md require Baidu credentials (BAIDU_BCE_BEARER_TOKEN / BAIDU_API_KEY / BAIDU_VISION_API_KEY + secrets). Those credentials are proportionate to the declared purpose (accessing Baidu APIs), but the skill registry metadata incorrectly lists no required env vars or primary credential — this mismatch is a practical risk (you might install without realizing you must supply secrets). Also BAIDU_API_KEY is used in multiple fallback roles which could be confusing and lead to accidental credential exposure.

✓ Persistence & Privilege

Skill is not always-enabled; it does not request elevated system privileges and does not modify other skills or global agent settings. Autonomous invocation is allowed (platform default) but is not by itself a new risk here.

如何使用

确保已安装 OpenClaw（本地或 Docker 部署）
在对话框中输入安装命令：/install vision-recognition-ocr
安装完成后，直接呼叫该 Skill 的名称或使用 /vision-recognition-ocr 触发
根据 Skill 的参数说明提供必要输入，即可获得结构化输出

版本历史

v1.0.1

Sync latest local fixes and docs

v1.0.0

Launch public skill with clearer landing copy

元数据

Slug vision-recognition-ocr

版本 1.0.1

许可证 MIT-0

累计安装 9

当前安装数 8

历史版本数 2

常见问题

Vision Recognition Ocr 是什么？

Vehicle/animal/plant recognition plus OCR for screenshots, photos, invoices, and tables. Use when users ask 识别车型/看图识别/提取文字/OCR. Supports local path, URL, and... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件，目前累计下载 943 次。

如何安装 Vision Recognition Ocr？

在 OpenClaw 或 Claude Code 对话框中运行命令「/install vision-recognition-ocr」即可一键安装，无需额外配置。

Vision Recognition Ocr 是免费的吗？

是的，Vision Recognition Ocr 完全免费，采用 MIT-0 许可证，可自由下载、安装和使用。

Vision Recognition Ocr 支持哪些平台？

Vision Recognition Ocr 跨平台运行，可在任意部署了 OpenClaw / Claude Code 的环境中使用（cross-platform）。

谁开发了 Vision Recognition Ocr？

由 wangziiiiii（@wangziiiiii）开发并维护，当前版本 v1.0.1。