← 返回 Skills 市场
wangziiiiii

Vision Recognition Ocr

作者 wangziiiiii · GitHub ↗ · v1.0.1 · MIT-0
cross-platform ⚠ suspicious
943
总下载
1
收藏
8
当前安装
2
版本数
在 OpenClaw 中安装
/install vision-recognition-ocr
功能描述
Vehicle/animal/plant recognition plus OCR for screenshots, photos, invoices, and tables. Use when users ask 识别车型/看图识别/提取文字/OCR. Supports local path, URL, and...
安全使用建议
This package will send whatever image you provide to Baidu's cloud OCR/classify endpoints and requires Baidu credentials. Before installing: (1) be aware the registry metadata omitted the required env vars — supply BAIDU_BCE_BEARER_TOKEN or API Key+Secret as documented in SKILL.md; (2) do not send sensitive images (personal documents, IDs, private photos) unless you trust Baidu and your account; (3) consider creating a limited/monitored Baidu account and API keys for this skill and rotate keys if needed; (4) if you want to be extra cautious, review the included scripts locally (they are short and readable) and run them in an isolated environment; (5) if the missing metadata concerns you, contact the skill publisher or avoid installing until the metadata matches the implementation.
功能分析
Type: OpenClaw Skill Name: vision-recognition-ocr Version: 1.0.1 The skill bundle is a legitimate integration for Baidu's Vision and OCR APIs, providing tools for image classification (animals, cars, plants) and text extraction. The scripts (e.g., `_baidu_image_classify.py`, `ocr_general_basic.py`) correctly handle authentication via environment variables and interact with official Baidu endpoints (aip.baidubce.com). No evidence of malicious behavior, data exfiltration, or prompt injection was found.
能力评估
Purpose & Capability
Name/description (vision recognition + OCR) match the code and SKILL.md: the scripts call Baidu image-classify and OCR endpoints and accept local path/URL/base64 images. However the registry metadata lists no required environment variables or credentials while the implementation clearly expects Baidu API credentials — an incoherence between metadata and actual capability.
Instruction Scope
SKILL.md instructions and the Python scripts are scoped to classification and OCR tasks. They accept image_path/url/base64 and build requests to Baidu APIs; they do not attempt to read unrelated system files or call unexpected external endpoints beyond Baidu.
Install Mechanism
No external install or remote downloads are used; the package contains local Python scripts. No extract-from-URL or third-party install steps are present. Scripts use the requests library (runtime dependency), which is normal.
Credentials
The code and SKILL.md require Baidu credentials (BAIDU_BCE_BEARER_TOKEN / BAIDU_API_KEY / BAIDU_VISION_API_KEY + secrets). Those credentials are proportionate to the declared purpose (accessing Baidu APIs), but the skill registry metadata incorrectly lists no required env vars or primary credential — this mismatch is a practical risk (you might install without realizing you must supply secrets). Also BAIDU_API_KEY is used in multiple fallback roles which could be confusing and lead to accidental credential exposure.
Persistence & Privilege
Skill is not always-enabled; it does not request elevated system privileges and does not modify other skills or global agent settings. Autonomous invocation is allowed (platform default) but is not by itself a new risk here.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install vision-recognition-ocr
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /vision-recognition-ocr 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.1
Sync latest local fixes and docs
v1.0.0
Launch public skill with clearer landing copy
元数据
Slug vision-recognition-ocr
版本 1.0.1
许可证 MIT-0
累计安装 9
当前安装数 8
历史版本数 2
常见问题

Vision Recognition Ocr 是什么?

Vehicle/animal/plant recognition plus OCR for screenshots, photos, invoices, and tables. Use when users ask 识别车型/看图识别/提取文字/OCR. Supports local path, URL, and... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 943 次。

如何安装 Vision Recognition Ocr?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install vision-recognition-ocr」即可一键安装,无需额外配置。

Vision Recognition Ocr 是免费的吗?

是的,Vision Recognition Ocr 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。

Vision Recognition Ocr 支持哪些平台?

Vision Recognition Ocr 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 Vision Recognition Ocr?

由 wangziiiiii(@wangziiiiii)开发并维护,当前版本 v1.0.1。

💬 留言讨论