TencentCloud OCR

Name: TencentCloud OCR
Author: zt1314p-design

Description

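A skill package for Tencent Cloud General Text Recognition, high-accuracy version (GeneralAccurateOCR). The skill should be invoked automatically when a user sends or pastes an image, provides an image URL, or asks to have the text in an image recognized. It detects and recognizes text across the whole image, supports Chinese, English, mixed Chinese and English, digits, and special characters, and returns the position of each text box along with its content. It is suited to images with dense text, complex layouts, and high requirements for recognition precision and recall...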
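As a rough illustration of the "text box position plus content" output described above, the sketch below pulls text and polygon corners out of a GeneralAccurateOCR-style response. The helper name `extract_text_boxes` is hypothetical and the sample data is made up; only the `TextDetections`, `DetectedText`, and `Polygon` field names are taken from the public API, and the skill's own scripts/main.py may process the response differently.

```python
def extract_text_boxes(response):
    """Return (text, polygon) pairs from a GeneralAccurateOCR-style response dict."""
    # Accept both the raw HTTP body ({"Response": {...}}) and the inner object.
    payload = response.get("Response", response)
    boxes = []
    for item in payload.get("TextDetections", []):
        # Each polygon point is a dict with X and Y pixel coordinates.
        polygon = [(p["X"], p["Y"]) for p in item.get("Polygon", [])]
        boxes.append((item["DetectedText"], polygon))
    return boxes


# Made-up sample shaped like the API response; not real OCR output.
sample = {
    "TextDetections": [
        {
            "DetectedText": "Hello",
            "Confidence": 99,
            "Polygon": [
                {"X": 10, "Y": 12},
                {"X": 90, "Y": 12},
                {"X": 90, "Y": 40},
                {"X": 10, "Y": 40},
            ],
        }
    ]
}

for text, polygon in extract_text_boxes(sample):
    print(text, polygon)
```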

Security Recommendations

This skill appears to implement Tencent Cloud OCR and will send images (or uploaded files) to Tencent's OCR endpoint using credentials you must provide. Before installing:

- Verify the publisher/source (no homepage is provided and the registry metadata omits the required env vars). If you don't trust the owner, do not provide your Tencent credentials.
- Understand that it will trigger automatically on messages containing images or image URLs (per SKILL.md). If you want manual control, disable autonomous invocation or avoid granting broad triggers.
- The resume parsing guidance enforces verbatim output of recognized text (including PII). Only run this on content you are permitted to send to an external service; consider using redaction or local-only processing for sensitive images.
- Provide least-privilege credentials (use a dedicated Tencent account or temporary credentials) and monitor billing and API usage if you enable the skill.
- If you need higher assurance, inspect/execute scripts/main.py in a sandbox or request a signed, documented release from a verified source.

The main concrete problem here is the metadata mismatch about required env vars; the publisher should correct or explain it before you trust the skill.

Functional Analysis

Type: OpenClaw Skill
Name: tencentcloud-ocr
Version: 1.0.3

The skill bundle provides a legitimate integration with Tencent Cloud's General Accurate OCR API. The Python script (scripts/main.py) uses the official Tencent Cloud SDK to process images or PDFs, and the documentation (SKILL.md and references/resume-parsing.md) provides clear, task-aligned instructions for the AI agent to perform text extraction and resume parsing without any signs of malicious intent, data exfiltration, or prompt injection attacks.

Capability Assessment

⚠ Purpose & Capability

The skill's declared purpose (TencentCloud GeneralAccurateOCR) matches the included code (scripts/main.py), which calls the Tencent Cloud OCR APIs, so requesting Tencent Cloud API keys is appropriate. However, the registry metadata states 'Required env vars: none' and 'Primary credential: none', while SKILL.md and the script explicitly require TENCENTCLOUD_SECRET_ID and TENCENTCLOUD_SECRET_KEY. That omission is an inconsistency and could mislead users about which secrets the skill needs.

⚠ Instruction Scope

SKILL.md instructs automatic invocation whenever images or image URLs (and certain host patterns) appear, and to run scripts/main.py with either image URL or Base64/file path. The script will read local files (if a filepath is passed) and will send image data or URLs to Tencent's OCR endpoint. The resume-parsing reference also instructs the model to preserve original text verbatim (which may include sensitive PII). These behaviors are consistent with OCR functionality but broaden the risk surface: automatic triggering and verbatim extraction of potentially sensitive personal data are privacy risks and should be explicitly accepted by the user.
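The data flow described above (URL passed through, local file read and sent as Base64) can be sketched roughly as follows. This is not the contents of scripts/main.py, which is not shown here; the function names `build_ocr_params` and `run_ocr` and the default region `ap-guangzhou` are assumptions made for illustration. The SDK calls follow the usual tencentcloud-sdk-python pattern, and the SDK is imported lazily so the parameter-building step can be inspected without it.

```python
import base64
import json
import os


def build_ocr_params(source):
    """Map an image URL or local file path to GeneralAccurateOCR parameters."""
    if source.startswith(("http://", "https://")):
        # Remote image: only the URL is sent; Tencent Cloud fetches the image.
        return {"ImageUrl": source}
    # Local file: the bytes are read from disk and sent Base64-encoded.
    with open(source, "rb") as fh:
        return {"ImageBase64": base64.b64encode(fh.read()).decode("ascii")}


def run_ocr(source, region="ap-guangzhou"):
    """Send the image to GeneralAccurateOCR and return the parsed JSON reply."""
    # Imported lazily so build_ocr_params works without the SDK installed.
    from tencentcloud.common import credential
    from tencentcloud.ocr.v20181119 import models, ocr_client

    cred = credential.Credential(
        os.environ["TENCENTCLOUD_SECRET_ID"],
        os.environ["TENCENTCLOUD_SECRET_KEY"],
    )
    client = ocr_client.OcrClient(cred, region)  # region is an assumed default
    request = models.GeneralAccurateOCRRequest()
    request.from_json_string(json.dumps(build_ocr_params(source)))
    response = client.GeneralAccurateOCR(request)
    return json.loads(response.to_json_string())
```

Note that in either branch the image content ends up at an external service, which is why the file-path case deserves the same scrutiny as the URL case.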

✓ Install Mechanism

No install spec is provided (instruction-only with a bundled Python script). There is no external download or archive extraction. The script depends on the public pip package tencentcloud-sdk-python, which is reasonable and transparent.

⚠ Credentials

The code and SKILL.md legitimately require TENCENTCLOUD_SECRET_ID and TENCENTCLOUD_SECRET_KEY to call Tencent Cloud APIs. That credential request is proportionate to the stated purpose. The concern is that the registry metadata omitted these required env vars (it listed none), which is inconsistent and may cause users to underestimate the privilege they are granting. No other unrelated secrets are requested.
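Because the registry metadata does not list these variables, a quick pre-flight check can make the requirement explicit before the skill is enabled. The sketch below is a minimal example, not part of the skill; the helper name `missing_credentials` is hypothetical, and only the two variable names come from SKILL.md and the script as described above.

```python
import os

# The two variables that SKILL.md and scripts/main.py are reported to require.
REQUIRED_ENV_VARS = ("TENCENTCLOUD_SECRET_ID", "TENCENTCLOUD_SECRET_KEY")


def missing_credentials(env=None):
    """Return the names of required variables that are unset or empty."""
    env = os.environ if env is None else env
    return [name for name in REQUIRED_ENV_VARS if not env.get(name)]


if __name__ == "__main__":
    missing = missing_credentials()
    if missing:
        print("Missing credentials: " + ", ".join(missing))
    else:
        print("Both Tencent Cloud credentials are set.")
```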

✓ Persistence & Privilege

The skill does not request always:true and does not request persistent or elevated platform privileges. It is user-invocable and may also be invoked autonomously by the agent (the default), which is normal for skills; read this together with the automatic trigger rules in SKILL.md to understand how it behaves in practice.

Version History

v1.0.3

- Enhanced instructions for automatic trigger: now requires automatic OCR invocation when users send/paste images or provide image URLs, not just upon explicit requests.

v1.0.2

- Changed the display name.

Metadata

Slug tencentcloud-ocr

Version 1.0.3

License MIT-0

Total installs 5

Current installs 5

Number of versions 2

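FAQ

What is TencentCloud OCR?

A skill package for Tencent Cloud General Text Recognition, high-accuracy version (GeneralAccurateOCR). The skill should be invoked automatically when a user sends or pastes an image, provides an image URL, or asks to have the text in an image recognized. It detects and recognizes text across the whole image, supports Chinese, English, mixed Chinese and English, digits, and special characters, and returns the position of each text box along with its content. It is suited to images with dense text, complex layouts, and high requirements for recognition precision and recall... It is an AI Agent Skill plugin for Claude Code / OpenClaw, with 944 cumulative downloads to date.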

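How do I install TencentCloud OCR?

Run the command "/install tencentcloud-ocr" in the OpenClaw or Claude Code chat box to install it in one step. Installation itself needs no extra setup, but the skill requires TENCENTCLOUD_SECRET_ID and TENCENTCLOUD_SECRET_KEY to be set before it can call the OCR API.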

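Is TencentCloud OCR free?

Yes. The skill itself is free under the MIT-0 license and can be freely downloaded, installed, and used. Calls to the Tencent Cloud OCR API run under your own Tencent Cloud credentials, so monitor billing and API usage on that account.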

Which platforms does TencentCloud OCR support?

TencentCloud OCR is cross-platform and can be used in any environment where OpenClaw or Claude Code is deployed.

Who developed TencentCloud OCR?

It is developed and maintained by tencent-ocr (@zt1314p-design); the current version is v1.0.3.
