zt1314p-design

TencentCloud OCR

Author: tencent-ocr · GitHub · v1.0.3 · MIT-0
cross-platform ⚠ suspicious
Total downloads: 944
Favorites: 0
Current installs: 5
Versions: 2
Install in OpenClaw
/install tencentcloud-ocr
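Description
Skill package for Tencent Cloud General Text Recognition, high-accuracy version (GeneralAccurateOCR). The skill should be invoked automatically when a user sends or pastes an image, provides an image URL, or asks to have the text in an image recognized. It detects and recognizes text across the whole image, supports Chinese, English, mixed Chinese and English, digits, and special characters, and returns text-box positions together with the text content. It is suited to scenarios with dense text, complex layouts, and high demands on recognition precision and recall...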
Security recommendations
This skill appears to implement Tencent Cloud OCR and will send images (or uploaded files) to Tencent's OCR endpoint using credentials you must provide. Before installing:
- Verify the publisher/source (no homepage is provided and registry metadata omitted required env vars). If you don't trust the owner, do not provide your Tencent credentials.
- Understand it will automatically trigger on messages containing images or image URLs (per SKILL.md). If you want manual control, disable autonomous invocation or avoid granting broad triggers.
- The resume parsing guidance enforces verbatim output of recognized text (including PII). Only run this on content you are permitted to send to an external service; consider using redaction or local-only processing for sensitive images.
- Provide least-privilege credentials (use a dedicated Tencent account or temporary credentials) and monitor billing and API usage if you enable the skill.
- If you need higher assurance, inspect/execute scripts/main.py in a sandbox or request a signed, documented release from a verified source.
The main concrete problem here is the metadata mismatch about required env vars: that should be corrected or explained by the publisher before trusting the skill.
Functional analysis
Type: OpenClaw Skill
Name: tencentcloud-ocr
Version: 1.0.3
The skill bundle provides a legitimate integration with Tencent Cloud's General Accurate OCR API. The Python script (scripts/main.py) uses the official Tencent Cloud SDK to process images or PDFs, and the documentation (SKILL.md and references/resume-parsing.md) provides clear, task-aligned instructions for the AI agent to perform text extraction and resume parsing without any signs of malicious intent, data exfiltration, or prompt injection attacks.
Capability assessment
Purpose & Capability
The skill's declared purpose (TencentCloud GeneralAccurateOCR) matches the included code (scripts/main.py), which calls Tencent Cloud OCR APIs, so requesting Tencent Cloud API keys is appropriate. However, the registry metadata states 'Required env vars: none' and 'Primary credential: none', while SKILL.md and the script explicitly require TENCENTCLOUD_SECRET_ID and TENCENTCLOUD_SECRET_KEY. This omission is an inconsistency that could mislead users about which secrets the skill needs.
Instruction Scope
SKILL.md instructs automatic invocation whenever images or image URLs (and certain host patterns) appear, and to run scripts/main.py with either image URL or Base64/file path. The script will read local files (if a filepath is passed) and will send image data or URLs to Tencent's OCR endpoint. The resume-parsing reference also instructs the model to preserve original text verbatim (which may include sensitive PII). These behaviors are consistent with OCR functionality but broaden the risk surface: automatic triggering and verbatim extraction of potentially sensitive personal data are privacy risks and should be explicitly accepted by the user.
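The two input modes described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the code in scripts/main.py: the helper name `build_ocr_params` is hypothetical, and it only prepares request parameters without calling any API. `ImageUrl` and `ImageBase64` are the parameter names the GeneralAccurateOCR API uses for the two modes.

```python
import base64
from pathlib import Path


def build_ocr_params(source: str) -> dict:
    """Hypothetical helper: map an image URL or local path to request parameters."""
    if source.startswith(("http://", "https://")):
        # Remote image: only the URL is sent; the OCR service fetches it.
        return {"ImageUrl": source}
    # Local file: the file's bytes are read from disk and sent Base64-encoded.
    data = Path(source).read_bytes()
    return {"ImageBase64": base64.b64encode(data).decode("ascii")}
```

In both branches the image content ends up at the external endpoint, which is why passing a file path deserves the same care as pasting the image itself.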
Install Mechanism
No install spec is provided (instruction-only with a bundled Python script). There is no external download or archive extraction. The script depends on the public pip package tencentcloud-sdk-python, which is reasonable and transparent.
Credentials
The code and SKILL.md legitimately require TENCENTCLOUD_SECRET_ID and TENCENTCLOUD_SECRET_KEY to call Tencent Cloud APIs. That credential request is proportionate to the stated purpose. The concern is that the registry metadata omitted these required env vars (it listed none), which is inconsistent and may cause users to underestimate the privilege they are granting. No other unrelated secrets are requested.
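Because the registry metadata lists no required variables, a quick pre-flight check can confirm what is actually set before the skill runs. The sketch below is an assumption-laden illustration rather than part of the skill: the function name `missing_credentials` is hypothetical, and only the two variable names come from the review above.

```python
import os

# The two variables that SKILL.md and scripts/main.py are reported to require.
REQUIRED_VARS = ("TENCENTCLOUD_SECRET_ID", "TENCENTCLOUD_SECRET_KEY")


def missing_credentials(env=None) -> list:
    """Hypothetical helper: list required variables that are unset or empty."""
    env = os.environ if env is None else env
    return [name for name in REQUIRED_VARS if not env.get(name)]


if __name__ == "__main__":
    missing = missing_credentials()
    if missing:
        print("Missing credentials:", ", ".join(missing))
```

Passing a plain dictionary instead of `os.environ` makes the check easy to exercise without touching real secrets.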
Persistence & Privilege
The skill does not request always:true and does not request persistent or elevated platform privileges. It is user-invocable and may be invoked autonomously by the agent (default), which is normal for skills; combine this with the automatic trigger rules in SKILL.md to understand the practical behavior.
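How to use
  1. Make sure OpenClaw is installed (locally or via Docker)
  2. Enter the install command in the dialog: /install tencentcloud-ocr
  3. Once installed, call the Skill by name or trigger it with /tencentcloud-ocr
  4. Provide the required inputs as described in the Skill's parameter documentation to get structured output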
Version history
v1.0.3
- Enhanced instructions for automatic trigger: now requires automatic OCR invocation when users send/paste images or provide image URLs, not just upon explicit requests.
v1.0.2
- Changed the display name.
Metadata
Slug: tencentcloud-ocr
Version: 1.0.3
License: MIT-0
Total installs: 5
Current installs: 5
Versions: 2
FAQ

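What is TencentCloud OCR?

A skill package for Tencent Cloud General Text Recognition, high-accuracy version (GeneralAccurateOCR). The skill should be invoked automatically when a user sends or pastes an image, provides an image URL, or asks to have the text in an image recognized. It detects and recognizes text across the whole image, supports Chinese, English, mixed Chinese and English, digits, and special characters, and returns text-box positions together with the text content. It is suited to scenarios with dense text, complex layouts, and high demands on recognition precision and recall... It is an AI Agent Skill plugin for Claude Code / OpenClaw, with 944 total downloads to date.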

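How do I install TencentCloud OCR?

Run the command "/install tencentcloud-ocr" in the OpenClaw or Claude Code dialog to install it in one step. The listing describes this as needing no extra configuration, but the capability assessment above notes that SKILL.md and the script require TENCENTCLOUD_SECRET_ID and TENCENTCLOUD_SECRET_KEY.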

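Is TencentCloud OCR free?

Yes. TencentCloud OCR is free of charge under the MIT-0 license and can be freely downloaded, installed, and used. The OCR calls themselves run on Tencent Cloud credentials that you provide, and the security recommendations above advise monitoring billing and API usage.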

Which platforms does TencentCloud OCR support?

TencentCloud OCR is cross-platform and can be used in any environment where OpenClaw / Claude Code is deployed.

Who developed TencentCloud OCR?

It is developed and maintained by tencent-ocr (@zt1314p-design); the current version is v1.0.3.
