TencentCloud OCR
by tencent-ocr · GitHub · v1.0.3 · MIT-0
944 Downloads · 0 Stars · 5 Active Installs · 2 Versions
Install in OpenClaw
/install tencentcloud-ocr
Description
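Tencent Cloud General Accurate OCR (high-precision general text recognition, GeneralAccurateOCR) skill pack. When a user sends or pastes an image, provides an image URL, or asks to have the text in an image recognized, this skill should be invoked automatically. It detects and recognizes text across the whole image, supports Chinese, English, mixed Chinese and English, digits, and special characters, and returns the text box positions along with the text content. Suited to scenarios with dense text, complex layouts, and high requirements for recognition precision and recall...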
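The description says the API returns text box positions together with the recognized text. The sketch below shows one way such a response might be flattened into lines with bounding boxes. The field names (`TextDetections`, `DetectedText`, `Confidence`, `Polygon` with `X`/`Y` points) are assumed from Tencent Cloud's public API reference and should be checked there; `SAMPLE_RESPONSE` is hand-written for illustration and `extract_lines` is a hypothetical helper, not part of the skill.

```python
import json

# Hand-written illustrative payload, NOT real API output.
# Field names are an assumption based on Tencent Cloud's public API reference.
SAMPLE_RESPONSE = """
{
  "Response": {
    "TextDetections": [
      {
        "DetectedText": "Invoice",
        "Confidence": 99,
        "Polygon": [{"X": 10, "Y": 10}, {"X": 90, "Y": 10},
                    {"X": 90, "Y": 30}, {"X": 10, "Y": 30}]
      },
      {
        "DetectedText": "Total: 42.00",
        "Confidence": 97,
        "Polygon": [{"X": 10, "Y": 40}, {"X": 150, "Y": 40},
                    {"X": 150, "Y": 60}, {"X": 10, "Y": 60}]
      }
    ]
  }
}
"""


def extract_lines(response_json: str) -> list:
    """Flatten an OCR response into dicts with text, confidence and a box.

    The box is (min_x, min_y, max_x, max_y) computed from the polygon points,
    or None when a detection carries no polygon.
    """
    body = json.loads(response_json)
    body = body.get("Response", body)  # accept wrapped or unwrapped payloads
    lines = []
    for item in body.get("TextDetections", []):
        points = item.get("Polygon") or []
        xs = [p["X"] for p in points]
        ys = [p["Y"] for p in points]
        box = (min(xs), min(ys), max(xs), max(ys)) if points else None
        lines.append({
            "text": item.get("DetectedText", ""),
            "confidence": item.get("Confidence"),
            "box": box,
        })
    return lines


if __name__ == "__main__":
    for line in extract_lines(SAMPLE_RESPONSE):
        print(line["box"], line["text"])
```

Reducing each polygon to an axis-aligned box loses rotation information, which is acceptable for reading order or cropping but not for precise overlays.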
Usage Guidance
This skill appears to implement Tencent Cloud OCR and will send images (or uploaded files) to Tencent's OCR endpoint using credentials you must provide. Before installing:
- Verify the publisher/source (no homepage is provided and registry metadata omitted required env vars). If you don't trust the owner, do not provide your Tencent credentials.
- Understand it will automatically trigger on messages containing images or image URLs (per SKILL.md). If you want manual control, disable autonomous invocation or avoid granting broad triggers.
- The resume parsing guidance enforces verbatim output of recognized text (including PII). Only run this on content you are permitted to send to an external service; consider using redaction or local-only processing for sensitive images.
- Provide least-privilege credentials (use a dedicated Tencent account or temporary credentials) and monitor billing and API usage if you enable the skill.
- If you need higher assurance, inspect scripts/main.py or run it in a sandbox, or request a signed, documented release from a verified source. The main concrete problem is the metadata mismatch about required env vars; the publisher should correct or explain it before you trust the skill.
Capability Analysis
Type: OpenClaw Skill
Name: tencentcloud-ocr
Version: 1.0.3
The skill bundle provides a legitimate integration with Tencent Cloud's General Accurate OCR API. The Python script (scripts/main.py) uses the official Tencent Cloud SDK to process images or PDFs, and the documentation (SKILL.md and references/resume-parsing.md) provides clear, task-aligned instructions for the AI agent to perform text extraction and resume parsing without any signs of malicious intent, data exfiltration, or prompt injection attacks.
Capability Assessment
Purpose & Capability
The skill's declared purpose (TencentCloud GeneralAccurateOCR) matches the included code (scripts/main.py), which calls Tencent Cloud OCR APIs, so requesting Tencent Cloud API keys is appropriate. However, the registry metadata states 'Required env vars: none' and 'Primary credential: none', while SKILL.md and the script explicitly require TENCENTCLOUD_SECRET_ID and TENCENTCLOUD_SECRET_KEY. That omission is an inconsistency and could mislead users about which secrets the skill needs.
Instruction Scope
SKILL.md instructs automatic invocation whenever images or image URLs (and certain host patterns) appear, and to run scripts/main.py with either image URL or Base64/file path. The script will read local files (if a filepath is passed) and will send image data or URLs to Tencent's OCR endpoint. The resume-parsing reference also instructs the model to preserve original text verbatim (which may include sensitive PII). These behaviors are consistent with OCR functionality but broaden the risk surface: automatic triggering and verbatim extraction of potentially sensitive personal data are privacy risks and should be explicitly accepted by the user.
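To make the data flow concrete, here is a minimal sketch of how an input could be turned into request parameters. It is not the code in scripts/main.py; `build_ocr_params` is a hypothetical helper, and the parameter names `ImageUrl` and `ImageBase64` are assumed from Tencent Cloud's public API reference. It covers only the URL and file-path cases.

```python
import base64
from pathlib import Path


def build_ocr_params(source: str) -> dict:
    """Return request parameters for an image URL or a local file path.

    Hypothetical helper for illustration; parameter names are assumptions.
    """
    if source.startswith(("http://", "https://")):
        # Remote image: only the URL leaves the machine; the OCR service
        # fetches the image itself.
        return {"ImageUrl": source}
    # Local file: the bytes are read from disk and would be uploaded
    # as Base64 text inside the request body.
    data = Path(source).read_bytes()
    return {"ImageBase64": base64.b64encode(data).decode("ascii")}


if __name__ == "__main__":
    print(build_ocr_params("https://example.com/scan.png"))
```

Either branch ends with image content reaching an external service, which is why the file-path case deserves attention: any readable local file passed as `source` would be encoded and sent.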
Install Mechanism
No install spec is provided (instruction-only with a bundled Python script). There is no external download or archive extraction. The script depends on the public pip package tencentcloud-sdk-python, which is reasonable and transparent.
Credentials
The code and SKILL.md legitimately require TENCENTCLOUD_SECRET_ID and TENCENTCLOUD_SECRET_KEY to call Tencent Cloud APIs. That credential request is proportionate to the stated purpose. The concern is that the registry metadata omitted these required env vars (it listed none), which is inconsistent and may cause users to underestimate the privilege they are granting. No other unrelated secrets are requested.
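Because the registry metadata does not list these variables, a pre-flight check can surface a missing credential before any image is processed. The following is a minimal sketch under that assumption; `load_credentials` and `REQUIRED_ENV_VARS` are hypothetical names, and only the two variable names come from SKILL.md as reported above.

```python
import os

# The two variables named in SKILL.md and the script, per the analysis above.
REQUIRED_ENV_VARS = ("TENCENTCLOUD_SECRET_ID", "TENCENTCLOUD_SECRET_KEY")


def load_credentials(env=None) -> dict:
    """Return the required credentials, or raise if any is unset or empty.

    `env` defaults to os.environ; passing a plain dict makes testing easy.
    """
    env = os.environ if env is None else env
    missing = [name for name in REQUIRED_ENV_VARS if not env.get(name)]
    if missing:
        # Report variable names only; never echo secret values.
        raise RuntimeError(
            "Missing required environment variables: " + ", ".join(missing)
        )
    return {name: env[name] for name in REQUIRED_ENV_VARS}


if __name__ == "__main__":
    try:
        load_credentials()
        print("Credentials found.")
    except RuntimeError as exc:
        print(exc)
```

Failing fast with the variable names, and never the values, keeps secrets out of logs while making the undocumented requirement visible.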
Persistence & Privilege
The skill does not request always:true and does not request persistent or elevated platform privileges. It is user-invocable and may be invoked autonomously by the agent (default), which is normal for skills; combine this with the automatic trigger rules in SKILL.md to understand the practical behavior.
How to Use
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat: /install tencentcloud-ocr
- After installation, invoke the skill by name or use /tencentcloud-ocr
- Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.3
- Enhanced instructions for automatic trigger: now requires automatic OCR invocation when users send/paste images or provide image URLs, not just upon explicit requests.
v1.0.2
- Changed the display name.
Frequently Asked Questions
What is TencentCloud OCR?
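TencentCloud OCR is a skill pack for Tencent Cloud General Accurate OCR (high-precision general text recognition, GeneralAccurateOCR). When a user sends or pastes an image, provides an image URL, or asks to have the text in an image recognized, the skill should be invoked automatically. It detects and recognizes text across the whole image, supports Chinese, English, mixed Chinese and English, digits, and special characters, and returns the text box positions along with the text content. Suited to scenarios with dense text, complex layouts, and high requirements for recognition precision and recall... It is an AI Agent Skill for Claude Code / OpenClaw, with 944 downloads so far.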
How do I install TencentCloud OCR?
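Run "/install tencentcloud-ocr" in the OpenClaw or Claude Code chat to install it in one step. Before the skill can call the API, TENCENTCLOUD_SECRET_ID and TENCENTCLOUD_SECRET_KEY must be set, even though the registry metadata lists no required env vars.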
Is TencentCloud OCR free?
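The skill itself is free and licensed under MIT-0, so you can download, install and use it at no cost. The OCR calls run under your own Tencent Cloud credentials, so monitor billing and API usage on that account.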
Which platforms does TencentCloud OCR support?
TencentCloud OCR is cross-platform and runs anywhere OpenClaw / Claude Code is available.
Who created TencentCloud OCR?
It is built and maintained by tencent-ocr (@zt1314p-design); the current version is v1.0.3.