← 返回 Skills 市场
TencentCloud General OCR
作者
tencent-ocr
· GitHub ↗
· v1.0.2
· MIT-0
595
总下载
0
收藏
3
当前安装
3
版本数
在 OpenClaw 中安装
/install tencentcloud-ocr-general
功能描述
腾讯云广告文字识别(AdvertiseOCR)接口调用技能。当用户需要从图片中识别文字内容时,应使用此技能。支持中英文、横排、竖排及倾斜场景的图片文字识别,支持90度、180度、270度翻转场景的图片识别,返回文本框位置与文字内容。支持图片Base64和URL两种输入方式。
安全使用建议
This skill's code and documentation implement Tencent Cloud AdvertiseOCR and require your Tencent Cloud API keys (TENCENTCLOUD_SECRET_ID and TENCENTCLOUD_SECRET_KEY), but the registry metadata does not declare those env vars — treat that as a red flag. Before installing or running: (1) verify the source and trustworthiness of the skill owner; (2) inspect the included scripts (you already have main.py) and confirm there are no unexpected network endpoints; (3) provide credentials only in a controlled environment (use short-lived keys or a restricted account if possible); (4) run the script in an isolated environment (container) and avoid passing files that contain other secrets; (5) consider adding an explicit install step that pins tencentcloud-sdk-python to a known-safe version, or manually install dependencies after review. If you plan to enable this skill for autonomous agent use, resolve the metadata inconsistency first so users understand what credentials will be required.
功能分析
Type: OpenClaw Skill
Name: tencentcloud-ocr-general
Version: 1.0.2
The skill is a legitimate wrapper for the Tencent Cloud AdvertiseOCR API. It uses the official 'tencentcloud-sdk-python' to perform optical character recognition on images provided via URL or local file paths. The code in 'scripts/main.py' correctly handles API credentials through environment variables and contains no evidence of data exfiltration, malicious execution, or prompt injection attacks.
能力评估
Purpose & Capability
The code and SKILL.md implement a Tencent Cloud AdvertiseOCR client and legitimately require Tencent Cloud API keys; the requested capabilities (OCR via tencentcloud-sdk-python) align with the skill's stated purpose. However, the registry metadata claims no required environment variables/credentials while both the instructions and script require TENCENTCLOUD_SECRET_ID and TENCENTCLOUD_SECRET_KEY — this metadata omission is inconsistent and reduces trust.
Instruction Scope
The runtime instructions and script stay within OCR scope: they accept an image URL or base64/file, read files only when explicitly provided, call the official OCR endpoint (ocr.tencentcloudapi.com), and format the response. The script reads environment variables for credentials and does not access unrelated system paths or exfiltrate data to unexpected endpoints.
Install Mechanism
There is no install spec in the registry (instruction-only), but SKILL.md and the script require the third-party package tencentcloud-sdk-python and instruct the user to pip install it. Lack of an explicit install entry means dependencies won't be auto-installed and package versions are unpinned — this is reasonable for an instruction-only skill but raises supply-chain and reproducibility concerns.
Credentials
The only sensitive items the skill needs are Tencent Cloud API keys (TENCENTCLOUD_SECRET_ID and TENCENTCLOUD_SECRET_KEY), which are proportionate to calling the Tencent OCR API. The concern is the inconsistency: the registry metadata lists no required env vars while the SKILL.md and script require secrets; that mismatch could lead to accidental credential exposure or user confusion about what will be requested.
Persistence & Privilege
The skill does not request permanent presence, does not set always:true, and does not modify other skills or system-wide configurations. It runs as a standalone script and prints results to stdout.
如何使用
- 确保已安装 OpenClaw(本地或 Docker 部署)
- 在对话框中输入安装命令:
/install tencentcloud-ocr-general - 安装完成后,直接呼叫该 Skill 的名称或使用
/tencentcloud-ocr-general触发 - 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.2
- 新增可选参数 UserAgent(请求来源标识),用于追踪调用来源,建议统一填写为 Skills
- 其余功能与用法保持不变
v1.0.1
更新显示名称
v1.0.0
Initial release of Tencent Cloud AdvertiseOCR skill.
- Provides OCR text extraction from images, supporting Chinese, English, mixed language, horizontal, vertical, and rotated scenarios.
- Returns detected text, four-point position coordinates, and recognition confidence for each line.
- Supports input via image Base64 string or image URL.
- Includes error code explanations and usage examples.
- Requires `tencentcloud-sdk-python` and Tencent Cloud API credentials via environment variables.
元数据
常见问题
TencentCloud General OCR 是什么?
腾讯云广告文字识别(AdvertiseOCR)接口调用技能。当用户需要从图片中识别文字内容时,应使用此技能。支持中英文、横排、竖排及倾斜场景的图片文字识别,支持90度、180度、270度翻转场景的图片识别,返回文本框位置与文字内容。支持图片Base64和URL两种输入方式。 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 595 次。
如何安装 TencentCloud General OCR?
在 OpenClaw 或 Claude Code 对话框中运行命令「/install tencentcloud-ocr-general」即可一键安装,无需额外配置。
TencentCloud General OCR 是免费的吗?
是的,TencentCloud General OCR 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。
TencentCloud General OCR 支持哪些平台?
TencentCloud General OCR 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。
谁开发了 TencentCloud General OCR?
由 tencent-ocr(@zt1314p-design)开发并维护,当前版本 v1.0.2。
推荐 Skills