← 返回 Skills 市场
zt1314p-design

TencentCloud ExtractDoc OCR

作者 tencent-ocr · GitHub ↗ · v1.0.2 · MIT-0
cross-platform ⚠ suspicious
408
总下载
0
收藏
3
当前安装
3
版本数
在 OpenClaw 中安装
/install tencentcloud-ocr-extractdocagent
功能描述
腾讯云实时文档抽取Agent(ExtractDocAgent)接口调用技能。当用户需要从图片或PDF中按自定义字段名称进行结构化信息抽取时,应使用此技能。支持自定义字段名称、字段类型(KV对或表格字段)和字段提示词,实现灵活的文档信息提取。适用于合同、发票、报告等各类文档的结构化数据抽取场景。
安全使用建议
This skill's code and README implement a typical Tencent Cloud OCR integration and will call ocr.tencentcloudapi.com using your Tencent Cloud API keys. Before installing: (1) be aware the skill requires TENCENTCLOUD_SECRET_ID and TENCENTCLOUD_SECRET_KEY — the registry metadata omitted these, so do not rely solely on the registry listing; (2) provide least-privilege API credentials (a key scoped only to the OCR service if possible) and monitor billing/usage (calls are billable); (3) confirm you trust the skill source because images/PDFs you send will be uploaded to Tencent's API; avoid sending highly sensitive content unless you accept that external processing will occur; (4) the SKILL.md embeds a minor schema inconsistency (a 'UserAgent' entry shown inside ItemNames) — this looks like documentation drift, not an exploit, but verify the expected CLI options and defaults; (5) consider running the script locally first to observe network traffic (ensure it uses TLS) and to confirm credentials and region behavior. If the metadata owner/source cannot explain the missing credential declaration, treat the listing as untrusted.
功能分析
Type: OpenClaw Skill Name: tencentcloud-ocr-extractdocagent Version: 1.0.2 The skill is a legitimate integration for the Tencent Cloud OCR ExtractDocAgent API, designed to extract structured data from images and PDFs. The Python script `scripts/main.py` uses the official `tencentcloud-sdk-python` and follows standard practices for handling cloud credentials via environment variables. No evidence of malicious behavior, data exfiltration, or prompt injection was found in the code or the `SKILL.md` instructions.
能力评估
Purpose & Capability
The SKILL.md and scripts/main.py implement calling Tencent Cloud's ExtractDocAgent API (ocr.tencentcloudapi.com) and require Tencent Cloud API credentials — this matches the stated purpose. However, the registry metadata claims no required environment variables or primary credential, while both SKILL.md and the script require TENCENTCLOUD_SECRET_ID and TENCENTCLOUD_SECRET_KEY. This metadata omission is an inconsistency that should be resolved before trusting the package listing.
Instruction Scope
The runtime instructions and the script stay within OCR/document-extraction scope: they accept image URLs or file paths (or Base64), build a request, call Tencent's OCR API, and format the response. A small incoherence: SKILL.md lists a 'UserAgent' field inside the ItemNames structure (marked as optional and fixed to 'Skills'), which is an odd place for a request-source identifier; the code instead uses a CLI arg (args.user_agent) to set the client request header. No instructions ask the agent to read unrelated system files or call external endpoints other than Tencent Cloud.
Install Mechanism
There is no install spec (instruction-only plus an included script). The script requires the public package tencentcloud-sdk-python (pip). That dependency is proportional and expected for calling Tencent Cloud APIs. No downloads from untrusted URLs or archive extraction were found.
Credentials
The script needs TENCENTCLOUD_SECRET_ID and TENCENTCLOUD_SECRET_KEY which are appropriate and necessary for API access. The proportionality concern is the metadata mismatch: the registry metadata lists no required env vars while the documentation and code require credentials. This discrepancy can mislead users about the secrets the skill needs.
Persistence & Privilege
The skill does not request always:true and has no install steps that modify other skills or system-wide config. It simply runs a client script and does not persist credentials itself. No elevated or persistent privileges were requested.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install tencentcloud-ocr-extractdocagent
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /tencentcloud-ocr-extractdocagent 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.2
- 新增ItemNames参数说明,支持可选字段UserAgent用于请求来源标识,统一固定为 Skills。 - 明确表述UserAgent字段不会影响现有逻辑,仅作调用追踪和溯源用途。
v1.0.1
更新显示名称
v1.0.0
Initial release of tencentcloud-ocr-extractdocagent: - Enables structured data extraction from images or PDFs using Tencent Cloud’s ExtractDocAgent. - Supports custom field names, field types (key-value or table), and field prompts for flexible information extraction. - Handles common document scenarios including contracts, invoices, and reports. - Accepts multiple input formats: PNG, JPG, JPEG, BMP, and PDF. - Provides both formatted and raw JSON output modes. - Offers command-line usage with multiple configurable parameters.
元数据
Slug tencentcloud-ocr-extractdocagent
版本 1.0.2
许可证 MIT-0
累计安装 3
当前安装数 3
历史版本数 3
常见问题

TencentCloud ExtractDoc OCR 是什么?

腾讯云实时文档抽取Agent(ExtractDocAgent)接口调用技能。当用户需要从图片或PDF中按自定义字段名称进行结构化信息抽取时,应使用此技能。支持自定义字段名称、字段类型(KV对或表格字段)和字段提示词,实现灵活的文档信息提取。适用于合同、发票、报告等各类文档的结构化数据抽取场景。 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 408 次。

如何安装 TencentCloud ExtractDoc OCR?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install tencentcloud-ocr-extractdocagent」即可一键安装,无需额外配置。

TencentCloud ExtractDoc OCR 是免费的吗?

是的,TencentCloud ExtractDoc OCR 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。

TencentCloud ExtractDoc OCR 支持哪些平台?

TencentCloud ExtractDoc OCR 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 TencentCloud ExtractDoc OCR?

由 tencent-ocr(@zt1314p-design)开发并维护,当前版本 v1.0.2。

💬 留言讨论