TencentCloud ExtractDoc OCR

Name: TencentCloud ExtractDoc OCR
Author: zt1314p-design

By tencent-ocr · GitHub · v1.0.2 · MIT-0

cross-platform ⚠ suspicious

Total downloads: 408

Install in OpenClaw

/install tencentcloud-ocr-extractdocagent

Description

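A skill for calling the Tencent Cloud real-time document extraction Agent (ExtractDocAgent) API. Use this skill when a user needs to extract structured information from an image or PDF by custom field names. It supports custom field names, field types (key-value pairs or table fields), and field prompts for flexible document information extraction. It suits structured data extraction from contracts, invoices, reports, and other kinds of documents.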

Security recommendations

This skill's code and README implement a typical Tencent Cloud OCR integration and will call ocr.tencentcloudapi.com using your Tencent Cloud API keys. Before installing:

(1) Be aware that the skill requires TENCENTCLOUD_SECRET_ID and TENCENTCLOUD_SECRET_KEY. The registry metadata omitted these, so do not rely solely on the registry listing.
(2) Provide least-privilege API credentials (a key scoped only to the OCR service if possible) and monitor billing and usage, because calls are billable.
(3) Confirm that you trust the skill source, because the images and PDFs you send will be uploaded to Tencent's API. Avoid sending highly sensitive content unless you accept that external processing will occur.
(4) The SKILL.md embeds a minor schema inconsistency (a 'UserAgent' entry shown inside ItemNames). This looks like documentation drift, not an exploit, but verify the expected CLI options and defaults.
(5) Consider running the script locally first to observe network traffic (ensure it uses TLS) and to confirm credentials and region behavior.

If the metadata owner or source cannot explain the missing credential declaration, treat the listing as untrusted.
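The credential requirement described above can be checked before the skill is ever run. The following is a minimal sketch, not code taken from the skill: `missing_credentials` is a hypothetical helper, and only the two variable names come from this listing.

```python
import os

# Variable names taken from the listing; the helper itself is illustrative.
REQUIRED_VARS = ("TENCENTCLOUD_SECRET_ID", "TENCENTCLOUD_SECRET_KEY")


def missing_credentials(env=None):
    """Return the required variable names that are unset or empty."""
    env = os.environ if env is None else env
    return [name for name in REQUIRED_VARS if not env.get(name)]


if __name__ == "__main__":
    missing = missing_credentials()
    if missing:
        print("Missing credentials:", ", ".join(missing))
    else:
        print("Both Tencent Cloud credential variables are set.")
```

Passing a plain dict instead of the real environment makes the check easy to exercise without exposing actual keys.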

Functional analysis

Type: OpenClaw Skill
Name: tencentcloud-ocr-extractdocagent
Version: 1.0.2

The skill is a legitimate integration for the Tencent Cloud OCR ExtractDocAgent API, designed to extract structured data from images and PDFs. The Python script `scripts/main.py` uses the official `tencentcloud-sdk-python` and follows standard practices for handling cloud credentials via environment variables. No evidence of malicious behavior, data exfiltration, or prompt injection was found in the code or the `SKILL.md` instructions.

Capability assessment

ℹ Purpose & Capability

The SKILL.md and scripts/main.py implement calling Tencent Cloud's ExtractDocAgent API (ocr.tencentcloudapi.com) and require Tencent Cloud API credentials — this matches the stated purpose. However, the registry metadata claims no required environment variables or primary credential, while both SKILL.md and the script require TENCENTCLOUD_SECRET_ID and TENCENTCLOUD_SECRET_KEY. This metadata omission is an inconsistency that should be resolved before trusting the package listing.

ℹ Instruction Scope

The runtime instructions and the script stay within OCR/document-extraction scope: they accept image URLs or file paths (or Base64), build a request, call Tencent's OCR API, and format the response. A small incoherence: SKILL.md lists a 'UserAgent' field inside the ItemNames structure (marked as optional and fixed to 'Skills'), which is an odd place for a request-source identifier; the code instead uses a CLI arg (args.user_agent) to set the client request header. No instructions ask the agent to read unrelated system files or call external endpoints other than Tencent Cloud.
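The input handling described above (a URL, a file path, or Base64 content turned into a request) might look roughly like the sketch below. It is not the skill's actual code: `build_request` is a hypothetical helper, and the keys `ImageUrl` and `ImageBase64`, as well as the shape of each ItemNames entry, are assumptions that should be checked against the Tencent Cloud API reference.

```python
import base64
from pathlib import Path


def build_request(source, item_names):
    """Build a request dict from an image/PDF URL or a local file path.

    The keys "ImageUrl" and "ImageBase64" and the structure of the
    ItemNames entries are assumptions for illustration only.
    """
    request = {"ItemNames": list(item_names)}
    if source.startswith(("http://", "https://")):
        # Remote input: pass the URL through unchanged.
        request["ImageUrl"] = source
    else:
        # Local input: read the file and Base64-encode its bytes.
        data = Path(source).read_bytes()
        request["ImageBase64"] = base64.b64encode(data).decode("ascii")
    return request
```

Keeping the request construction in a pure function like this makes it possible to inspect exactly what would be uploaded before any network call is made.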

✓ Install Mechanism

There is no install spec (instruction-only plus an included script). The script requires the public package tencentcloud-sdk-python (pip). That dependency is proportional and expected for calling Tencent Cloud APIs. No downloads from untrusted URLs or archive extraction were found.
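Because there is no install spec, the SDK dependency has to be present before the script runs. A small sketch of such a pre-flight check follows; `has_module` is a hypothetical helper, and the check relies on the `tencentcloud-sdk-python` package exposing the top-level module `tencentcloud`.

```python
import importlib.util


def has_module(name):
    """Return True if a top-level module can be located without importing it."""
    return importlib.util.find_spec(name) is not None


if __name__ == "__main__":
    # tencentcloud-sdk-python installs the top-level package "tencentcloud".
    if has_module("tencentcloud"):
        print("tencentcloud SDK found")
    else:
        print("SDK missing; install with: pip install tencentcloud-sdk-python")
```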

ℹ Credentials

The script needs TENCENTCLOUD_SECRET_ID and TENCENTCLOUD_SECRET_KEY which are appropriate and necessary for API access. The proportionality concern is the metadata mismatch: the registry metadata lists no required env vars while the documentation and code require credentials. This discrepancy can mislead users about the secrets the skill needs.
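One way to surface this kind of mismatch is to compare the environment variables a script reads with the ones its metadata declares. The sketch below is illustrative only: `undeclared_env_vars` is a hypothetical helper, the regular expression covers just the three most common access patterns, and the sample source is made up rather than copied from `scripts/main.py`.

```python
import re

# Matches os.environ["NAME"], os.environ.get("NAME") and os.getenv("NAME").
ENV_PATTERN = re.compile(
    r"""os\.(?:environ\s*\[\s*|environ\.get\s*\(\s*|getenv\s*\(\s*)["']([A-Za-z_][A-Za-z0-9_]*)["']"""
)


def undeclared_env_vars(source_code, declared):
    """Return env var names read in source_code but absent from declared."""
    used = set(ENV_PATTERN.findall(source_code))
    return sorted(used - set(declared))


if __name__ == "__main__":
    # Made-up sample standing in for a script that reads two credentials.
    sample = '''
import os
secret_id = os.environ.get("TENCENTCLOUD_SECRET_ID")
secret_key = os.getenv("TENCENTCLOUD_SECRET_KEY")
'''
    # An empty declared list mirrors registry metadata that lists no env vars.
    print(undeclared_env_vars(sample, declared=[]))
```

A text scan like this misses variables read indirectly (for example through a config loader), so an empty result is a hint, not proof that the metadata is complete.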

✓ Persistence & Privilege

The skill does not request always:true and has no install steps that modify other skills or system-wide config. It simply runs a client script and does not persist credentials itself. No elevated or persistent privileges were requested.

How to use

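Make sure OpenClaw is installed (locally or via Docker)
Enter the install command in the dialog: /install tencentcloud-ocr-extractdocagent
Once installation finishes, invoke the Skill by name or trigger it with /tencentcloud-ocr-extractdocagent
Provide the required inputs according to the Skill's parameter documentation to get structured output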

Version history

v1.0.2

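- Added documentation for the ItemNames parameter, with support for an optional UserAgent field that identifies the request source and is fixed to Skills.
- Clarified that the UserAgent field does not affect existing logic and is used only for call tracing and provenance.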

v1.0.1

Updated the display name.

v1.0.0

Initial release of tencentcloud-ocr-extractdocagent:
- Enables structured data extraction from images or PDFs using Tencent Cloud’s ExtractDocAgent.
- Supports custom field names, field types (key-value or table), and field prompts for flexible information extraction.
- Handles common document scenarios including contracts, invoices, and reports.
- Accepts multiple input formats: PNG, JPG, JPEG, BMP, and PDF.
- Provides both formatted and raw JSON output modes.
- Offers command-line usage with multiple configurable parameters.

Metadata

Slug tencentcloud-ocr-extractdocagent

Version 1.0.2

License MIT-0

Total installs 3

Current installs 3

Version count 3

FAQ

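What is TencentCloud ExtractDoc OCR?

A skill for calling the Tencent Cloud real-time document extraction Agent (ExtractDocAgent) API. Use this skill when a user needs to extract structured information from an image or PDF by custom field names. It supports custom field names, field types (key-value pairs or table fields), and field prompts for flexible document information extraction. It suits structured data extraction from contracts, invoices, reports, and other kinds of documents. It is an AI Agent Skill plugin for Claude Code / OpenClaw and has 408 total downloads to date.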

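How do I install TencentCloud ExtractDoc OCR?

Run the command "/install tencentcloud-ocr-extractdocagent" in the OpenClaw or Claude Code dialog to install it in one step. Installation itself needs no extra configuration, but the skill requires the TENCENTCLOUD_SECRET_ID and TENCENTCLOUD_SECRET_KEY environment variables to call the API.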

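Is TencentCloud ExtractDoc OCR free?

The skill itself is free: it is released under the MIT-0 license and can be downloaded, installed, and used freely. Calls to the Tencent Cloud OCR API are billable, however, so usage may incur charges on your Tencent Cloud account.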

Which platforms does TencentCloud ExtractDoc OCR support?

TencentCloud ExtractDoc OCR is cross-platform and can be used in any environment where OpenClaw / Claude Code is deployed.

Who develops TencentCloud ExtractDoc OCR?

It is developed and maintained by tencent-ocr (@zt1314p-design). The current version is v1.0.2.