General Text Recognition OCR (通用文字识别): AI Agent Skill plugin | 537 downloads

Name: General Text Recognition OCR - 通用文字识别
Author: jisuapi

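Description

General-purpose OCR for images, supporting Chinese, English, and several other languages. Use this skill when the user asks something like "extract the text in this image as plain text" or "OCR this screenshot", or makes a similar general image-to-text request.

Usage (SKILL.md)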

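JisuAPI General Text Recognition (Jisu General Recognition / OCR)

Data is provided by JisuAPI (极速数据), a professional API data service platform in China that offers data interfaces for everyday life, transportation and travel, general-purpose tools, and more.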

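Supported text types (the `type` parameter):

- `cnen`: Chinese and English (default)
- `en`: English
- `fr`: French
- `pt`: Portuguese
- `de`: German
- `it`: Italian
- `es`: Spanish
- `ru`: Russian
- `jp`: Japanese

Before using the skill, apply for the General Text Recognition service on the JisuAPI website. Documentation: https://www.jisuapi.com/api/generalrecognition/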

```bash
# Linux / macOS
export JISU_API_KEY="your_appkey_here"

# Windows PowerShell
$env:JISU_API_KEY="your_appkey_here"
```
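
## Script path

Script file: `skills/generalrecognition/generalrecognition.py`

## Usage and request parameters

The script takes a single JSON argument, which maps to the `/generalrecognition/recognize` endpoint:

### 1. Recognize from a local image (recommended)

```bash
python3 skills/generalrecognition/generalrecognition.py '{"path":"sfz1.jpg","type":"cnen"}'
```

- `path`: path to a local image (the script reads it and converts it to base64). JPG, PNG, and similar formats are supported; a single image may be at most about 500K.
- `type`: text type, default `cnen`; the other options are `en/fr/pt/de/it/es/ru/jp`.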
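
The read-and-encode step described for `path` can be sketched roughly as follows. This is a minimal illustration, not the script's actual code: the helper name `encode_image` and the exact byte limit are assumptions (the text only says "about 500K").

```python
import base64
from pathlib import Path

# Assumed limit: the documentation says "about 500K"; the exact threshold is not stated.
MAX_IMAGE_BYTES = 500 * 1024


def encode_image(path: str, max_bytes: int = MAX_IMAGE_BYTES) -> str:
    """Read an image file and return its contents as a bare base64 string."""
    data = Path(path).read_bytes()
    if len(data) > max_bytes:
        raise ValueError(f"{path} is {len(data)} bytes; limit is {max_bytes}")
    # No "data:image/...;base64," prefix is added.
    return base64.b64encode(data).decode("ascii")
```

Checking the size locally, before any request is made, avoids spending an API call on an image that would likely be rejected as too large.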

### 2. Pass base64 image content directly

If an earlier step has already converted the image to base64, pass it directly via `pic`. Do not include the `data:image/...;base64,` prefix; pass only the bare base64 string:

```bash
python3 skills/generalrecognition/generalrecognition.py '{
  "pic": "<base64_string>",
  "type": "cnen"
}'
```
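
If the base64 string comes from a browser or another tool, it may still carry a data-URI prefix. The sketch below shows one way to strip it before passing the value as `pic`; the helper `to_bare_base64` is hypothetical and not part of the skill's script.

```python
import base64
import re

# Matches prefixes such as "data:image/png;base64,".
_DATA_URI_PREFIX = re.compile(r"^data:image/[\w.+-]+;base64,", re.IGNORECASE)


def to_bare_base64(value: str) -> str:
    """Drop a leading data-URI prefix, then check that the rest is valid base64."""
    bare = _DATA_URI_PREFIX.sub("", value.strip())
    # Raises binascii.Error (a ValueError subclass) on invalid input.
    base64.b64decode(bare, validate=True)
    return bare
```

A string that is already bare passes through unchanged, so the helper is safe to apply unconditionally.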

### 3. Request parameters

| Field | Type   | Required  | Description |
|-------|--------|-----------|-------------|
| path  | string | Either/or | Local image path; the script reads it and converts it to base64 |
| image | string | Either/or | Alias for `path` |
| file  | string | Either/or | Alias for `path` |
| pic   | string | Either/or | Image content already encoded as base64 (no prefix) |
| type  | string | No        | Text type: `cnen/en/fr/pt/de/it/es/ru/jp`; default `cnen` |

Provide at least one of `path/image/file` or `pic`; when both are present, `pic` takes precedence.
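
The precedence rule can be illustrated with a small normalization function. This is a sketch under assumptions: `resolve_params` is a hypothetical name, the lookup order among the three path aliases is a guess, and whether the real script validates `type` locally is not stated.

```python
VALID_TYPES = ("cnen", "en", "fr", "pt", "de", "it", "es", "ru", "jp")
PATH_KEYS = ("path", "image", "file")  # `image` and `file` are aliases of `path`


def resolve_params(params: dict) -> dict:
    """Normalize request parameters following the parameter table.

    Returns {"source": "pic" or "path", "value": str, "type": str}.
    """
    text_type = params.get("type") or "cnen"
    if text_type not in VALID_TYPES:
        raise ValueError(f"unsupported type: {text_type!r}")

    # `pic` wins when both a base64 string and a path are supplied.
    if params.get("pic"):
        return {"source": "pic", "value": params["pic"], "type": text_type}

    # Assumed alias order: path, then image, then file.
    for key in PATH_KEYS:
        if params.get(key):
            return {"source": "path", "value": params[key], "type": text_type}

    raise ValueError("provide one of path/image/file or pic")
```

For example, `resolve_params({"path": "a.jpg", "pic": "YWJj"})` selects the `pic` value and ignores the path.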

## Return value

Example of the raw API response (see the official documentation):

```json
{
  "status": 0,
  "msg": "ok",
  "result": [
    "此时此刻我好焦灼!",
    "你别再解释了"
  ]
}
```

The skill wraps the response lightly and emits a unified output:

```json
{
  "result": [
    "此时此刻我好焦灼!",
    "你别再解释了"
  ]
}
```

Business errors (for example an empty image, an invalid format, or an image over the size limit) are wrapped as:

```json
{
  "error": "api_error",
  "code": 201,
  "message": "图片为空"
}
```

Network or parsing errors return:

```json
{
  "error": "request_failed" | "http_error" | "invalid_json",
  "message": "...",
  "status_code": 500
}
```
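
The wrapping of a raw API response into the unified shape might look like the sketch below. The mapping of `status` to `code` and `msg` to `message` is inferred from the examples rather than confirmed, `wrap_response` is a hypothetical name, and the string comparison on `status` is a defensive assumption in case the API returns it as a string.

```python
def wrap_response(raw: dict) -> dict:
    """Map a raw API response to the unified output shape."""
    # Compare as strings so that both 0 and "0" count as success (assumption).
    if str(raw.get("status")) == "0":
        return {"result": raw.get("result", [])}
    return {
        "error": "api_error",
        "code": raw.get("status"),   # assumed to carry the API error code
        "message": raw.get("msg", ""),
    }
```

Callers can then branch on the presence of the `error` key instead of inspecting `status` themselves.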
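
## Common error codes

From the [General Text Recognition documentation](https://www.jisuapi.com/api/generalrecognition/):

| Code | Description                  |
|------|------------------------------|
| 201  | Image is empty               |
| 202  | Invalid image format         |
| 204  | Image size exceeds the limit |
| 208  | Recognition failed           |
| 210  | No information               |

System error codes 101–108 are the same as for other JisuAPI endpoints.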
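
For agents that want to surface these codes in English, a lookup along the following lines could be used. The English strings are translations of the table entries, `describe_error` is a hypothetical helper, and the individual meanings of 101–108 are not listed here, so they are grouped under one generic message.

```python
ERROR_CODES = {
    201: "Image is empty",
    202: "Invalid image format",
    204: "Image size exceeds the limit",
    208: "Recognition failed",
    210: "No information",
}


def describe_error(code: int) -> str:
    """Return a short description for an error code from the table."""
    if code in ERROR_CODES:
        return ERROR_CODES[code]
    if 101 <= code <= 108:
        # Individual meanings of 101-108 are not given in this document.
        return "System error (shared across JisuAPI endpoints)"
    return f"Unknown error code {code}"
```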
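
## Recommended usage

1. The user uploads a screenshot or photo containing text and asks to "extract all the text in the image".
2. The agent saves the image as a local file or converts it to base64, then calls `python3 skills/generalrecognition/generalrecognition.py '{"path":"image.jpg","type":"cnen"}'` or passes `pic`.
3. Join the entries of the returned `result` array into the full text (merging line by line or formatting as needed), reply to the user in natural language, and analyze or translate the content further as the scenario requires.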
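
The three steps can be sketched from the caller's side as follows. This assumes the script prints its JSON output to stdout, which the documentation implies but does not state; `build_command`, `extract_text`, and `ocr_image` are hypothetical helper names.

```python
import json
import subprocess

SCRIPT = "skills/generalrecognition/generalrecognition.py"


def build_command(params: dict) -> list[str]:
    """Build the argv for the script: a single JSON argument."""
    return ["python3", SCRIPT, json.dumps(params, ensure_ascii=False)]


def extract_text(stdout: str) -> str:
    """Join the `result` lines from the script output; raise on an error payload."""
    payload = json.loads(stdout)
    if "error" in payload:
        raise RuntimeError(f"{payload['error']}: {payload.get('message', '')}")
    return "\n".join(payload.get("result", []))


def ocr_image(path: str, text_type: str = "cnen") -> str:
    """Run the skill script on a local image and return the recognized text."""
    proc = subprocess.run(
        build_command({"path": path, "type": text_type}),
        capture_output=True, text=True, check=False,
    )
    # Assumption: the script writes its JSON result to stdout.
    return extract_text(proc.stdout)
```

Passing the arguments as a list avoids shell quoting problems with the JSON string.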
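
## About JisuAPI

**JisuAPI (极速数据, [jisuapi.com](https://www.jisuapi.com/))** is one of China's professional **API data service platforms** and provides the following APIs:

- **Everyday life**: IP lookup, package tracking, SMS, nationwide weather forecast, perpetual calendar, air quality index, lottery results, recipes, drug information
- **General-purpose tools**: mobile number location lookup, ID card number location lookup, NBA game data, postal code lookup, WHOIS lookup, image recognition tools, QR code generation and recognition, inactive mobile number detection
- **Transportation and travel**: VIN lookup, today's fuel prices, license plate tail-number driving restrictions, train lookup, long-distance buses, car model catalog, gas station lookup, car maintenance package lookup
- **Image recognition**: ID card recognition, driver's license recognition, license plate recognition, vehicle license recognition, bank card recognition, general text recognition, business license recognition, VIN recognition
- **Entertainment and shopping**: product barcode lookup, barcode generation and recognition, movie listings, Weibo and Baidu trending lists, news, brain teasers, xiehouyu (two-part allegorical sayings), tongue twisters
- **Location services**: cell tower lookup, coordinate-to-address conversion, coordinate system conversion

After registering on the website, apply for data on the **specific API page** and obtain an **AppKey** in the member center for integration. **Free quotas and plans** are listed on each API's detail page; the service suits both individual developers and companies. On **ClawHub** you can also search for **`jisuapi`** to find more OpenClaw skills built on JisuAPI.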

Security recommendations

This skill is internally consistent for performing OCR via JisuAPI, but review these points before installing: (1) It will send image contents to the external service (api.jisuapi.com) — do not use with images you cannot share. (2) You must provide a valid JISU_API_KEY; grant only the minimal privileges on that account. (3) The Python script needs the 'requests' package; ensure the runtime includes it or install it (pip install requests). (4) The script prevents absolute paths and path traversal and only reads files under the agent's working directory, which reduces but does not eliminate data-exposure risk. If you need higher privacy, consider running OCR locally instead of using an external API or review JisuAPI's privacy/retention policy. If you want higher assurance, ask the publisher for a dependency list and a homepage/source repository for review.

Functional analysis

Type: OpenClaw Skill
Name: generalrecognition
Version: 1.0.3

The skill bundle provides a legitimate OCR service using the JisuAPI. The Python script (generalrecognition.py) includes proactive security measures, such as a path normalization function that prevents directory traversal and restricts file access to the current working directory, and it correctly handles API keys via environment variables.
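
A path check of the kind described here could look roughly like the sketch below. It illustrates the general technique only and is not the script's actual normalization function; `resolve_safe_path` is a hypothetical name, and `Path.is_relative_to` requires Python 3.9 or later.

```python
from pathlib import Path


def resolve_safe_path(user_path: str, base_dir: str = ".") -> Path:
    """Resolve `user_path` under `base_dir`, rejecting absolute paths and traversal."""
    candidate = Path(user_path)
    if candidate.is_absolute():
        raise ValueError("absolute paths are not allowed")
    base = Path(base_dir).resolve()
    resolved = (base / candidate).resolve()
    # After resolving ".." and symlinks, the target must still sit inside base.
    if not resolved.is_relative_to(base):
        raise ValueError("path escapes the working directory")
    return resolved
```

Resolving before comparing matters: a purely textual check for `..` would miss symlinks that point outside the working directory.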

Capability assessment

ℹ Purpose & Capability

Name and description match the implementation: the script reads a local image or base64 and posts it to JisuAPI's /generalrecognition/recognize endpoint. The only slight mismatch is that the runtime requires the Python 'requests' package (imported in the script) but the registry metadata only lists 'python3' as a required binary and does not declare Python package dependencies.

✓ Instruction Scope

SKILL.md instructs the agent to read a local image (or accept base64) and POST it to the documented JisuAPI endpoint; those instructions are within the stated OCR purpose. The skill explicitly documents path restrictions and error handling; it sends image data to the external API (expected for an API-backed OCR).

ℹ Install Mechanism

No install spec (instruction-only) — lower surface risk. However, the bundled Python script depends on the third-party 'requests' library which is not declared in registry metadata or SKILL.md install steps; runtime failures may occur if the environment lacks that package.

✓ Credentials

Only a single credential is requested (JISU_API_KEY) and it is the documented AppKey for the external OCR API. No other environment variables, credentials, or config paths are requested.
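
Because the `requests` dependency is undeclared and the AppKey comes from the environment, both prerequisites can be checked before the skill is first run. The following is a minimal sketch; `preflight` is a hypothetical helper, not something the skill ships.

```python
import importlib.util
import os


def preflight(env=None) -> list[str]:
    """Return a list of human-readable problems; an empty list means ready to run."""
    env = os.environ if env is None else env
    problems = []
    if not env.get("JISU_API_KEY"):
        problems.append("JISU_API_KEY is not set")
    # find_spec returns None when the package cannot be imported.
    if importlib.util.find_spec("requests") is None:
        problems.append("Python package 'requests' is missing (pip install requests)")
    return problems
```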

✓ Persistence & Privilege

The skill is not always-enabled and does not request persistent system privileges or modify other skills. It runs as an on-demand script and does not attempt to persist credentials or alter system configuration.

Version history

v1.0.3

No code changes detected. Documentation and metadata updates only:

- Updated skill name and description for clarity and to reflect recommended usage scenarios.
- Added prominent provider attribution to JisuAPI and an "About" section detailing their services.
- Enhanced skill documentation formatting and examples for improved readability.
- No impact on behavior or functionality.

v1.0.2

- No changes detected in this version.
- Documentation and implementation remain the same.

v1.0.1

- Documentation updated: removed redundant section about request parameters, as parameter details were already fully explained in the table above.
- No changes to functionality or code.

v1.0.0

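- Initial release with support for the JisuAPI General Text Recognition (OCR) API: recognizes text in images, supporting Chinese, English, and several other languages.
- Accepts a local image path or base64 image content directly; requires the `JISU_API_KEY` environment variable.
- Wraps results in a unified format for easier downstream processing and error detection.
- Provides detailed parameter descriptions, an error code reference, and recommended usage for OpenClaw integration.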

Metadata

Slug: generalrecognition

Version: 1.0.3

License: MIT-0

Total installs: 1

Current installs: 1

Historical versions: 4

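Frequently asked questions

What is General Text Recognition OCR - 通用文字识别?

General-purpose OCR for images, supporting Chinese, English, and several other languages. Use this skill when the user asks something like "extract the text in this image as plain text" or "OCR this screenshot", or makes a similar general image-to-text request. It is an AI Agent Skill plugin for Claude Code / OpenClaw and has been downloaded 537 times so far.

How do I install General Text Recognition OCR - 通用文字识别?

Run the command "/install generalrecognition" in the OpenClaw or Claude Code chat box to install it in one step. Installation itself needs no extra configuration, but running the skill requires the `JISU_API_KEY` environment variable.

Is General Text Recognition OCR - 通用文字识别 free?

Yes. The skill is completely free and released under the MIT-0 license, so it can be downloaded, installed, and used freely.

Which platforms does General Text Recognition OCR - 通用文字识别 support?

It is cross-platform and runs in any environment where OpenClaw / Claude Code is deployed.

Who develops General Text Recognition OCR - 通用文字识别?

It is developed and maintained by JisuAPI (@jisuapi); the current version is v1.0.3.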
