← Back to Skills Marketplace
🔌

image-reader-xytest

by xytest · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ✓ Security Clean
197
Downloads
0
Stars
0
Active Installs
1
Versions
Install in OpenClaw
/install image-reader-xytest
Description
识别本地图片内容(支持中文),通过 OCR.space 免费 API 实现。
README (SKILL.md)

image-reader Skill

识别本地图片内容(支持中文),通过 OCR.space 免费 API 实现。

工作流程

  1. 接收图片路径
  2. 调用 OCR.space API 进行识别
  3. 返回识别结果

Python 脚本

保存到 {skill_root}/scripts/ocr_image.py

import urllib.request, urllib.parse, base64, io, json, sys
from PIL import Image

img_path = sys.argv[1] if len(sys.argv) > 1 else sys.stdin.read().strip()

img = Image.open(img_path)
img_small = img.resize((1600, 2400), Image.LANCZOS)
buf = io.BytesIO()
img_small.save(buf, format='JPEG', quality=80)
b64_data = base64.b64encode(buf.getvalue()).decode()

url = 'https://api.ocr.space/parse/image'
params = urllib.parse.urlencode({
    'base64Image': 'data:image/jpeg;base64,' + b64_data,
    'language': 'chs',
    'isOverlayRequired': 'false',
    'detectOrientation': 'true',
    'scale': 'true',
    'OCREngine': '2',
})
data = params.encode('utf-8')

req = urllib.request.Request(url, data=data)
req.add_header('Content-Type', 'application/x-www-form-urlencoded')
req.add_header('apikey', 'helloworld')  # OCR.space free demo key

with urllib.request.urlopen(req, timeout=30) as resp:
    raw = resp.read()
    result = json.loads(raw.decode('utf-8', errors='replace'))
    if 'ParsedResults' in result:
        for r in result['ParsedResults']:
            print(r['ParsedText'])
    else:
        print('OCR识别失败:', result)

使用方式

python {skill_root}/scripts/ocr_image.py "F:/1.jpg"

依赖

  • Python 3
  • Pillow (pip install pillow)
Usage Guidance
This skill appears internally consistent with its purpose. Before installing, consider: (1) Privacy — your image is uploaded to a third-party service (api.ocr.space); do not send sensitive images unless you accept that. (2) Output file — recognized text is saved under the skill's scripts folder as <image_name>_ocr.txt; check/write permissions. (3) API key — the script uses the public demo key 'helloworld' (limited quota); if you need production use, replace it with your own OCR.space API key (modify the script to accept an env var for the key to avoid hardcoding). (4) Dependencies — ensure Python 3 and Pillow are installed. If you want stronger guarantees, review/modify the script to avoid sending sensitive images or to use an on-prem OCR solution.
Capability Analysis
Type: OpenClaw Skill Name: image-reader-xytest Version: 1.0.0 The skill provides OCR functionality by sending local image data to the legitimate OCR.space API (api.ocr.space) using a public demo API key ('helloworld'). The Python script (scripts/ocr_image.py) correctly implements the stated purpose of resizing images and processing them via a third-party service, with no evidence of malicious intent, data exfiltration beyond the API's requirements, or prompt injection.
Capability Assessment
Purpose & Capability
Name/description (local OCR using OCR.space) align with the included Python script and SKILL.md. The script performs image resizing, base64-encodes the image, and posts to api.ocr.space — functionality expected for this purpose.
Instruction Scope
Instructions only run the provided script against a local image path and require Pillow; they do not read unrelated files or environment variables. Note: the script transmits the image content to OCR.space (a third-party endpoint) and writes the recognized text to a file in the skill's scripts directory, which are privacy-relevant behaviors users should be aware of.
Install Mechanism
No install spec; this is an instruction-only skill that requires Python and the Pillow package (pip). Nothing is downloaded or written by an installer.
Credentials
No environment variables or credentials are requested. The script hardcodes OCR.space's public demo API key ('helloworld'), which is consistent with a demo usage but has limited quota and is not a secret; no unrelated credentials are present.
Persistence & Privilege
The skill does not request persistent/always-on presence and does not modify other skills or system-wide settings. It writes an output file to its own scripts directory, which is expected for a local OCR helper.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install image-reader-xytest
  3. After installation, invoke the skill by name or use /image-reader-xytest
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
- Initial release of image-reader skill. - Recognizes content from local images (including Chinese) using the free OCR.space API. - Includes a Python script for submitting images and returning extracted text. - Easy CLI usage; requires Python 3 and Pillow.
Metadata
Slug image-reader-xytest
Version 1.0.0
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 1
Frequently Asked Questions

What is image-reader-xytest?

识别本地图片内容(支持中文),通过 OCR.space 免费 API 实现。 It is an AI Agent Skill for Claude Code / OpenClaw, with 197 downloads so far.

How do I install image-reader-xytest?

Run "/install image-reader-xytest" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is image-reader-xytest free?

Yes, image-reader-xytest is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does image-reader-xytest support?

image-reader-xytest is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created image-reader-xytest?

It is built and maintained by xytest (@xytest); the current version is v1.0.0.

💬 Comments