

image-reader-xytest


by xytest · GitHub ↗ · v1.0.0 · MIT-0

cross-platform ✓ Security Clean

Downloads: 197

Install in OpenClaw

/install image-reader-xytest

Description

Recognizes the content of local images (Chinese supported) using the free OCR.space API.

README (SKILL.md)

image-reader Skill

Recognizes the content of local images (Chinese supported) using the free OCR.space API.

Workflow

Receive the image path
Call the OCR.space API to run recognition
Return the recognized text

Python script

Save to {skill_root}/scripts/ocr_image.py:

import base64
import io
import json
import sys
import urllib.parse
import urllib.request

from PIL import Image

# Image path: first CLI argument, or read from stdin when none is given.
img_path = sys.argv[1] if len(sys.argv) > 1 else sys.stdin.read().strip()

# Shrink to fit within 1600x2400 while keeping the aspect ratio; convert to
# RGB because JPEG cannot store alpha or palette images.
img = Image.open(img_path).convert('RGB')
img.thumbnail((1600, 2400), Image.LANCZOS)
buf = io.BytesIO()
img.save(buf, format='JPEG', quality=80)
b64_data = base64.b64encode(buf.getvalue()).decode()

url = 'https://api.ocr.space/parse/image'
params = urllib.parse.urlencode({
    'base64Image': 'data:image/jpeg;base64,' + b64_data,
    'language': 'chs',  # Simplified Chinese
    'isOverlayRequired': 'false',
    'detectOrientation': 'true',
    'scale': 'true',
    'OCREngine': '2',
})
data = params.encode('utf-8')

req = urllib.request.Request(url, data=data)
req.add_header('Content-Type', 'application/x-www-form-urlencoded')
req.add_header('apikey', 'helloworld')  # OCR.space free demo key

with urllib.request.urlopen(req, timeout=30) as resp:
    raw = resp.read()
    result = json.loads(raw.decode('utf-8', errors='replace'))
    if 'ParsedResults' in result:
        for r in result['ParsedResults']:
            print(r['ParsedText'])
    else:
        print('OCR failed:', result)
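
The response handling at the end of the script can be pulled into a small helper that is easier to test. This is a minimal sketch, not part of the published skill: `extract_text` is a hypothetical name, and the `IsErroredOnProcessing` and `ErrorMessage` fields are assumptions about the OCR.space response format that should be checked against the current API documentation. Only `ParsedResults` and `ParsedText` appear in the script itself.

```python
import json


def extract_text(result):
    """Return the recognized text from a decoded OCR.space JSON response.

    Raises RuntimeError when the response reports an error or carries
    no parsed results.
    """
    parsed = result.get('ParsedResults')
    # 'IsErroredOnProcessing' and 'ErrorMessage' are assumed field names.
    if result.get('IsErroredOnProcessing') or not parsed:
        raise RuntimeError(f"OCR failed: {result.get('ErrorMessage', result)}")
    # Join the text of every parsed result, one result per line.
    return '\n'.join(r.get('ParsedText', '') for r in parsed)


# Hand-written sample payload, used only to exercise the helper.
sample = json.loads('{"ParsedResults": [{"ParsedText": "hello"}, {"ParsedText": "world"}]}')
print(extract_text(sample))
```

Raising an exception instead of printing the failure lets a caller tell a failed request apart from an image that simply contains no text.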

Usage

python {skill_root}/scripts/ocr_image.py "F:/1.jpg"

Dependencies

Python 3
Pillow (pip install pillow)

Usage Guidance

This skill appears internally consistent with its purpose. Before installing, consider: (1) Privacy — your image is uploaded to a third-party service (api.ocr.space); do not send sensitive images unless you accept that. (2) Output file — recognized text is saved under the skill's scripts folder as <image_name>_ocr.txt; check/write permissions. (3) API key — the script uses the public demo key 'helloworld' (limited quota); if you need production use, replace it with your own OCR.space API key (modify the script to accept an env var for the key to avoid hardcoding). (4) Dependencies — ensure Python 3 and Pillow are installed. If you want stronger guarantees, review/modify the script to avoid sending sensitive images or to use an on-prem OCR solution.
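
Point (3) above suggests reading the API key from an environment variable rather than hardcoding it. One way to do that is sketched below; the variable name `OCR_SPACE_API_KEY` and the helper `get_api_key` are hypothetical choices for this example, not something the skill defines.

```python
import os

DEMO_KEY = 'helloworld'  # public OCR.space demo key used by the script


def get_api_key(env=os.environ):
    """Return the key from the environment, falling back to the demo key."""
    # OCR_SPACE_API_KEY is a hypothetical variable name chosen for this sketch.
    return env.get('OCR_SPACE_API_KEY') or DEMO_KEY
```

In the script, the hardcoded header line would then become `req.add_header('apikey', get_api_key())`.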

Capability Analysis

Type: OpenClaw Skill
Name: image-reader-xytest
Version: 1.0.0

The skill provides OCR functionality by sending local image data to the legitimate OCR.space API (api.ocr.space) using a public demo API key ('helloworld'). The Python script (scripts/ocr_image.py) correctly implements the stated purpose of resizing images and processing them via a third-party service, with no evidence of malicious intent, data exfiltration beyond the API's requirements, or prompt injection.

Capability Assessment

✓ Purpose & Capability

Name/description (local OCR using OCR.space) align with the included Python script and SKILL.md. The script performs image resizing, base64-encodes the image, and posts to api.ocr.space — functionality expected for this purpose.
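
The encode-and-post step described above can be illustrated in isolation. The sketch below builds the form body from raw JPEG bytes using the same field names as the script; `build_form_body` is a hypothetical helper introduced here, and it sends only a subset of the script's fields for brevity.

```python
import base64
import urllib.parse


def build_form_body(jpeg_bytes, language='chs'):
    """Return the URL-encoded POST body for a JPEG image, as bytes."""
    # The image travels as a data URI inside an ordinary form field.
    data_uri = 'data:image/jpeg;base64,' + base64.b64encode(jpeg_bytes).decode('ascii')
    fields = {'base64Image': data_uri, 'language': language, 'OCREngine': '2'}
    # urlencode percent-escapes characters such as '+', '/' and '=' that
    # occur in base64 output, so the payload survives form decoding intact.
    return urllib.parse.urlencode(fields).encode('utf-8')
```

Because the whole image is base64-encoded and then percent-escaped, the request body is noticeably larger than the JPEG itself, which is one reason the script downsizes the image first.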

ℹ Instruction Scope

Instructions only run the provided script against a local image path and require Pillow; they do not read unrelated files or environment variables. Note: the script transmits the image content to OCR.space (a third-party endpoint) and writes the recognized text to a file in the skill's scripts directory, which are privacy-relevant behaviors users should be aware of.
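
The script shown in the README only prints the recognized text, while this assessment describes an output file in the scripts directory. If a file named `<image_name>_ocr.txt` is wanted, its path could be derived as in the sketch below. `ocr_output_path` is a hypothetical helper, and nothing here is confirmed to match what the packaged script does.

```python
from pathlib import Path


def ocr_output_path(img_path, out_dir):
    """Build <image_name>_ocr.txt inside out_dir for the given image path."""
    # Path.stem drops the directory and the final extension: "F:/1.jpg" -> "1".
    return Path(out_dir) / f"{Path(img_path).stem}_ocr.txt"
```

The recognized text could then be saved with `ocr_output_path(img_path, out_dir).write_text(text, encoding='utf-8')`, which keeps Chinese output readable regardless of the platform's default encoding.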

✓ Install Mechanism

No install spec; this is an instruction-only skill that requires Python and the Pillow package (pip). Nothing is downloaded or written by an installer.

✓ Credentials

No environment variables or credentials are requested. The script hardcodes OCR.space's public demo API key ('helloworld'), which is consistent with a demo usage but has limited quota and is not a secret; no unrelated credentials are present.

✓ Persistence & Privilege

The skill does not request persistent/always-on presence and does not modify other skills or system-wide settings. It writes an output file to its own scripts directory, which is expected for a local OCR helper.

How to Use

Make sure OpenClaw is installed (local or Docker)
Run the install command in chat: /install image-reader-xytest
After installation, invoke the skill by name or use /image-reader-xytest
Provide required inputs per the skill's parameter spec and get structured output

Version History

v1.0.0

- Initial release of image-reader skill.
- Recognizes content from local images (including Chinese) using the free OCR.space API.
- Includes a Python script for submitting images and returning extracted text.
- Easy CLI usage; requires Python 3 and Pillow.

Metadata

Slug image-reader-xytest

Version 1.0.0

License MIT-0

All-time Installs 0

Active Installs 0

Total Versions 1

Frequently Asked Questions

What is image-reader-xytest?

It recognizes the content of local images (Chinese supported) using the free OCR.space API. It is an AI Agent Skill for Claude Code / OpenClaw, with 197 downloads so far.

How do I install image-reader-xytest?

Run "/install image-reader-xytest" in the OpenClaw or Claude Code chat to install it in one step. The bundled script additionally requires Python 3 and Pillow (pip install pillow).

Is image-reader-xytest free?

Yes, image-reader-xytest is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does image-reader-xytest support?

image-reader-xytest is cross-platform and runs anywhere OpenClaw / Claude Code is available.

Who created image-reader-xytest?

It is built and maintained by xytest (@xytest); the current version is v1.0.0.
