Autoglm Image Recognition

Name: Autoglm Image Recognition
Author: khurramjamil12

Author: khurramjamil12 · GitHub · v1.0.0 · MIT-0

cross-platform ✓ Security check passed

Install in OpenClaw

/install autoglm-image-recognition

Description

Use the AutoGLM Image Recognition API to analyze and describe image content. Use this skill when the user needs image analysis, object or scene recognition,...

Usage (SKILL.md)

AutoGLM Image Recognition Skill

Use the AutoGLM Image Recognition API to analyze and describe an image.

Prerequisite: Get a Public Image URL

This skill requires image_url to be a publicly accessible URL. Choose the correct path based on the source of the image:

Image source	What to do
Existing public URL (`http://` or `https://`)	Use it directly with no extra processing
Local file (user upload or local path)	You must run `upload-mix.py` first, then pass the returned public URL

Important: If the user provides a local image, such as an uploaded file or a local disk path, do not pass the file path directly. Run upload-mix.py first to upload the file, obtain a public URL, and only then perform image recognition.
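
The choice between the two paths can be sketched as a small check. This is only an illustration: `needs_upload` is a hypothetical helper, not part of the skill's scripts, and it inspects the URL scheme only. It cannot confirm that a URL is actually reachable from the public internet.

```python
from urllib.parse import urlparse


def needs_upload(image_source: str) -> bool:
    """Return True when the source is not an http(s) URL and must be uploaded first.

    Hypothetical helper: it only looks at the scheme and host, so a private
    or unreachable http(s) URL would still be treated as "public".
    """
    parsed = urlparse(image_source)
    is_http_url = parsed.scheme in ("http", "https") and bool(parsed.netloc)
    return not is_http_url


if __name__ == "__main__":
    for source in ("https://example.com/image.jpg", "/home/user/photo.jpg"):
        action = "run upload-mix.py first" if needs_upload(source) else "use directly"
        print(f"{source}: {action}")
```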

Step 1 for a Local Image: Upload with `upload-mix.py`

If the image is a local file, upload it first:

python upload-mix.py "<local image path>"

Example:

python upload-mix.py "/home/user/photo.jpg"

Response structure:

{
  "code": 0,
  "msg": "SUCCESS",
  "time": 1773199477734,
  "trace": "78dd001f3ec04c37b6a1d58b5db70fce",
  "data": {
    "message": "",
    "oss_info": [
      {
        "filename": "photo.jpg",
        "oss_name": "auto_fly/xxx/photo.jpg",
        "oss_url": "https://autoglm-agent.aminer.cn/auto_fly/xxx/photo.jpg"
      }
    ]
  }
}

Extract data.oss_info[0].oss_url from the response. That value is the image_url needed for the recognition step.
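
A minimal sketch of that extraction, assuming the upload response is available as a JSON string and that `code` equal to 0 indicates success (inferred from the sample above, not from documented error codes). `extract_oss_url` is a hypothetical name.

```python
import json


def extract_oss_url(raw_response: str) -> str:
    """Pull data.oss_info[0].oss_url out of an upload response.

    Assumption: code == 0 means success, as in the sample response.
    """
    payload = json.loads(raw_response)
    if payload.get("code") != 0:
        raise RuntimeError(
            f"upload failed: code={payload.get('code')} msg={payload.get('msg')}"
        )
    oss_info = (payload.get("data") or {}).get("oss_info") or []
    if not oss_info:
        raise RuntimeError("upload response contains no oss_info entries")
    return oss_info[0]["oss_url"]


if __name__ == "__main__":
    sample = json.dumps({
        "code": 0,
        "msg": "SUCCESS",
        "data": {
            "message": "",
            "oss_info": [{
                "filename": "photo.jpg",
                "oss_name": "auto_fly/xxx/photo.jpg",
                "oss_url": "https://autoglm-agent.aminer.cn/auto_fly/xxx/photo.jpg",
            }],
        },
    })
    print(extract_oss_url(sample))
```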

Step 2: Image Recognition API

Item	Value
URL	`https://autoglm-api.autoglm.ai/agentdr/v1/assistant/skills/image-recognition`
Method	POST
Request body	See below

Request body:

{
  "prompt": "Describe the image",
  "image_url": "https://example.com/image.jpg"
}

Field	Description	Required
`image_url`	A publicly accessible URL for the image. For local images, upload first with `upload-mix.py` and use `data.oss_info[0].oss_url`	Yes
`prompt`	An instruction such as "Describe the image" or "Extract the text shown in the image"	Optional, default is `"Describe the image"`

Signed headers (generated dynamically for each request):

X-Auth-Appid: 100003
X-Auth-TimeStamp: current Unix timestamp in seconds
X-Auth-Sign: MD5(100003 + "&" + timestamp + "&" + 38d2391985e2369a5fb8227d8e6cd5e5)

Run the Script

Use image-recognition.py in the same directory:

# Pass only the image URL and use the default prompt
python image-recognition.py "https://example.com/image.jpg"

# Pass the image URL with a custom prompt
python image-recognition.py "https://example.com/image.jpg" "Extract the text shown in the image"

Note: Image recognition may take longer than other calls. Wait for the response. If you need a timeout, change the request call in image-recognition.py to:
with urllib.request.urlopen(req, timeout=300) as resp:
A timeout of 300 seconds is recommended.

Full Workflow

User provides a local image
       ↓
Run upload-mix.py to upload the image
  python upload-mix.py "<local image path>"
       ↓
Extract data.oss_info[0].oss_url as image_url
       ↓
Run image-recognition.py
  python image-recognition.py "<image_url>" ["<prompt>"]
       ↓
Present data.text to the user

If the user already provides a public URL, skip the upload step:

User provides a public image URL
       ↓
Run image-recognition.py
  python image-recognition.py "<image_url>" ["<prompt>"]
       ↓
Present data.text to the user

Response Handling

Response Structure

{
  "code": 0,
  "msg": "SUCCESS",
  "time": 1773137796961,
  "trace": "298d5fe1efdd4da58ca46d1700d8054b",
  "data": {
    "text": "Detailed image recognition result...",
    "tokens": 5588
  }
}

Output Requirements

1. Present the recognition result directly. Return the contents of data.text to the user unchanged, preserving the original formatting, including any Markdown emphasis.
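
A minimal sketch of that handling, assuming `code` equal to 0 signals success as in the sample response (error codes are not documented in the source). `extract_text` is a hypothetical helper.

```python
def extract_text(response: dict) -> str:
    """Return data.text exactly as received, or raise if the call did not succeed.

    Assumption: code == 0 means success, as in the sample response.
    """
    if response.get("code") != 0:
        raise RuntimeError(
            f"recognition failed: code={response.get('code')} msg={response.get('msg')}"
        )
    text = (response.get("data") or {}).get("text")
    if not isinstance(text, str):
        raise RuntimeError("response has no data.text field")
    # No stripping or reformatting, so Markdown emphasis and line breaks survive.
    return text


if __name__ == "__main__":
    sample = {
        "code": 0,
        "msg": "SUCCESS",
        "data": {"text": "Detailed image recognition result...", "tokens": 5588},
    }
    print(extract_text(sample))
```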

Security Recommendations

Install only if you are comfortable sending selected images, image URLs, and prompts to AutoGLM. Do not use it for confidential screenshots, IDs, private documents, or sensitive photos unless you are willing for the image to be uploaded to a public URL and processed by an external service.

Capability Assessment

✓ Purpose & Capability

The scripts and documentation align with the stated image-recognition purpose: local files are uploaded to obtain a public URL, then the image URL and prompt are sent to AutoGLM's recognition endpoint.

ℹ Instruction Scope

The workflow is user-directed and documented, but it does not include an explicit privacy warning before uploading local images or sending prompts and image URLs to the external service.

✓ Install Mechanism

The package contains only a skill document and two small Python helper scripts; there is no installer, package manager hook, obfuscated payload, or automatic execution path.

ℹ Credentials

Network access, reading a user-specified local file, localhost token retrieval, and API signing are proportionate for the stated AutoGLM integration, though users should treat uploaded images as externally accessible.

✓ Persistence & Privilege

No persistence, background workers, privilege escalation, credential-store scraping, deletion, or broad local indexing behavior was found.

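How to Use

Make sure OpenClaw is installed (locally or via Docker).
In the chat box, enter the install command: /install autoglm-image-recognition
After installation, invoke the Skill by name or trigger it with /autoglm-image-recognition
Provide the inputs described in the Skill's parameter notes to get structured output.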

Version History

v1.0.0

- Initial release of the autoglm-image-recognition skill.
- Enables image analysis, object/scene recognition, OCR-style text extraction, and general image description via the AutoGLM Image Recognition API.
- Automatically fetches required API tokens from the local service, so no manual setup is needed.
- Supports both public image URLs (use directly) and local files (requires prior upload via upload-mix.py).
- Provides clear steps and examples for uploading local files and performing image recognition.
- Guides users to present results using the API's descriptive output directly.

Metadata

Slug autoglm-image-recognition

Version 1.0.0

License MIT-0

Total installs 0

Current installs 0

Number of versions 1

FAQ