← Back to Skills Marketplace

Autoglm Image Recognition

Name: Autoglm Image Recognition
Author: khurramjamil12

by khurramjamil12 · GitHub ↗ · v1.0.0 · MIT-0

cross-platform ✓ Security Clean

Downloads

Stars

Active Installs

Versions

Install in OpenClaw

/install autoglm-image-recognition

Description

Use the AutoGLM Image Recognition API to analyze and describe image content. Use this skill when the user needs image analysis, object or scene recognition,...

README (SKILL.md)

AutoGLM Image Recognition Skill

Use the AutoGLM Image Recognition API to analyze and describe an image.

Prerequisite: Get a Public Image URL

This skill requires image_url to be a publicly accessible URL. Choose the correct path based on the source of the image:

Image source	What to do
Existing public URL (`http://` or `https://`)	Use it directly with no extra processing
Local file (user upload or local path)	You must run `upload-mix.py` first, then pass the returned public URL

Important: If the user provides a local image, such as an uploaded file or a local disk path, do not pass the file path directly. Run upload-mix.py first to upload the file, obtain a public URL, and only then perform image recognition.

Step 1 for a Local Image: Upload with `upload-mix.py`

If the image is a local file, upload it first:

python upload-mix.py "\x3Clocal image path>"

Example:

python upload-mix.py "/home/user/photo.jpg"

Response structure:

{
  "code": 0,
  "msg": "SUCCESS",
  "time": 1773199477734,
  "trace": "78dd001f3ec04c37b6a1d58b5db70fce",
  "data": {
    "message": "",
    "oss_info": [
      {
        "filename": "photo.jpg",
        "oss_name": "auto_fly/xxx/photo.jpg",
        "oss_url": "https://autoglm-agent.aminer.cn/auto_fly/xxx/photo.jpg"
      }
    ]
  }
}

Extract data.oss_info[0].oss_url from the response. That value is the image_url needed for the recognition step.

Step 2: Image Recognition API

Item	Value
URL	`https://autoglm-api.autoglm.ai/agentdr/v1/assistant/skills/image-recognition`
Method	POST
Request body	See below

Request body:

{
  "prompt": "Describe the image",
  "image_url": "https://example.com/image.jpg"
}

Field	Description	Required
`image_url`	A publicly accessible URL for the image. For local images, upload first with `upload-mix.py` and use `data.oss_info[0].oss_url`	Yes
`prompt`	An instruction such as "Describe the image" or "Extract the text shown in the image"	Optional, default is `"Describe the image"`

Signed headers (generated dynamically for each request):

X-Auth-Appid: 100003
X-Auth-TimeStamp: current Unix timestamp in seconds
X-Auth-Sign: MD5(100003 + "&" + timestamp + "&" + 38d2391985e2369a5fb8227d8e6cd5e5)

Run the Script

Use image-recognition.py in the same directory:

# Pass only the image URL and use the default prompt
python image-recognition.py "https://example.com/image.jpg"

# Pass the image URL with a custom prompt
python image-recognition.py "https://example.com/image.jpg" "Extract the text shown in the image"

Note: Image recognition may take longer than other calls. Wait for the response. If you need a timeout, change the request call in image-recognition.py to:
with urllib.request.urlopen(req, timeout=300) as resp:
A timeout of 300 seconds is recommended.

Full Workflow

User provides a local image
       ↓
Run upload-mix.py to upload the image
  python upload-mix.py "\x3Clocal image path>"
       ↓
Extract data.oss_info[0].oss_url as image_url
       ↓
Run image-recognition.py
  python image-recognition.py "\x3Cimage_url>" ["\x3Cprompt>"]
       ↓
Present data.text to the user

If the user already provides a public URL, skip the upload step:

User provides a public image URL
       ↓
Run image-recognition.py
  python image-recognition.py "\x3Cimage_url>" ["\x3Cprompt>"]
       ↓
Present data.text to the user

Response Handling

Response Structure

{
  "code": 0,
  "msg": "SUCCESS",
  "time": 1773137796961,
  "trace": "298d5fe1efdd4da58ca46d1700d8054b",
  "data": {
    "text": "Detailed image recognition result...",
    "tokens": 5588
  }
}

Output Requirements

1. Present the recognition result directly Return the contents of data.text directly to the user and preserve the original formatting, including any Markdown emphasis.

Usage Guidance

Install only if you are comfortable sending selected images, image URLs, and prompts to AutoGLM. Do not use it for confidential screenshots, IDs, private documents, or sensitive photos unless you are willing for the image to be uploaded to a public URL and processed by an external service.

Capability Assessment

✓ Purpose & Capability

The scripts and documentation align with the stated image-recognition purpose: local files are uploaded to obtain a public URL, then the image URL and prompt are sent to AutoGLM's recognition endpoint.

ℹ Instruction Scope

The workflow is user-directed and documented, but it does not include an explicit privacy warning before uploading local images or sending prompts and image URLs to the external service.

✓ Install Mechanism

The package contains only a skill document and two small Python helper scripts; there is no installer, package manager hook, obfuscated payload, or automatic execution path.

ℹ Credentials

Network access, reading a user-specified local file, localhost token retrieval, and API signing are proportionate for the stated AutoGLM integration, though users should treat uploaded images as externally accessible.

✓ Persistence & Privilege

No persistence, background workers, privilege escalation, credential-store scraping, deletion, or broad local indexing behavior was found.

How to Use

Make sure OpenClaw is installed (local or Docker)
Run the install command in chat: /install autoglm-image-recognition
After installation, invoke the skill by name or use /autoglm-image-recognition
Provide required inputs per the skill's parameter spec and get structured output

Version History

v1.0.0

- Initial release of autoglm-image-recognition skill. - Enables image analysis, object/scene recognition, OCR-style text extraction, and general image description via the AutoGLM Image Recognition API. - Automatically fetches required API tokens from the local service—no manual setup needed. - Supports both public image URLs (use directly) and local files (requires prior upload via upload-mix.py). - Provides clear steps and examples for uploading local files and performing image recognition. - Guides users to present results using the API’s descriptive output directly.

Metadata

Slug autoglm-image-recognition

Version 1.0.0

License MIT-0

All-time Installs 0

Active Installs 0

Total Versions 1

Frequently Asked Questions

What is Autoglm Image Recognition?

Use the AutoGLM Image Recognition API to analyze and describe image content. Use this skill when the user needs image analysis, object or scene recognition,... It is an AI Agent Skill for Claude Code / OpenClaw, with 41 downloads so far.

How do I install Autoglm Image Recognition?

Run "/install autoglm-image-recognition" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Autoglm Image Recognition free?

Yes, Autoglm Image Recognition is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Autoglm Image Recognition support?

Autoglm Image Recognition is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Autoglm Image Recognition?

It is built and maintained by khurramjamil12 (@khurramjamil12); the current version is v1.0.0.

More Skills