Autoglm Image Recognition
/install autoglm-image-recognition
AutoGLM Image Recognition Skill
Use the AutoGLM Image Recognition API to analyze and describe an image.
Prerequisite: Get a Public Image URL
This skill requires image_url to be a publicly accessible URL. Choose the correct path based on the source of the image:
| Image source | What to do |
|---|---|
Existing public URL (http:// or https://) |
Use it directly with no extra processing |
| Local file (user upload or local path) | You must run upload-mix.py first, then pass the returned public URL |
Important: If the user provides a local image, such as an uploaded file or a local disk path, do not pass the file path directly. Run
upload-mix.pyfirst to upload the file, obtain a public URL, and only then perform image recognition.
Step 1 for a Local Image: Upload with upload-mix.py
If the image is a local file, upload it first:
python upload-mix.py "\x3Clocal image path>"
Example:
python upload-mix.py "/home/user/photo.jpg"
Response structure:
{
"code": 0,
"msg": "SUCCESS",
"time": 1773199477734,
"trace": "78dd001f3ec04c37b6a1d58b5db70fce",
"data": {
"message": "",
"oss_info": [
{
"filename": "photo.jpg",
"oss_name": "auto_fly/xxx/photo.jpg",
"oss_url": "https://autoglm-agent.aminer.cn/auto_fly/xxx/photo.jpg"
}
]
}
}
Extract data.oss_info[0].oss_url from the response. That value is the image_url needed for the recognition step.
Step 2: Image Recognition API
| Item | Value |
|---|---|
| URL | https://autoglm-api.autoglm.ai/agentdr/v1/assistant/skills/image-recognition |
| Method | POST |
| Request body | See below |
Request body:
{
"prompt": "Describe the image",
"image_url": "https://example.com/image.jpg"
}
| Field | Description | Required |
|---|---|---|
image_url |
A publicly accessible URL for the image. For local images, upload first with upload-mix.py and use data.oss_info[0].oss_url |
Yes |
prompt |
An instruction such as "Describe the image" or "Extract the text shown in the image" | Optional, default is "Describe the image" |
Signed headers (generated dynamically for each request):
X-Auth-Appid:100003X-Auth-TimeStamp: current Unix timestamp in secondsX-Auth-Sign: MD5(100003 + "&" + timestamp + "&" + 38d2391985e2369a5fb8227d8e6cd5e5)
Run the Script
Use image-recognition.py in the same directory:
# Pass only the image URL and use the default prompt
python image-recognition.py "https://example.com/image.jpg"
# Pass the image URL with a custom prompt
python image-recognition.py "https://example.com/image.jpg" "Extract the text shown in the image"
Note: Image recognition may take longer than other calls. Wait for the response. If you need a timeout, change the request call in
image-recognition.pyto:with urllib.request.urlopen(req, timeout=300) as resp:A timeout of 300 seconds is recommended.
Full Workflow
User provides a local image
↓
Run upload-mix.py to upload the image
python upload-mix.py "\x3Clocal image path>"
↓
Extract data.oss_info[0].oss_url as image_url
↓
Run image-recognition.py
python image-recognition.py "\x3Cimage_url>" ["\x3Cprompt>"]
↓
Present data.text to the user
If the user already provides a public URL, skip the upload step:
User provides a public image URL
↓
Run image-recognition.py
python image-recognition.py "\x3Cimage_url>" ["\x3Cprompt>"]
↓
Present data.text to the user
Response Handling
Response Structure
{
"code": 0,
"msg": "SUCCESS",
"time": 1773137796961,
"trace": "298d5fe1efdd4da58ca46d1700d8054b",
"data": {
"text": "Detailed image recognition result...",
"tokens": 5588
}
}
Output Requirements
1. Present the recognition result directly
Return the contents of data.text directly to the user and preserve the original formatting, including any Markdown emphasis.
- 确保已安装 OpenClaw(本地或 Docker 部署)
- 在对话框中输入安装命令:
/install autoglm-image-recognition - 安装完成后,直接呼叫该 Skill 的名称或使用
/autoglm-image-recognition触发 - 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
Autoglm Image Recognition 是什么?
Use the AutoGLM Image Recognition API to analyze and describe image content. Use this skill when the user needs image analysis, object or scene recognition,... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 41 次。
如何安装 Autoglm Image Recognition?
在 OpenClaw 或 Claude Code 对话框中运行命令「/install autoglm-image-recognition」即可一键安装,无需额外配置。
Autoglm Image Recognition 是免费的吗?
是的,Autoglm Image Recognition 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。
Autoglm Image Recognition 支持哪些平台?
Autoglm Image Recognition 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。
谁开发了 Autoglm Image Recognition?
由 khurramjamil12(@khurramjamil12)开发并维护,当前版本 v1.0.0。