AB Agents Vision
Author: alexburrstudio · v1.0.1 · MIT-0
Total downloads: 48 · Favorites: 0 · Current installs: 0 · Versions: 2
Install in OpenClaw
/install ab-agents-vision
Description
👁️ Image analysis using MiniMax VL API. Describe images, extract text from screenshots, analyze photos. Works with local files and URLs. Simple shell wrapper.
Usage instructions (SKILL.md)
AB Agents Vision 👁️
Image analysis using MiniMax VL API — simple, fast, reliable.
What It Does
- 📸 Describe images — Get detailed scene descriptions
- 📝 Extract text — Read text from screenshots, photos, documents
- 🔍 Analyze photos — Identify objects, people, settings
- 🌐 URL support — Analyze images from the web
Quick Start
# Install uv (provides uvx)
curl -LsSf https://astral.sh/uv/install.sh | sh
# Set your MiniMax API key
export MINIMAX_API_KEY="sk-cp-your-key"
# Use
./vision.sh image.jpg "Describe this image"
Usage
# Basic description
./vision.sh photo.jpg
# With custom prompt
./vision.sh screenshot.png "What text do you see?"
# URL support
./vision.sh "https://example.com/image.jpg" "Describe this"
Requirements
- MiniMax Token Plan API key
- Linux/macOS
- uvx (auto-installed via script)
Examples
Screenshot analysis:
Input: screenshot.png + "What text is in the image?"
Output: "The screenshot shows a code editor with Python code..."
Photo description:
Input: photo.jpg + "Describe in detail"
Output: "A person's bare foot and lower leg resting on a brown
textured waffle-weave blanket. The skin is light-toned with
visible fine hairs..."
Installation
git clone https://github.com/alexburrstudio/ab-agents-skills.git
cd ab-agents-skills/skills/vision
chmod +x vision.sh
Or via ClawHub:
clawhub install AB-Agents-Vision
Troubleshooting
| Error | Solution |
|---|---|
| API Error: 1033 | Retry — system error on MiniMax side |
| No response | Check MINIMAX_API_KEY is set correctly |
| Slow | Use smaller images (<10MB) |
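The retry advice for error 1033 can be automated with a small wrapper. This is a minimal bash sketch, assuming vision.sh exits non-zero when the API call fails (the source does not confirm this). `retry` is a hypothetical helper, not part of the skill.

```shell
#!/usr/bin/env bash
# retry MAX DELAY CMD...: run CMD up to MAX times, doubling the pause
# (in seconds) after each failure. Hypothetical helper, not part of the skill.
retry() {
  local max=$1 delay=$2 attempt=1
  shift 2
  while true; do
    "$@" && return 0
    if [ "$attempt" -ge "$max" ]; then
      echo "giving up after $attempt attempts: $*" >&2
      return 1
    fi
    sleep "$delay"
    delay=$((delay * 2))
    attempt=$((attempt + 1))
  done
}

# Example (assumes vision.sh exits non-zero on an API error):
# retry 3 2 ./vision.sh photo.jpg "Describe this image"
```

If vision.sh reports errors only in its output text, the wrapper would need to check that output instead of the exit status.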
AB-Agents 🦀
Security recommendations
This skill largely does what it claims (wraps MiniMax VL), but there are several issues to consider before installing:
- Metadata mismatch: The registry says no environment variables are required, but both SKILL.md and vision.sh require MINIMAX_API_KEY. Treat the API key as required and ensure you provide a dedicated key.
- Remote installer risk: The README suggests running 'curl ... | sh' to install 'uv'. Running remote install scripts without verification is risky — inspect the script at https://astral.sh/uv/install.sh before running, or install the tool from a trusted package source or your distro's package manager.
- Local credential file read: vision.sh will try to read ~/.minimax-env to extract MINIMAX_API_KEY. If you keep secrets in that file, be aware that the script will access it. Consider storing the key in a controlled location or setting MINIMAX_API_KEY explicitly in the environment.
- PATH tweak: The script prepends /root/.local/bin to PATH at runtime, which is odd and may obscure which binary is being invoked; review the tool installed as 'uvx' and confirm it is what you expect.
Recommended actions before use: inspect vision.sh and the uv install script yourself, avoid piping unknown scripts to sh, set MINIMAX_API_KEY explicitly in your shell environment (not relying on fallback files), and verify the uvx/minimax client being installed is from a trusted source. If you need higher assurance, ask the package owner for a signed release or an install method that uses a well-known package registry or package manager.
Functional analysis
Type: OpenClaw Skill
Name: ab-agents-vision
Version: 1.0.1
The skill bundle contains a shell script (vision.sh) that is vulnerable to command injection and JSON injection. It directly embeds the $PROMPT and $IMAGE_PATH variables into an echo command and a JSON-RPC payload without sanitization, allowing for arbitrary command execution via command substitution (e.g., $(whoami)) if the input is not strictly controlled. Additionally, it relies on uvx to download and execute a remote package (minimax-coding-plan-mcp), which poses a supply chain risk if the package name is typosquatted or the registry is compromised.
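One way to close the injection paths described above is to treat the prompt and image path strictly as data: never pass them through eval or an unquoted expansion, and escape them before placing them inside a JSON string. The following is a minimal bash sketch. `json_escape` and the payload field names `prompt` and `image` are hypothetical and do not reflect the actual request format used by vision.sh.

```shell
#!/usr/bin/env bash
# json_escape: make a string safe to place between double quotes in JSON.
# Hypothetical helper, not taken from vision.sh.
json_escape() {
  local s=$1
  s=${s//\\/\\\\}       # backslash first, so later escapes are not doubled
  s=${s//\"/\\\"}       # double quote
  s=${s//$'\n'/\\n}     # newline
  s=${s//$'\r'/\\r}     # carriage return
  s=${s//$'\t'/\\t}     # tab
  # Drop any remaining control characters, which JSON forbids unescaped.
  printf '%s' "$s" | LC_ALL=C tr -d '\001-\010\013\014\016-\037'
}

PROMPT='Describe "this" $(whoami)'
IMAGE_PATH='photo.jpg'

# printf with quoted arguments never re-evaluates its inputs,
# so $(whoami) stays literal text in the payload.
printf '{"prompt":"%s","image":"%s"}\n' \
  "$(json_escape "$PROMPT")" "$(json_escape "$IMAGE_PATH")"
```

Where jq is available, building the payload with `jq -n --arg` achieves the same result with less hand-written escaping.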
Capability assessment
Purpose & Capability
The code and SKILL.md implement an image-analysis wrapper for MiniMax (calls a minimax tool via 'uvx' and uses a MINIMAX_API_KEY) which matches the stated purpose. However the registry metadata declares no required environment variables while both the docs and the script require MINIMAX_API_KEY — a clear metadata/instruction mismatch. The script also adds /root/.local/bin to PATH which is unusual and unnecessary for the stated functionality.
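To see whether the extra PATH entry changes which uvx binary runs, the lookup can be compared with and without the prefix. This is a minimal bash sketch; `resolve_with_prefix` is a hypothetical helper, and the directory is the one the review says the script prepends.

```shell
#!/usr/bin/env bash
# resolve_with_prefix DIR NAME: print what NAME resolves to once DIR is
# prepended to PATH. Runs in a subshell, so the caller's PATH is unchanged.
resolve_with_prefix() {
  local dir=$1 name=$2
  ( PATH="$dir:$PATH"; command -v "$name" )
}

# Compare the lookup with and without the script's extra directory.
command -v uvx || echo "uvx not found on current PATH" >&2
resolve_with_prefix /root/.local/bin uvx || echo "uvx not found with prefix" >&2
```

If the two lookups print different paths, the script is running a different uvx than the one your shell would normally pick.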
Instruction Scope
SKILL.md instructs the user to install 'uv' by piping a remote install script (curl ... | sh). The runtime script reads MINIMAX_API_KEY from either the environment or from ~/.minimax-env (grep), which means it will access a local credentials file in the user's home directory. Reading that file is outside the nominal scope (the key itself is required, but implicit fallback file access is a privacy/clarity concern). The instructions do not document these behaviors in the registry metadata.
Install Mechanism
There is no formal install spec in the package; SKILL.md recommends running a remote installer via 'curl -LsSf https://astral.sh/uv/install.sh | sh', which is high-risk (running remote code without checks). The script relies on the external 'uvx' tool that would be installed by that script. No checksums or explicit trusted package sources are provided in the skill docs.
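An alternative to piping the installer straight into sh is to download it to a file, read it, record its SHA-256 digest, and run it only while the file still matches that digest. This is a minimal bash sketch. `sha256_of` and `run_if_checksum_matches` are hypothetical helpers, and the digest has to come from your own review, since the skill docs publish none.

```shell
#!/usr/bin/env bash
# sha256_of FILE: print the SHA-256 hex digest of FILE (Linux or macOS).
sha256_of() {
  if command -v sha256sum >/dev/null 2>&1; then
    sha256sum "$1" | cut -d' ' -f1
  else
    shasum -a 256 "$1" | cut -d' ' -f1
  fi
}

# run_if_checksum_matches FILE EXPECTED_SHA256
# Runs FILE with sh only when its digest equals the one recorded after
# review. Hypothetical helper.
run_if_checksum_matches() {
  local file=$1 expected=$2 actual
  actual=$(sha256_of "$file") || return 1
  if [ "$actual" != "$expected" ]; then
    echo "checksum mismatch for $file: got $actual" >&2
    return 1
  fi
  sh "$file"
}

# Typical flow (the digest is a placeholder you fill in yourself):
# curl -LsSf https://astral.sh/uv/install.sh -o uv-install.sh
# less uv-install.sh
# run_if_checksum_matches uv-install.sh "<digest recorded after review>"
```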
Credentials
Functionality legitimately requires a MiniMax API key (MINIMAX_API_KEY), so requiring one is proportionate. But the registry metadata omits this requirement. The script's fallback to read ~/.minimax-env means it will access a local file containing credentials — this access should be declared up front. No unrelated credentials appear to be requested.
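A caller can make the key requirement explicit by checking the environment before invoking the script, so the run never depends on the fallback file. This is a minimal bash sketch, assuming vision.sh prefers the environment variable when it is set, as the review describes. `require_api_key` is a hypothetical helper.

```shell
#!/usr/bin/env bash
# require_api_key: succeed only if MINIMAX_API_KEY is set and non-empty.
# It never reads a file. Hypothetical helper, not part of the skill.
require_api_key() {
  if [ -z "${MINIMAX_API_KEY:-}" ]; then
    echo "MINIMAX_API_KEY is not set; export it before running vision.sh" >&2
    return 1
  fi
}

# Example: check first, then run.
# require_api_key && ./vision.sh image.jpg "Describe this image"
#
# Example: supply the key to a single invocation only.
# MINIMAX_API_KEY="sk-cp-your-key" ./vision.sh image.jpg "Describe this image"
```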
Persistence & Privilege
The skill does not request permanent presence (always:false), does not modify other skills' configs, and does not write persistent files itself. The only runtime change is a temporary PATH mutation. Autonomous invocation is allowed by default but not a special privilege here.
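How to use
- Make sure OpenClaw is installed (locally or via Docker).
- Enter the install command in the chat box: /install ab-agents-vision
- Once installation finishes, invoke the Skill by name or trigger it with /ab-agents-vision.
- Provide the inputs described in the Skill's parameter documentation to get structured output.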
Version history
v1.0.1
- Improved documentation in SKILL.md for easier setup and usage.
- Expanded feature list and clearer examples for describing images, extracting text, and analyzing photos.
- Added explicit URL support instructions.
- Provided installation alternatives and troubleshooting table.
- Updated tags for better searchability.
v1.0.0
- Initial release of AB-Agents-Vision.
- Provides image analysis via MiniMax VL API: image description, text extraction (OCR), and object detection.
- Supports local image files and URLs; works with JPEG, PNG, WebP formats up to ~10MB.
- Multi-language support for English, Russian, and Chinese.
- Includes CLI and Python usage examples for easy integration.
- Requires a MiniMax API key and setup as described in the documentation.
FAQ
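What is AB Agents Vision?
👁️ Image analysis using MiniMax VL API. Describe images, extract text from screenshots, analyze photos. Works with local files and URLs. Simple shell wrapper. It is an AI Agent Skill plugin for Claude Code / OpenClaw, with 48 downloads to date.
How do I install AB Agents Vision?
Run the command "/install ab-agents-vision" in the OpenClaw or Claude Code chat box to install it in one step. The skill still needs MINIMAX_API_KEY to be set before use (see Requirements).
Is AB Agents Vision free?
Yes. AB Agents Vision is free and released under the MIT-0 license; you can download, install, and use it freely.
Which platforms does AB Agents Vision support?
AB Agents Vision runs in any environment where OpenClaw / Claude Code is deployed. SKILL.md lists Linux/macOS as the requirement for the script itself.
Who develops AB Agents Vision?
It is developed and maintained by alexburrstudio (@alexburrstudio). The current version is v1.0.1.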