โ† Back to Skills Marketplace
alexburrstudio

AB Agents Vision

by alexburrstudio ยท GitHub โ†— ยท v1.0.1 ยท MIT-0
cross-platform โš  suspicious
48
Downloads
0
Stars
0
Active Installs
2
Versions
Install in OpenClaw
/install ab-agents-vision
Description
๐Ÿ‘๏ธ Image analysis using MiniMax VL API. Describe images, extract text from screenshots, analyze photos. Works with local files and URLs. Simple shell wrapper.
README (SKILL.md)

AB Agents Vision ๐Ÿ‘๏ธ

Image analysis using MiniMax VL API โ€” simple, fast, reliable.

What It Does

  • ๐Ÿ“ธ Describe images โ€” Get detailed scene descriptions
  • ๐Ÿ“ Extract text โ€” Read text from screenshots, photos, documents
  • ๐Ÿ” Analyze photos โ€” Identify objects, people, settings
  • ๐ŸŒ URL support โ€” Analyze images from the web

Quick Start

# Install
curl -LsSf https://astral.sh/uv/install.sh | sh

# Set your MiniMax API key
export MINIMAX_API_KEY="sk-cp-your-key"

# Use
./vision.sh image.jpg "Describe this image"

Usage

# Basic description
./vision.sh photo.jpg

# With custom prompt
./vision.sh screenshot.png "What text do you see?"

# URL support
./vision.sh "https://example.com/image.jpg" "Describe this"

Requirements

  • MiniMax Token Plan API key (get one)
  • Linux/macOS
  • uvx (auto-installed via script)

Examples

Screenshot analysis:

Input: screenshot.png + "What text is in the image?"
Output: "The screenshot shows a code editor with Python code...

Photo description:

Input: photo.jpg + "Describe in detail"
Output: "A person's bare foot and lower leg resting on a brown
textured waffle-weave blanket. The skin is light-toned with
visible fine hairs..."

Installation

git clone https://github.com/alexburrstudio/ab-agents-skills.git
cd ab-agents-skills/skills/vision
chmod +x vision.sh

Or via ClaWHub:

clawhub install AB-Agents-Vision

Troubleshooting

Error Solution
API Error: 1033 Retry โ€” system error on MiniMax side
No response Check MINIMAX_API_KEY is set correctly
Slow Use smaller images (\x3C10MB)

AB-Agents ๐Ÿฆ€

Usage Guidance
This skill largely does what it claims (wraps MiniMax VL), but there are several issues to consider before installing: - Metadata mismatch: The registry says no environment variables are required, but both SKILL.md and vision.sh require MINIMAX_API_KEY. Treat the API key as required and ensure you provide a dedicated key. - Remote installer risk: The README suggests running 'curl ... | sh' to install 'uv'. Running remote install scripts without verification is risky โ€” inspect the script at https://astral.sh/uv/install.sh before running, or install the tool from a trusted package source or your distro's package manager. - Local credential file read: vision.sh will try to read ~/.minimax-env to extract MINIMAX_API_KEY. If you keep secrets in that file, be aware the script will access it. Consider storing the key in a controlled location or set MINIMAX_API_KEY in the environment explicitly. - PATH tweak: The script prepends /root/.local/bin to PATH at runtime which is odd and may hide which binary is being invoked; review the tool installed as 'uvx' and confirm it is what you expect. Recommended actions before use: inspect vision.sh and the uv install script yourself, avoid piping unknown scripts to sh, set MINIMAX_API_KEY explicitly in your shell environment (not relying on fallback files), and verify the uvx/minimax client being installed is from a trusted source. If you need higher assurance, ask the package owner for a signed release or an install method that uses a well-known package registry or package manager.
Capability Analysis
Type: OpenClaw Skill Name: ab-agents-vision Version: 1.0.1 The skill bundle contains a shell script (vision.sh) that is vulnerable to command injection and JSON injection. It directly embeds the $PROMPT and $IMAGE_PATH variables into an echo command and a JSON-RPC payload without sanitization, allowing for arbitrary command execution via command substitution (e.g., $(whoami)) if the input is not strictly controlled. Additionally, it relies on uvx to download and execute a remote package (minimax-coding-plan-mcp), which poses a supply chain risk if the package name is typosquatted or the registry is compromised.
Capability Tags
requires-sensitive-credentials
Capability Assessment
โ„น Purpose & Capability
The code and SKILL.md implement an image-analysis wrapper for MiniMax (calls a minimax tool via 'uvx' and uses a MINIMAX_API_KEY) which matches the stated purpose. However the registry metadata declares no required environment variables while both the docs and the script require MINIMAX_API_KEY โ€” a clear metadata/instruction mismatch. The script also adds /root/.local/bin to PATH which is unusual and unnecessary for the stated functionality.
โš  Instruction Scope
SKILL.md instructs the user to install 'uv' by piping a remote install script (curl ... | sh). The runtime script reads MINIMAX_API_KEY from either the environment or from ~/.minimax-env (grep), which means it will access a local credentials file in the user's home directory. Reading that file is outside the nominal scope (the key itself is required, but implicit fallback file access is a privacy/clarity concern). The instructions do not document these behaviors in the registry metadata.
โš  Install Mechanism
There is no formal install spec in the package; SKILL.md recommends running a remote installer via 'curl -LsSf https://astral.sh/uv/install.sh | sh', which is high-risk (running remote code without checks). The script relies on the external 'uvx' tool that would be installed by that script. No checksums or explicit trusted package sources are provided in the skill docs.
โ„น Credentials
Functionality legitimately requires a MiniMax API key (MINIMAX_API_KEY), so requiring one is proportionate. But the registry metadata omits this requirement. The script's fallback to read ~/.minimax-env means it will access a local file containing credentials โ€” this access should be declared up front. No unrelated credentials appear to be requested.
โœ“ Persistence & Privilege
The skill does not request permanent presence (always:false), does not modify other skills' configs, and does not write persistent files itself. The only runtime change is a temporary PATH mutation. Autonomous invocation is allowed by default but not a special privilege here.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install ab-agents-vision
  3. After installation, invoke the skill by name or use /ab-agents-vision
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.1
- Improved documentation in SKILL.md for easier setup and usage. - Expanded feature list and clearer examples for describing images, extracting text, and analyzing photos. - Added explicit URL support instructions. - Provided installation alternatives and troubleshooting table. - Updated tags for better searchability.
v1.0.0
- Initial release of AB-Agents-Vision. - Provides image analysis via MiniMax VL API: image description, text extraction (OCR), and object detection. - Supports local image files and URLs; works with JPEG, PNG, WebP formats up to ~10MB. - Multi-language support for English, Russian, and Chinese. - Includes CLI and Python usage examples for easy integration. - Requires a MiniMax API key and setup as described in the documentation.
Metadata
Slug ab-agents-vision
Version 1.0.1
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 2
Frequently Asked Questions

What is AB Agents Vision?

๐Ÿ‘๏ธ Image analysis using MiniMax VL API. Describe images, extract text from screenshots, analyze photos. Works with local files and URLs. Simple shell wrapper. It is an AI Agent Skill for Claude Code / OpenClaw, with 48 downloads so far.

How do I install AB Agents Vision?

Run "/install ab-agents-vision" in the OpenClaw or Claude Code chat to install it in one step โ€” no extra setup required.

Is AB Agents Vision free?

Yes, AB Agents Vision is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does AB Agents Vision support?

AB Agents Vision is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created AB Agents Vision?

It is built and maintained by alexburrstudio (@alexburrstudio); the current version is v1.0.1.

๐Ÿ’ฌ Comments