← Back to Skills Marketplace

Vlm Image Helper

Name: Vlm Image Helper
Author: testlbin

by testlbin · GitHub ↗ · v0.1.0 · MIT-0

cross-platform ✓ Security Clean

170

Downloads

Stars

Active Installs

Versions

Install in OpenClaw

/install vlm-image-helper

Description

Visual inspection helper for VLM and OCR workflows. Use when agent needs to help a vision model see an image more clearly before re-analysis: rotate misalign...

Usage Guidance

This appears to be a focused, local image-preprocessing helper suitable for VLM/OCR workflows. Before installing or running: (1) review the full script if you will process sensitive images (it operates locally and returns base64 or files, but you should avoid pasting secrets into command arguments), (2) install Pillow from the official PyPI source (pip install Pillow) or via your vetted package manager, and (3) confirm your agent won't forward image data to external services unless you intend that. If you need a deeper audit, provide the remainder of scripts/image_helper.py for a line-by-line review.

Capability Analysis

Type: OpenClaw Skill Name: vlm-image-helper Version: 0.1.0 The vlm-image-helper skill is a legitimate utility designed to assist Vision Language Models (VLMs) by preprocessing images for better OCR and visual analysis. The primary script, scripts/image_helper.py, uses the Pillow library to perform standard image transformations like rotation, cropping, and scaling, with no evidence of malicious behavior such as data exfiltration, network calls, or unauthorized system access. The instructions in SKILL.md and the documentation are consistent with the tool's stated purpose and do not contain harmful prompt-injection attempts.

Capability Assessment

✓ Purpose & Capability

Name/description (VLM/OCR preprocessing) match the provided CLI, SKILL.md, README and the included script: rotation, semantic cropping, scaling and enhancement are implemented and expected for this purpose.

✓ Instruction Scope

SKILL.md limits scope to minimal transformations for re-analysis and documents input/output rules. The runtime instructions only reference the included script and local image inputs/outputs (file paths, data URIs, base64) — no directives to read unrelated files or send data externally.

✓ Install Mechanism

No install spec in registry; the code is instruction-only plus a small Python script. The only external dependency is Pillow (pip), which is reasonable and documented. No downloads from unknown URLs or archive extraction are used.

✓ Credentials

The skill requires no environment variables, no credentials, and no config paths. The script uses only local temp files and in-memory base64 — proportional to the stated functionality.

✓ Persistence & Privilege

Skill is not always-enabled and uses no privileged agent APIs or modifications to other skills. It writes only its own temporary output files when asked and returns base64 on demand.

How to Use

Make sure OpenClaw is installed (local or Docker)
Run the install command in chat: /install vlm-image-helper
After installation, invoke the skill by name or use /vlm-image-helper
Provide required inputs per the skill's parameter spec and get structured output

Version History

v0.1.0

- Initial release of vlm-image-helper: a visual preprocessing tool for OCR and VLM tasks. - Provides quick commands for rotating, cropping, zooming, enhancing, or converting images to improve model analysis. - Designed for minimal, targeted image transformations—avoid general-purpose editing. - Supports input/output as file paths, base64 strings, or data URIs. - Emphasizes iterative improvement: re-analyze after each edit rather than chaining many changes. - Includes references for full CLI options and preset parameters.

Metadata

Slug vlm-image-helper

Version 0.1.0

License MIT-0

All-time Installs 0

Active Installs 0

Total Versions 1

Frequently Asked Questions

What is Vlm Image Helper?

Visual inspection helper for VLM and OCR workflows. Use when agent needs to help a vision model see an image more clearly before re-analysis: rotate misalign... It is an AI Agent Skill for Claude Code / OpenClaw, with 170 downloads so far.

How do I install Vlm Image Helper?

Run "/install vlm-image-helper" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Vlm Image Helper free?

Yes, Vlm Image Helper is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Vlm Image Helper support?

Vlm Image Helper is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Vlm Image Helper?

It is built and maintained by testlbin (@testlbin); the current version is v0.1.0.

More Skills