← Back to Skills Marketplace

Image Scanner Pro

Name: Image Scanner Pro
Author: mrxolin

by mrxolin · GitHub ↗ · v1.0.0 · MIT-0

cross-platform ⚠ suspicious

221

Downloads

Stars

Active Installs

Versions

Install in OpenClaw

/install image-scanner-pro

Description

Scan image folders and use Gemini 2.0 Flash to analyze and categorize photos by photography attributes like composition, lighting, and style.

README (SKILL.md)

image-scanner-pro\r

Description\r

扫描图片文件夹，调用视觉大模型（Gemini 2.0 Flash）深度分析每张照片的摄影属性：景别、主体、场景、光线、氛围、影调、产品、物件、陈设。\r \r

Triggers\r

分析摄影作品\r
识别图片内容\r
扫描并分类图片\r
批量分析照片风格\r
整理作品集\r
识别图片颜色和风格\r \r

Capabilities\r

扫描目录中的所有图片文件\r
调用视觉模型分析每张图片\r
识别专业摄影属性（景别/主体/光线/影调等）\r
按拍摄内容自动分类\r
生成详细分析报告\r
支持批量处理\r \r

Requirements\r

需要配置视觉模型 API（Gemini 2.0 Flash）\r
安装依赖：npm install @google/generative-ai\r \r

Usage\r

node skills/image-scanner-pro/index.js --path \x3C目录路径> --api-key \x3CGemini Key> --output report.json

Usage Guidance

This skill appears to do what it says (scan local images and send them to Gemini for analysis) but the registry metadata failed to declare the required Gemini API key/env vars. Before installing: 1) Expect that images (full base64 blobs) will be uploaded to an external LLM service — do NOT run it on private or sensitive photos unless you accept that. 2) Provide the API key either with --api-key or set GEMINI_API_KEY; the skill will also respect HTTPS_PROXY / --proxy. 3) Review and run dependencies (npm install) in an isolated environment; verify the @google/generative-ai package version is acceptable. 4) If you need stronger guarantees about data handling or want metadata-only analysis, request/inspect a variant that avoids sending full images. If you need the metadata declared in the registry (so automated installers can surface required secrets), ask the publisher to add GEMINI_API_KEY to requires.env.

Capability Analysis

Type: OpenClaw Skill Name: image-scanner-pro Version: 1.0.0 The skill bundle is a legitimate tool for batch-analyzing images using the Gemini 2.0 Flash model. The code in index.js scans a local directory for image files, converts them to base64, and sends them to the official Google Generative AI API for photography-specific analysis as described in SKILL.md. There is no evidence of data exfiltration, unauthorized network activity, or malicious prompt injection.

Capability Assessment

ℹ Purpose & Capability

Name/description claim to analyze local images with Gemini; the code and SKILL.md implement exactly that (scanning folders, sending images to Gemini, categorizing). However, the registry metadata declares no primary credential or required env vars while both SKILL.md and index.js expect a Gemini API key (CLI --api-key or process.env.GEMINI_API_KEY). That mismatch is unexplained.

ℹ Instruction Scope

Runtime instructions and code are narrowly scoped to scanning a specified directory, reading image files, and sending their base64 content to the Gemini model for analysis. This is coherent with purpose, but it does transmit full image data (and file paths/sizes) to an external model — a privacy consideration users should be aware of.

✓ Install Mechanism

No external arbitrary downloads; dependencies are standard npm packages (@google/generative-ai, proxy-agent) declared in package.json and resolved via registry mirror in package-lock.json. No extract-from-URL installs or obscure hosts were used.

⚠ Credentials

The skill uses an API key (GEMINI_API_KEY) and proxy env vars at runtime but the registry metadata lists none and primary credential is 'none'. Requiring an API key to call Gemini is expected, but failing to declare it in metadata is an incoherence that can lead to surprise credential prompts or misconfiguration. The presence of proxy-agent dependency also implies network configuration ability that isn't declared.

✓ Persistence & Privilege

Skill is not always-enabled, does not request elevated system persistence, and does not modify other skills or global configuration. It runs on-demand and writes only its report file if --output is provided.

How to Use

Make sure OpenClaw is installed (local or Docker)
Run the install command in chat: /install image-scanner-pro
After installation, invoke the skill by name or use /image-scanner-pro
Provide required inputs per the skill's parameter spec and get structured output

Version History

v1.0.0

Initial release of image-scanner-pro. - Scans folders and analyzes each image using Gemini 2.0 Flash vision model. - Identifies photography attributes such as scene, subject, lighting, mood, and style. - Automatically classifies photos by content and generates detailed analysis reports. - Supports batch processing and works via simple command line usage. - Requires configuration of Gemini API and installation of @google/generative-ai.

Metadata

Slug image-scanner-pro

Version 1.0.0

License MIT-0

All-time Installs 1

Active Installs 1

Total Versions 1

Frequently Asked Questions

What is Image Scanner Pro?

Scan image folders and use Gemini 2.0 Flash to analyze and categorize photos by photography attributes like composition, lighting, and style. It is an AI Agent Skill for Claude Code / OpenClaw, with 221 downloads so far.

How do I install Image Scanner Pro?

Run "/install image-scanner-pro" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Image Scanner Pro free?

Yes, Image Scanner Pro is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Image Scanner Pro support?

Image Scanner Pro is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Image Scanner Pro?

It is built and maintained by mrxolin (@mrxolin); the current version is v1.0.0.

More Skills