← 返回 Skills 市场

图片向量嵌入技能

Name: 图片向量嵌入技能
Author: kumamon2019s

作者 Venwell Chiang · GitHub ↗ · v1.0.0 · MIT-0

cross-platform ⚠ suspicious

116

总下载

当前安装

版本数

在 OpenClaw 中安装

/install image-embedding

功能描述

图片向量嵌入技能，支持将病害图片转换为特征向量，用于后续的图片检索和相似度匹配。适用于病害知识库的图片特征提取、向量入库等场景。

使用说明 (SKILL.md)

图片向量嵌入技能

核心功能

支持JPG/PNG/WebP等常见病害图片格式的特征提取
输出1024维归一化特征向量，适配病害检索场景精度要求
支持批量处理图片，单批最大支持100张
内置预训练病害特征提取模型，无需额外微调即可使用

依赖安装

pip install torch>=2.0 torchvision>=0.15 transformers>=4.30 pillow>=10.0

使用方式

单张图片提取

from scripts.embedding import ImageEmbedding
embeder = ImageEmbedding()
feature_vector = embeder.extract("./test_disease.jpg")

批量提取

vectors = embeder.batch_extract(["./img1.jpg", "./img2.jpg"])

输出格式

返回归一化后的numpy数组，shape为(1024,)

安全使用建议

This skill's documentation promises a real pretrained image embedding model and lists heavy ML dependencies, but the included code is a placeholder that returns random vectors. Before installing or using it: (1) don't assume outputs are meaningful — test with known inputs to verify vectors (e.g., check reproducibility, normalization, similarity behavior); (2) avoid blindly running 'pip install' of large ML libraries unless you actually need them — the shipped code only needs numpy; (3) ask the author for the real model implementation or for SKILL.md to be corrected (provide model weights, license, inference code, and how normalization/batch limits are enforced); (4) if you need production embeddings, prefer a skill with explicit, auditable model code or a link to a trusted model/release. If the author supplies a true model implementation that matches the docs, re-evaluate; otherwise treat this as untrusted/testing-only code.

功能分析

Type: OpenClaw Skill Name: image-embedding Version: 1.0.0 The skill provides a placeholder implementation for image feature extraction (embedding) for disease-related images. The code in scripts/embedding.py is a simple mock that returns random vectors, and the SKILL.md documentation describes standard usage of common machine learning libraries (torch, torchvision, transformers) without any malicious instructions or suspicious behaviors.

能力评估

⚠ Purpose & Capability

SKILL.md and description claim a pre-trained disease-specific embedding model, 1024-d normalized outputs, support for common image formats, batching limits, and ML dependencies (torch, torchvision, transformers, pillow). The included code (scripts/embedding.py) contains only a simple class that returns random numpy vectors and does not use any of the declared ML libraries or implement format handling, normalization, or batch-size enforcement. The requested dependencies and promised capabilities are disproportionate to the actual implementation.

⚠ Instruction Scope

Runtime instructions tell the user/agent to install heavy ML packages via pip, and show usage examples that imply a real model. The SKILL.md instructs operations that the code does not implement (pretrained model extraction, normalization, file format handling, max-batch enforcement). There are no instructions to access unrelated files or credentials, but the guidance is misleading about runtime behavior.

ℹ Install Mechanism

There is no formal install spec (skill is instruction-only), but SKILL.md recommends pip installing torch, torchvision, transformers, pillow. Because those installs are not reflected in code and may be unnecessary, this is an inconsistency (not a direct supply-chain risk here, but installing large ML packages without need is avoidable risk).

✓ Credentials

The skill requests no environment variables, credentials, or config paths. There is no sign of unnecessary credential access or exfiltration in code or instructions.

✓ Persistence & Privilege

Skill flags are default (always: false, user-invocable: true, model invocation allowed). The skill does not request elevated persistence or modify other skills; no concern here.

如何使用

确保已安装 OpenClaw（本地或 Docker 部署）
在对话框中输入安装命令：/install image-embedding
安装完成后，直接呼叫该 Skill 的名称或使用 /image-embedding 触发
根据 Skill 的参数说明提供必要输入，即可获得结构化输出

版本历史

v1.0.0

Initial release of image-embedding skill: - Converts disease images (JPG/PNG/WebP) into 1024-dimensional normalized feature vectors. - Enables both single and batch image processing (up to 100 images per batch). - Integrates a pre-trained disease feature extraction model for out-of-the-box use. - Designed for image retrieval and similarity matching in disease knowledge bases.

元数据

Slug image-embedding

版本 1.0.0

许可证 MIT-0

累计安装 0

当前安装数 0

历史版本数 1

常见问题