← Back to Skills Marketplace
图片向量嵌入技能
by
Venwell Chiang
· GitHub ↗
· v1.0.0
· MIT-0
116
Downloads
0
Stars
0
Active Installs
1
Versions
Install in OpenClaw
/install image-embedding
Description
图片向量嵌入技能,支持将病害图片转换为特征向量,用于后续的图片检索和相似度匹配。适用于病害知识库的图片特征提取、向量入库等场景。
README (SKILL.md)
图片向量嵌入技能
核心功能
- 支持JPG/PNG/WebP等常见病害图片格式的特征提取
- 输出1024维归一化特征向量,适配病害检索场景精度要求
- 支持批量处理图片,单批最大支持100张
- 内置预训练病害特征提取模型,无需额外微调即可使用
依赖安装
pip install torch>=2.0 torchvision>=0.15 transformers>=4.30 pillow>=10.0
使用方式
单张图片提取
from scripts.embedding import ImageEmbedding
embeder = ImageEmbedding()
feature_vector = embeder.extract("./test_disease.jpg")
批量提取
vectors = embeder.batch_extract(["./img1.jpg", "./img2.jpg"])
输出格式
返回归一化后的numpy数组,shape为(1024,)
Usage Guidance
This skill's documentation promises a real pretrained image embedding model and lists heavy ML dependencies, but the included code is a placeholder that returns random vectors. Before installing or using it: (1) don't assume outputs are meaningful — test with known inputs to verify vectors (e.g., check reproducibility, normalization, similarity behavior); (2) avoid blindly running 'pip install' of large ML libraries unless you actually need them — the shipped code only needs numpy; (3) ask the author for the real model implementation or for SKILL.md to be corrected (provide model weights, license, inference code, and how normalization/batch limits are enforced); (4) if you need production embeddings, prefer a skill with explicit, auditable model code or a link to a trusted model/release. If the author supplies a true model implementation that matches the docs, re-evaluate; otherwise treat this as untrusted/testing-only code.
Capability Analysis
Type: OpenClaw Skill
Name: image-embedding
Version: 1.0.0
The skill provides a placeholder implementation for image feature extraction (embedding) for disease-related images. The code in scripts/embedding.py is a simple mock that returns random vectors, and the SKILL.md documentation describes standard usage of common machine learning libraries (torch, torchvision, transformers) without any malicious instructions or suspicious behaviors.
Capability Assessment
Purpose & Capability
SKILL.md and description claim a pre-trained disease-specific embedding model, 1024-d normalized outputs, support for common image formats, batching limits, and ML dependencies (torch, torchvision, transformers, pillow). The included code (scripts/embedding.py) contains only a simple class that returns random numpy vectors and does not use any of the declared ML libraries or implement format handling, normalization, or batch-size enforcement. The requested dependencies and promised capabilities are disproportionate to the actual implementation.
Instruction Scope
Runtime instructions tell the user/agent to install heavy ML packages via pip, and show usage examples that imply a real model. The SKILL.md instructs operations that the code does not implement (pretrained model extraction, normalization, file format handling, max-batch enforcement). There are no instructions to access unrelated files or credentials, but the guidance is misleading about runtime behavior.
Install Mechanism
There is no formal install spec (skill is instruction-only), but SKILL.md recommends pip installing torch, torchvision, transformers, pillow. Because those installs are not reflected in code and may be unnecessary, this is an inconsistency (not a direct supply-chain risk here, but installing large ML packages without need is avoidable risk).
Credentials
The skill requests no environment variables, credentials, or config paths. There is no sign of unnecessary credential access or exfiltration in code or instructions.
Persistence & Privilege
Skill flags are default (always: false, user-invocable: true, model invocation allowed). The skill does not request elevated persistence or modify other skills; no concern here.
How to Use
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat:
/install image-embedding - After installation, invoke the skill by name or use
/image-embedding - Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
Initial release of image-embedding skill:
- Converts disease images (JPG/PNG/WebP) into 1024-dimensional normalized feature vectors.
- Enables both single and batch image processing (up to 100 images per batch).
- Integrates a pre-trained disease feature extraction model for out-of-the-box use.
- Designed for image retrieval and similarity matching in disease knowledge bases.
Metadata
Frequently Asked Questions
What is 图片向量嵌入技能?
图片向量嵌入技能,支持将病害图片转换为特征向量,用于后续的图片检索和相似度匹配。适用于病害知识库的图片特征提取、向量入库等场景。 It is an AI Agent Skill for Claude Code / OpenClaw, with 116 downloads so far.
How do I install 图片向量嵌入技能?
Run "/install image-embedding" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.
Is 图片向量嵌入技能 free?
Yes, 图片向量嵌入技能 is completely free, licensed under MIT-0. You can download, install and use it at no cost.
Which platforms does 图片向量嵌入技能 support?
图片向量嵌入技能 is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).
Who created 图片向量嵌入技能?
It is built and maintained by Venwell Chiang (@kumamon2019s); the current version is v1.0.0.
More Skills