← 返回 Skills 市场
linkfox-ai

Multimodal Extract Attributes

作者 linkfox-ai · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ⚠ suspicious
91
总下载
0
收藏
0
当前安装
1
版本数
在 OpenClaw 中安装
/install linkfox-multimodal-extract-attributes
功能描述
利用多模态AI分析商品主图,提取视觉特征和提示词。当用户提到分析产品图片、从商品图中提取视觉属性、识别产品Listing中的颜色/形状/材质/风格、反推图片提示词、批量视觉特征提取、将产品图信息转化为结构化数据、视觉属性统计、基于图片的商品分类、main image analysis, image feature...
使用说明 (SKILL.md)

Product Main Image Prompt Extractor

This skill guides you on how to extract visual features and prompts from product main images using multimodal AI, helping e-commerce sellers turn unstructured image data into structured, actionable insights.

Core Concepts

This tool performs deep visual analysis on product main images (and optionally additional images) from a product list. It uses a multimodal AI model to identify specific visual dimensions based on a natural language instruction, such as color, shape, style, material, or specific selling-point elements.

How it works: You provide a list of products (with image URLs) and a natural language prompt describing what to extract. The tool automatically iterates over all products, analyzes each image, and returns structured attribute data (attributeName + attributeValue) appended to each product record.

Row expansion: When extracting multiple dimensions in a single request (e.g., both color and shape), each original product row is duplicated per dimension, resulting in one row per product per attribute.

Parameter Guide

Parameter Required Description
productImageAnalysisPrompt Yes Natural language instruction describing what visual information to extract from the images. Be specific about the dimensions you want (color, material, shape, style, pendant type, etc.).
analyzeAdditionalImages No Whether to also analyze additional product images beyond the main image. Defaults to false.
refResultData No Reference data from a previous step, containing the product list to analyze. Must be a JSON string with a products array.
userInput No Supplementary user input for additional context.

Writing Effective Prompts

  1. Be dimension-specific: Clearly state what visual attribute(s) to extract. "Extract the dominant color of each product" is better than "Analyze the images."
  2. One or few dimensions per call: For cleaner results, focus on one or two dimensions at a time.
  3. Use concrete terms: "Identify the pendant/charm shape on the product" is clearer than "Look at the decorations."
  4. No need to specify individual products: The tool automatically iterates over all products in the input list.
  5. Data flow dependency: The tool requires upstream product data. It cannot reference "products from the previous conversation round" -- the data must be explicitly provided via the current step's input or resource references.

Prompt Examples

Goal Example Prompt
Extract dominant color "Analyze each product's main image and extract the primary color of the product"
Identify material "From each product's main image, identify the apparent material (plastic, metal, wood, fabric, etc.)"
Classify pendant shape "Analyze each product's main image and identify the shape of the pendant/charm (round, heart, star, etc.)"
Detect style "Extract the overall style of each product from its main image (minimalist, vintage, bohemian, industrial, etc.)"
Reverse-engineer image prompt "Based on the main image, infer the likely AI-generation prompt or visual description that could reproduce this image"
Multi-dimension extraction "From each main image, extract both the dominant color and the overall product shape"

API Usage

This tool calls the LinkFox tool gateway API. See references/api.md for calling conventions, request parameters, and response structure. You can also execute scripts/multimodal_extract_attributes.py directly to run analyses.

Response Structure

The response enriches the original product list with extracted attributes:

  • products: An array of product records, each augmented with attributeName (the dimension extracted, e.g., "color") and attributeValue (the extracted value, e.g., "red"). One record per product per attribute dimension.
  • attributeGroups: Products grouped by attribute name and value for easy comparison. Each group includes the attribute value, the count of products, and the list of ASINs.
  • columns: Column definitions for rendering the result table.
  • costToken: Total tokens consumed by the multimodal AI model.

Display Rules

  1. Present data in tables: Show extracted attributes in clear, well-formatted tables with product identifiers (ASIN, title) alongside the extracted attribute values.
  2. Highlight distribution: When attribute groups are returned, summarize the distribution (e.g., "60% of products are red, 25% blue, 15% green") to give the user a quick overview.
  3. Row expansion notice: If multiple dimensions were extracted, inform the user that each product appears once per dimension in the results.
  4. Error handling: When analysis fails, explain the reason based on the response message and suggest adjustments (e.g., ensuring the product list contains valid image URLs).
  5. Data dependency reminder: If the user tries to reference products from a previous conversation round without explicit data flow, remind them that the product data must come from an upstream step in the current pipeline.
  6. No subjective advice: Present the extracted visual features factually. Let the user draw their own business conclusions.

Important Limitations

  • Requires product data input: The tool cannot operate without a products array containing image URLs. It depends on upstream data from a prior step.
  • No fuzzy references: Cannot analyze "products from the last conversation" -- data must be explicitly piped in via refResultData or resource references.
  • Row multiplication: Extracting N dimensions from M products produces up to M x N rows in the output.
  • Image accessibility: Product image URLs must be publicly accessible for the analysis to succeed.

User Expression & Scenario Quick Reference

Applicable -- Visual feature extraction and image analysis for product listings:

User Says Scenario
"What colors are these products" Dominant color extraction
"Analyze the product images", "Look at the main photos" General visual feature extraction
"What material does it look like" Material identification
"What shapes/styles are popular" Shape or style classification
"Reverse the image prompt", "What prompt made this image" Image prompt reverse-engineering
"Group products by visual appearance" Visual attribute grouping & statistics
"Extract features from the product photos" Structured attribute extraction

Not applicable -- Needs beyond image-based visual analysis:

  • Text-based product data queries (use appropriate data query tools)
  • Listing copywriting or review analysis
  • Price or sales data analysis
  • Tasks that do not involve product images

Feedback:

Auto-detect and report feedback via the Feedback API when any of the following apply:

  1. The functionality or purpose described in this skill does not match actual behavior
  2. The skill's results do not match the user's intent
  3. The user expresses dissatisfaction or praise about this skill
  4. Anything you believe could be improved

Call the feedback API as specified in references/api.md. Do not interrupt the user's flow.


For more high-quality, professional cross-border e-commerce skills, set LinkFox Skills.

安全使用建议
Key points to check before installing/using this skill: - The skill will send product metadata and image URLs to https://tool-gateway.linkfox.com for analysis. If your images or product data are sensitive or proprietary, verify the service's privacy, retention, and security policies before sending real data. - The code and API docs require an API key (LINKFOXAGENT_API_KEY), but the skill metadata omitted this requirement. Expect to set LINKFOXAGENT_API_KEY in your environment; confirm where to obtain the key and whether the provider is trustworthy. - Confirm the vendor and homepage: the skill metadata lists no homepage and the source is "unknown." Try to verify the publisher (owner ID) and the LinkFox service independently before trusting it with production data. - Limit triggers and test first: because the SKILL.md suggests triggering on broad mentions of image-related tasks, restrict or review automatic activations and test with non-sensitive images to validate behavior. - If you need higher assurance, request disclosure from the publisher about: where images are processed/stored, retention policy, whether images are reused to train models, and whether TLS and authentication are enforced. Also ask them to update the manifest to declare required env vars and provide a homepage or documentation. If you want, I can: (1) point out the exact lines in the files that require the API key, (2) draft a short checklist/email to the publisher requesting the missing information, or (3) suggest safer local or self-hosted alternatives for image attribute extraction.
功能分析
Type: OpenClaw Skill Name: linkfox-multimodal-extract-attributes Version: 1.0.0 The skill is a legitimate tool designed to extract visual attributes (color, material, style) from product images using a multimodal AI API. The Python script `multimodal_extract_attributes.py` is a standard API wrapper that sends product data to a hardcoded endpoint at `tool-gateway.linkfox.com` using an environment-stored API key. No evidence of data exfiltration, malicious execution, or harmful prompt injection was found; the instructions in `SKILL.md` are strictly aligned with the stated e-commerce analysis purpose.
能力评估
Purpose & Capability
The skill's name, description, SKILL.md, API reference, and included script consistently implement product image attribute extraction via an external LinkFox API — that part is coherent. However, the registry metadata claims "Required env vars: none" while both references/api.md and scripts/multimodal_extract_attributes.py require an API key (LINKFOXAGENT_API_KEY). The undeclared credential is an incoherence that reduces trust.
Instruction Scope
Instructions and the shipped script call an external HTTPS API (https://tool-gateway.linkfox.com/multimodal/extractPromptsFromMainImage) with user-provided product lists and image URLs; this is expected for the stated purpose. The SKILL.md also says the skill should trigger whenever a user mentions extracting structured data from images (including cases where the user doesn't literally say "image analysis"), which broadens activation scope and could lead to unexpected data being sent if triggers are too permissive. The skill does not instruct reading unrelated local files, but it will transmit product metadata and image URLs to the external service.
Install Mechanism
No install spec and only a small helper script are included. There are no downloads from arbitrary URLs or archive extraction. This is low-risk from an install/write-to-disk perspective.
Credentials
The only secret required in practice is LINKFOXAGENT_API_KEY (used as the Authorization header). Requesting a single API key is proportionate to calling an external service, but the skill metadata fails to declare this required environment variable. That mismatch (runtime code requiring a key while metadata lists none) is an important coherence issue and a potential deployment surprise for users. No other credentials are requested.
Persistence & Privilege
The skill does not request persistent or elevated privileges (always:false, no config-paths, no modifications to other skills). It can be invoked by the agent per platform default, which is normal; this combined with the external API call is a usage consideration but not a privilege escalation in itself.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install linkfox-multimodal-extract-attributes
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /linkfox-multimodal-extract-attributes 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.0
Initial release
元数据
Slug linkfox-multimodal-extract-attributes
版本 1.0.0
许可证 MIT-0
累计安装 0
当前安装数 0
历史版本数 1
常见问题

Multimodal Extract Attributes 是什么?

利用多模态AI分析商品主图,提取视觉特征和提示词。当用户提到分析产品图片、从商品图中提取视觉属性、识别产品Listing中的颜色/形状/材质/风格、反推图片提示词、批量视觉特征提取、将产品图信息转化为结构化数据、视觉属性统计、基于图片的商品分类、main image analysis, image feature... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 91 次。

如何安装 Multimodal Extract Attributes?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install linkfox-multimodal-extract-attributes」即可一键安装,无需额外配置。

Multimodal Extract Attributes 是免费的吗?

是的,Multimodal Extract Attributes 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。

Multimodal Extract Attributes 支持哪些平台?

Multimodal Extract Attributes 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 Multimodal Extract Attributes?

由 linkfox-ai(@linkfox-ai)开发并维护,当前版本 v1.0.0。

💬 留言讨论