← Back to Skills Marketplace
minds-eye
by
Tianyu Jiang
· GitHub ↗
· v1.0.0
274
Downloads
1
Stars
0
Active Installs
1
Versions
Install in OpenClaw
/install minds-eye
Description
Remember and retrieve visual content from conversations. Use when: (1) user sends an image, photo, chart, diagram, or screenshot and wants it saved/remembere...
Usage Guidance
This skill appears to do what it says: it analyzes images by sending them to the configured vision-capable model and stores copies and metadata under ~/.multimodal-memory/. Before installing, verify which provider/API key the skill will use (check ~/.openclaw/openclaw.json and your OPENAI_API_KEY). Do not supply file paths to sensitive local files (password screenshots, private keys, etc.), because any image file you point the skill to will be read and uploaded to the API. If you plan to capture websites, note Playwright (or headless Chrome) will load remote pages locally to take screenshots — that can execute page JS. Consider using a separate/limited account or running this skill in an isolated environment if you need to limit exposure, and review or rotate API keys regularly. Finally, inspect ~/.multimodal-memory/ if you want to audit stored images and memory.md summaries or remove them when no longer needed.
Capability Analysis
Type: OpenClaw Skill
Name: minds-eye
Version: 1.0.0
The 'minds-eye' skill provides a legitimate mechanism for an OpenClaw agent to store and recall visual information. It includes scripts for capturing website screenshots (capture_url.py), analyzing images via the OpenAI API (analyze.py), and managing a local SQLite database for metadata (store.py, search.py, list.py). The skill correctly retrieves API keys from the standard OpenClaw configuration directory (~/.openclaw/) and maintains a human-readable summary in ~/.multimodal-memory/memory.md. No evidence of data exfiltration, malicious execution, or prompt injection attacks was found; the code is well-structured and aligns perfectly with its stated purpose.
Capability Assessment
Purpose & Capability
Name/description (remember & retrieve visual content) matches the included scripts and instructions: analyze images via a vision-capable model, capture website screenshots, store images and metadata in ~/.multimodal-memory, and provide search/list utilities. Required binary (python3) and optional Playwright dependency are reasonable for this functionality.
Instruction Scope
Instructions correctly direct the agent to use the provided scripts (analyze.py, capture_url.py, store.py, search.py, list.py). The skill intentionally reads OpenClaw config (~/.openclaw/openclaw.json) and ~/.openclaw/.env and will ask for absolute image file paths when needed. These behaviors are consistent with the purpose but do grant the skill the ability to read arbitrary files the agent is instructed to analyze (those files will be base64-encoded and sent to the configured model provider).
Install Mechanism
This is an instruction-and-script skill with no installer; requirements.txt lists Playwright only. One-time pip install of Playwright (recommended by the SKILL.md) is the only external package installation suggested. There are no downloads from unknown URLs or archives that would raise high install risk.
Credentials
The skill does not require unusual environment variables, but it will look for an API key in ~/.openclaw/openclaw.json, ~/.openclaw/.env, or the OPENAI_API_KEY env var to call a vision-capable model. This is proportional to its function. Users should be aware that any image provided (or any file path supplied and read) will be uploaded to the configured API provider.
Persistence & Privilege
The skill stores data under the user's home directory (~/.multimodal-memory/) and updates a memory.md file; this is expected for a local memory capability. It does not request always:true and does not modify other skills or system-wide agent configuration.
How to Use
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat:
/install minds-eye - After installation, invoke the skill by name or use
/minds-eye - Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
Initial release: visual memory skill for AI agents
Metadata
Frequently Asked Questions
What is minds-eye?
Remember and retrieve visual content from conversations. Use when: (1) user sends an image, photo, chart, diagram, or screenshot and wants it saved/remembere... It is an AI Agent Skill for Claude Code / OpenClaw, with 274 downloads so far.
How do I install minds-eye?
Run "/install minds-eye" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.
Is minds-eye free?
Yes, minds-eye is completely free (open-source). You can download, install and use it at no cost.
Which platforms does minds-eye support?
minds-eye is cross-platform and runs anywhere OpenClaw / Claude Code is available (darwin, linux).
Who created minds-eye?
It is built and maintained by Tianyu Jiang (@horisky); the current version is v1.0.0.
More Skills