← Back to Skills Marketplace

MenuVision

Name: MenuVision
Author: ademczuk

by ademczuk · GitHub ↗ · v1.0.1

cross-platform ⚠ suspicious

601

Downloads

Stars

Active Installs

Versions

Install in OpenClaw

/install menuvision

Description

Build beautiful HTML photo menus from restaurant URLs, PDFs, or photos using Gemini Vision and AI image generation

Usage Guidance

This skill appears to do what it says, but consider these before installing: 1) Privacy: PDFs and photos you provide will be sent to Google's Gemini APIs for extraction and image generation — do not upload sensitive personal data unless you're comfortable with Google processing it. 2) API key scope: use a dedicated Google API key or service account with minimal required permissions and rotate it if shared. 3) Local scripts: the agent will generate and run Python scripts locally (ensure python3 is the expected interpreter). 4) Publishing: if you choose to publish to GitHub Pages, the agent should prompt for explicit GitHub credentials; do not provide tokens unless you trust the action. 5) Source provenance: the metadata points at a GitHub repo (github.com/ademczuk/MenuVision); if provenance matters, review that repository or contact the author before trusting the skill.

Capability Analysis

Type: OpenClaw Skill Name: menuvision Version: 1.0.1 The skill bundle is classified as suspicious due to its extensive use of high-risk capabilities, including the execution of `git` commands with a `GITHUB_PAT` for publishing to GitHub Pages, the installation and use of `playwright` (a headless browser) for web scraping, and broad file system access. While these capabilities are necessary for the skill's stated purpose of building and optionally publishing restaurant menus, they introduce significant attack surface and potential for misuse. There is no clear evidence of intentional malicious behavior (e.g., data exfiltration to unauthorized endpoints, persistence mechanisms, or obfuscation) within the `SKILL.md` instructions or implied code logic. However, the power of these operations, particularly the `GITHUB_PAT` for authenticated `git push` and the `playwright` browser, means that a compromised agent or a malicious input URL could lead to severe consequences, classifying it beyond benign.

Capability Assessment

✓ Purpose & Capability

The declared requirements (python3 and GOOGLE_API_KEY) match the SKILL.md: the pipeline extracts menu data via Gemini Vision, generates images via Gemini Image, and builds HTML using generated Python scripts. Requiring a Google API key is coherent for Gemini-based extraction and image generation, and python3 is reasonable because the agent will create and run Python scripts.

ℹ Instruction Scope

The instructions confine themselves to extracting menu data from user-supplied URLs/PDFs/photos, generating images, building local HTML, and optionally publishing to GitHub Pages. However, the extraction and image-generation steps explicitly send provided menu content (images/PDF text) to Google's Gemini APIs — users should be aware their menu content (and any embedded data in PDFs/photos) will be transmitted to Google. The SKILL.md does not instruct reading unrelated local files or environment variables.

✓ Install Mechanism

This is an instruction-only skill with no install spec and no code files, so nothing is downloaded or written by the installer itself. That keeps install risk low; the agent will generate Python scripts at runtime which will run locally under the user's environment.

✓ Credentials

Only one required env var (GOOGLE_API_KEY) is declared and used for Gemini calls — proportionate to the stated functionality. No unrelated credentials (e.g., AWS, GitHub) are required. Note: an optional publish step to GitHub Pages is listed but no GitHub token is requested by default; if the agent attempts to publish, it should request explicit permission and the appropriate GitHub credentials at that time.

✓ Persistence & Privilege

The skill does not request permanent/always-enabled presence. It does not modify other skills or global agent settings per the SKILL.md. Runtime-generated scripts operate within the user's environment when invoked.

How to Use

Make sure OpenClaw is installed (local or Docker)
Run the install command in chat: /install menuvision
After installation, invoke the skill by name or use /menuvision
Provide required inputs per the skill's parameter spec and get structured output

Version History

v1.0.1

Add portable output mode (base64 single-file HTML), make GitHub Pages publishing configurable via env vars, fix hardcoded repo

v1.0.0

Initial release: Instagram-style restaurant menu generator with AI food images via Gemini

Metadata

Slug menuvision

Version 1.0.1

License —

All-time Installs 0

Active Installs 0

Total Versions 2

Frequently Asked Questions

What is MenuVision?

Build beautiful HTML photo menus from restaurant URLs, PDFs, or photos using Gemini Vision and AI image generation. It is an AI Agent Skill for Claude Code / OpenClaw, with 601 downloads so far.

How do I install MenuVision?

Run "/install menuvision" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is MenuVision free?

Yes, MenuVision is completely free (open-source). You can download, install and use it at no cost.

Which platforms does MenuVision support?

MenuVision is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created MenuVision?

It is built and maintained by ademczuk (@ademczuk); the current version is v1.0.1.

More Skills