← Back to Skills Marketplace
601
Downloads
0
Stars
0
Active Installs
2
Versions
Install in OpenClaw
/install menuvision
Description
Build beautiful HTML photo menus from restaurant URLs, PDFs, or photos using Gemini Vision and AI image generation
Usage Guidance
This skill appears to do what it says, but consider these before installing: 1) Privacy: PDFs and photos you provide will be sent to Google's Gemini APIs for extraction and image generation — do not upload sensitive personal data unless you're comfortable with Google processing it. 2) API key scope: use a dedicated Google API key or service account with minimal required permissions and rotate it if shared. 3) Local scripts: the agent will generate and run Python scripts locally (ensure python3 is the expected interpreter). 4) Publishing: if you choose to publish to GitHub Pages, the agent should prompt for explicit GitHub credentials; do not provide tokens unless you trust the action. 5) Source provenance: the metadata points at a GitHub repo (github.com/ademczuk/MenuVision); if provenance matters, review that repository or contact the author before trusting the skill.
Capability Analysis
Type: OpenClaw Skill
Name: menuvision
Version: 1.0.1
The skill bundle is classified as suspicious due to its extensive use of high-risk capabilities, including the execution of `git` commands with a `GITHUB_PAT` for publishing to GitHub Pages, the installation and use of `playwright` (a headless browser) for web scraping, and broad file system access. While these capabilities are necessary for the skill's stated purpose of building and optionally publishing restaurant menus, they introduce significant attack surface and potential for misuse. There is no clear evidence of intentional malicious behavior (e.g., data exfiltration to unauthorized endpoints, persistence mechanisms, or obfuscation) within the `SKILL.md` instructions or implied code logic. However, the power of these operations, particularly the `GITHUB_PAT` for authenticated `git push` and the `playwright` browser, means that a compromised agent or a malicious input URL could lead to severe consequences, classifying it beyond benign.
Capability Assessment
Purpose & Capability
The declared requirements (python3 and GOOGLE_API_KEY) match the SKILL.md: the pipeline extracts menu data via Gemini Vision, generates images via Gemini Image, and builds HTML using generated Python scripts. Requiring a Google API key is coherent for Gemini-based extraction and image generation, and python3 is reasonable because the agent will create and run Python scripts.
Instruction Scope
The instructions confine themselves to extracting menu data from user-supplied URLs/PDFs/photos, generating images, building local HTML, and optionally publishing to GitHub Pages. However, the extraction and image-generation steps explicitly send provided menu content (images/PDF text) to Google's Gemini APIs — users should be aware their menu content (and any embedded data in PDFs/photos) will be transmitted to Google. The SKILL.md does not instruct reading unrelated local files or environment variables.
Install Mechanism
This is an instruction-only skill with no install spec and no code files, so nothing is downloaded or written by the installer itself. That keeps install risk low; the agent will generate Python scripts at runtime which will run locally under the user's environment.
Credentials
Only one required env var (GOOGLE_API_KEY) is declared and used for Gemini calls — proportionate to the stated functionality. No unrelated credentials (e.g., AWS, GitHub) are required. Note: an optional publish step to GitHub Pages is listed but no GitHub token is requested by default; if the agent attempts to publish, it should request explicit permission and the appropriate GitHub credentials at that time.
Persistence & Privilege
The skill does not request permanent/always-enabled presence. It does not modify other skills or global agent settings per the SKILL.md. Runtime-generated scripts operate within the user's environment when invoked.
How to Use
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat:
/install menuvision - After installation, invoke the skill by name or use
/menuvision - Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.1
Add portable output mode (base64 single-file HTML), make GitHub Pages publishing configurable via env vars, fix hardcoded repo
v1.0.0
Initial release: Instagram-style restaurant menu generator with AI food images via Gemini
Metadata
Frequently Asked Questions
What is MenuVision?
Build beautiful HTML photo menus from restaurant URLs, PDFs, or photos using Gemini Vision and AI image generation. It is an AI Agent Skill for Claude Code / OpenClaw, with 601 downloads so far.
How do I install MenuVision?
Run "/install menuvision" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.
Is MenuVision free?
Yes, MenuVision is completely free (open-source). You can download, install and use it at no cost.
Which platforms does MenuVision support?
MenuVision is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).
Who created MenuVision?
It is built and maintained by ademczuk (@ademczuk); the current version is v1.0.1.
More Skills