zypher-agent

corespeed-nanobanana

Author: Zypher Agent · GitHub · v0.0.2 · MIT-0
cross-platform ✓ Security check passed
Total downloads: 162 · Favorites: 0 · Current installs: 1 · Versions: 2
Install in OpenClaw
/install corespeed-nanobanana
Description
Generate and edit images using Google Gemini models via Corespeed AI Gateway. Supports text-to-image generation, image editing, multi-image input, and text r...
Usage instructions (SKILL.md)

Corespeed NanoBanana — Gemini Image & Text Generation

Auth: Set CS_AI_GATEWAY_BASE_URL and CS_AI_GATEWAY_API_TOKEN environment variables.
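
A pre-flight check can catch a missing variable before the script is run. The following is a minimal sketch, not part of the skill itself: the helper name missing_gateway_vars is hypothetical, and only the two variable names come from the instructions above.

```python
import os

# The two variable names are taken from the Auth line above.
REQUIRED_VARS = ("CS_AI_GATEWAY_BASE_URL", "CS_AI_GATEWAY_API_TOKEN")


def missing_gateway_vars(env=None):
    """Return the required gateway variables that are unset or empty.

    `env` defaults to the real process environment; passing a dict makes
    the helper easy to test. (Hypothetical helper, not shipped with the skill.)
    """
    env = os.environ if env is None else env
    return [name for name in REQUIRED_VARS if not env.get(name)]


if __name__ == "__main__":
    missing = missing_gateway_vars()
    if missing:
        print("Missing environment variables: " + ", ".join(missing))
    else:
        print("Gateway environment looks configured.")
```

The check only confirms that the variables are present; it does not verify that the URL or token is valid.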

Workflow

  1. Pick a model from the table below (default: gemini-2.5-flash-image for image generation)
  2. Run the script with your prompt

Usage

uv run {baseDir}/scripts/gemini.py --prompt "your prompt" -f output.ext [-i input.ext] [--model MODEL]
  • --prompt, -p — Text prompt (required)
  • --filename, -f — Output filename (required)
  • --input, -i — Input image file(s), repeat for multiple
  • --model, -m — Model name (default: gemini-2.5-flash-image)
  • --modalities — Response type: auto, image, text, image+text (default: auto)
  • --json — Output structured JSON (recommended for agent consumption)
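
When the script is driven from another program, the flags above can be assembled as an argument list instead of a hand-written shell string. This is a sketch under assumptions: build_command is a hypothetical helper, and the script path "scripts/gemini.py" stands in for the {baseDir} placeholder, which depends on where the skill is installed.

```python
import shlex


def build_command(script, prompt, filename, inputs=(), model=None,
                  modalities=None, as_json=False):
    """Assemble the `uv run` argument list from the documented flags.

    Hypothetical helper: it only builds the list and does not run anything.
    """
    cmd = ["uv", "run", script, "--prompt", prompt, "--filename", filename]
    for path in inputs:
        # --input is repeated once per input image, as documented.
        cmd += ["--input", path]
    if model:
        cmd += ["--model", model]
    if modalities:
        cmd += ["--modalities", modalities]
    if as_json:
        cmd.append("--json")
    return cmd


# "scripts/gemini.py" is an assumed path standing in for {baseDir}/scripts/gemini.py.
cmd = build_command(
    "scripts/gemini.py",
    "Blend these two scenes together",
    "blend.png",
    inputs=["scene1.png", "scene2.png"],
    as_json=True,
)
print(shlex.join(cmd))
```

The list can be passed directly to subprocess.run, so prompts containing quotes or spaces need no shell escaping.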

Output format is determined by file extension: .png/.jpg → image generation, .txt/.md → text output.
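
The extension rule can be written as a small lookup. This sketch mirrors the documented rule only; it is not taken from the script's source. The function name output_kind, the case-insensitive matching, and the "unknown" fallback are assumptions, since the instructions list only these four extensions.

```python
from pathlib import Path

# Extensions listed in the usage instructions; anything else is undocumented.
IMAGE_EXTS = {".png", ".jpg"}
TEXT_EXTS = {".txt", ".md"}


def output_kind(filename):
    """Classify an output filename as 'image', 'text', or 'unknown'.

    Lowercasing the extension is an assumption; the documentation does not
    say whether the script treats '.PNG' the same as '.png'.
    """
    ext = Path(filename).suffix.lower()
    if ext in IMAGE_EXTS:
        return "image"
    if ext in TEXT_EXTS:
        return "text"
    return "unknown"


for name in ("fox.png", "desc.txt", "clip.gif"):
    print(name, "->", output_kind(name))
```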

Image Generation

# Text-to-image
uv run {baseDir}/scripts/gemini.py -p "a watercolor fox in autumn forest" -f fox.png

# Image editing
uv run {baseDir}/scripts/gemini.py -p "Remove background, add beach sunset" -f edited.png -i photo.jpg

# Multi-image compositing
uv run {baseDir}/scripts/gemini.py -p "Blend these two scenes together" -f blend.png -i scene1.png -i scene2.png

Image Analysis

# Describe an image
uv run {baseDir}/scripts/gemini.py -p "Describe this image" -f desc.txt -i photo.jpg --model gemini-2.5-flash

# Compare images
uv run {baseDir}/scripts/gemini.py -p "What are the differences?" -f diff.txt -i before.jpg -i after.jpg --model gemini-2.5-flash

Text Generation

# Use the most capable model for complex tasks
uv run {baseDir}/scripts/gemini.py -p "Write a haiku about coding" -f haiku.txt --model gemini-2.5-pro

Models

Model | Type | Best For
gemini-2.5-flash-image | Image + Text | Image generation & editing (default)
gemini-2.5-flash | Text | Fast analysis, vision, general tasks
gemini-2.5-pro | Text | Complex reasoning, highest quality
gemini-2.5-flash-lite | Text | Fastest, simple tasks
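
The model list can also be checked in code, so that an image request is not sent to a text-only model. The following is a sketch built from the model list above; the dictionary layout and the check_model helper are hypothetical and do not reflect how the script validates models.

```python
# Model names and types copied from the Models list; the structure is illustrative.
MODELS = {
    "gemini-2.5-flash-image": "Image + Text",
    "gemini-2.5-flash": "Text",
    "gemini-2.5-pro": "Text",
    "gemini-2.5-flash-lite": "Text",
}

DEFAULT_MODEL = "gemini-2.5-flash-image"


def check_model(model, wants_image):
    """Return None if the choice looks valid, otherwise a short error message."""
    if model not in MODELS:
        return f"unknown model: {model}"
    if wants_image and "Image" not in MODELS[model]:
        return f"{model} is text-only; use {DEFAULT_MODEL} for image output"
    return None


print(check_model("gemini-2.5-pro", wants_image=True))
print(check_model(DEFAULT_MODEL, wants_image=True))
```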

Notes

  • No manual Python setup required. The script uses PEP 723 inline metadata. uv run automatically creates an isolated virtual environment and installs the google-genai dependency on first run.
  • Image output is returned inline as base64 from the Gemini API — no separate download step.
  • Use timestamps in filenames: yyyy-mm-dd-hh-mm-ss-name.ext.
  • The script prints a MEDIA: line so that OpenClaw can auto-attach generated images.
  • Do not read generated media back; report the saved path only.
  • Only gemini-2.5-flash-image can generate images. Other models are text-only.
  • Use --json for structured output: {"ok": true, "files": [...], "text": "...", "model": "...", "tokens": {...}}
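
Two of the notes above, timestamped filenames and --json output, can be handled with a few lines of Python. This is a sketch under assumptions: both helper names are hypothetical, the sample payload is invented to match the documented keys, and it assumes that stdout contains only the JSON object when --json is used and that "files" holds path strings.

```python
import json
from datetime import datetime


def timestamped_name(name, now=None):
    """Prefix a filename with yyyy-mm-dd-hh-mm-ss, as the notes recommend."""
    now = now or datetime.now()
    return now.strftime("%Y-%m-%d-%H-%M-%S") + "-" + name


def parse_result(stdout):
    """Parse the --json payload and return (files, text).

    Assumes stdout is exactly one JSON object with the documented keys.
    Raises RuntimeError when "ok" is missing or false.
    """
    result = json.loads(stdout)
    if not result.get("ok"):
        raise RuntimeError("script reported failure")
    return result.get("files", []), result.get("text", "")


# Invented sample payload shaped like the documented structure.
sample = json.dumps({
    "ok": True,
    "files": [timestamped_name("fox.png", datetime(2025, 1, 2, 3, 4, 5))],
    "text": "",
    "model": "gemini-2.5-flash-image",
    "tokens": {},
})
files, text = parse_result(sample)
print(files)
```

In line with the note about not reading media back, the sketch returns only the saved paths and never opens the files.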

Support

Built by Corespeed. If you need help or run into issues:

Security recommendations
This skill sends prompts and any input images to the CS_AI_GATEWAY_BASE_URL using CS_AI_GATEWAY_API_TOKEN — only install it if you trust the gateway host. Verify the gateway URL and token come from a trusted Corespeed instance (or your own gateway). Be aware the tool will upload provided input files and will print full file paths and short prompt text to stdout; if those are sensitive, run in an isolated environment, use ephemeral credentials, or avoid supplying sensitive images/text. Installing/running will cause uv and google-genai packages to be installed in an isolated venv — review those packages if you require stricter supply-chain controls.
Functional analysis
Type: OpenClaw Skill · Name: corespeed-nanobanana · Version: 0.0.2
The skill is a legitimate tool for generating and analyzing images and text using Google Gemini models via the Corespeed AI Gateway. The Python script (scripts/gemini.py) uses the official google-genai library and follows standard practices for handling environment variables and file I/O, with no evidence of malicious intent, data exfiltration, or unauthorized execution.
Capability assessment
Purpose & Capability
Name/description (Gemini image & text generation via Corespeed AI Gateway) match the required binary (uv), the two env vars (CS_AI_GATEWAY_BASE_URL, CS_AI_GATEWAY_API_TOKEN), and the included script which uses the Google genai client pointed at the gateway.
Instruction Scope
Runtime instructions and the script are scoped to sending prompts and input images to the configured gateway and saving returned media/text locally. The script logs prompt snippets and input/output file paths and sizes; it will upload any input files provided and send prompt text to the gateway — this is expected for the skill but means those artifacts are transmitted to the gateway host.
Install Mechanism
The skill is instruction-only (no forced installer), but SKILL.md metadata suggests 'pip install uv' and the script relies on uv to create a venv and install google-genai at runtime. This is a typical, moderate-risk behavior (packages come from PyPI). No arbitrary download URLs or archive extraction were observed.
Credentials
Only the gateway base URL and API token are required and actually used by the script. No unrelated secrets, system config paths, or extra credentials are requested.
Persistence & Privilege
always is false and the skill doesn't attempt to modify other skills or system-wide configuration. It writes its own output files only (user-specified paths).
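How to use
  1. Make sure OpenClaw is installed (locally or via Docker).
  2. Enter the install command in the chat box: /install corespeed-nanobanana
  3. After installation, invoke the skill by name or trigger it with /corespeed-nanobanana
  4. Provide the required inputs as described in the skill's parameter documentation to get structured output.
Version history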
v0.0.2
Version 0.0.2 of corespeed-nanobanana
  • No file changes detected in this release.
  • No new features, bug fixes, or documentation updates included.
v0.0.1
Initial release: Gemini image & text generation via Corespeed AI Gateway
Metadata
Slug: corespeed-nanobanana
Version: 0.0.2
License: MIT-0
Total installs: 1
Current installs: 1
Historical versions: 2
FAQ

What is corespeed-nanobanana?

Generate and edit images using Google Gemini models via Corespeed AI Gateway. Supports text-to-image generation, image editing, multi-image input, and text r... It is an AI Agent Skill plugin for Claude Code / OpenClaw, with 162 total downloads to date.

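How do I install corespeed-nanobanana?

Run the command "/install corespeed-nanobanana" in the OpenClaw or Claude Code chat box for a one-click install. Installation itself needs no extra configuration, but using the skill requires the CS_AI_GATEWAY_BASE_URL and CS_AI_GATEWAY_API_TOKEN environment variables to be set.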

Is corespeed-nanobanana free?

Yes. corespeed-nanobanana is completely free under the MIT-0 license and can be freely downloaded, installed, and used.

Which platforms does corespeed-nanobanana support?

corespeed-nanobanana is cross-platform and can be used in any environment where OpenClaw / Claude Code is deployed.

Who develops corespeed-nanobanana?

It is developed and maintained by Zypher Agent (@zypher-agent). The current version is v0.0.2.
