← Back to Skills Marketplace

Lemonade Server Manager

Name: Lemonade Server Manager
Author: james-martinez

by James Martinez · GitHub ↗ · v1.0.7 · MIT-0

linuxdarwinwin32 ✓ Security Clean

193

Downloads

Stars

Active Installs

Versions

Install in OpenClaw

/install lemonade-server-manager

Description

Manage Lemonade Servers natively. Use when checking system info, health status, listing available models, pulling or loading new models, completing LLM chats...

README (SKILL.md)

Lemonade Server Management

Interact with and manage local or remote Lemonade AI Server hardware directly via standard native network requests (curl).

When to Use

Checking local GPU/NPU health and currently loaded VRAM resources
Listing available, downloaded text/image models on a Lemonade cluster
Pulling, loading, or unloading multimodal models
Generating text from LLMs (chat completions)
Generating stable-diffusion image responses

Setup Instructions

Every endpoint requires a base server_url. If one is not specified by the user, assume http://localhost:8000.
Use the LEMONADE_API_KEY environment variable to authenticate.
This skill stays focused on making HTTP requests to a base server_url (default http://localhost:8000) and using an optional LEMONADE_API_KEY. It does not instruct reading arbitrary host files or other env vars.
Operational note: The skill will transmit prompts and model requests to whichever server_url is used. If a remote URL is supplied, the agent will send user content and model commands there (expected for this skill, but worth the user's attention).

API Operations

System Info

Get hardware capabilities and device enumeration limits.

Returns: JSON object with hardware capabilities.

# Example Request
curl -X GET "http://localhost:8000/api/v1/system-info" \
  -H "Authorization: Bearer ${LEMONADE_API_KEY}"

Health Check

Verify status and monitor currently loaded models to prevent VRAM overflow.

Returns: JSON object containing server status and currently loaded models.

# Example Request
curl -X GET "http://localhost:8000/api/v1/health" \
  -H "Authorization: Bearer ${LEMONADE_API_KEY}"

List Models

Get an array of downloaded models available to load into memory.

Returns: JSON list of all downloaded and available models.

# Example Request
curl -X GET "http://localhost:8000/api/v1/models" \
  -H "Authorization: Bearer ${LEMONADE_API_KEY}"

Pull Model

Download and install a new model string to the target machine.

Returns: JSON stream or object confirming download status.

# Example Request
curl -X POST "http://localhost:8000/api/v1/pull" \
  -H "Authorization: Bearer ${LEMONADE_API_KEY}" \
  -H "Content-Type: application/json" \
  -d '{"model": "llama3"}'

Load Model

Load a model into VRAM/NPU to prepare for prompt responses.

Returns: JSON object confirming model loaded into memory.

# Example Request
curl -X POST "http://localhost:8000/api/v1/load" \
  -H "Authorization: Bearer ${LEMONADE_API_KEY}" \
  -H "Content-Type: application/json" \
  -d '{"model": "llama3"}'

Unload Model

Force unload a model to free up memory before loading a larger variant.

Returns: JSON object confirming model unloaded.

# Example Request
curl -X POST "http://localhost:8000/api/v1/unload" \
  -H "Authorization: Bearer ${LEMONADE_API_KEY}" \
  -H "Content-Type: application/json" \
  -d '{"model": "llama3"}'

Chat Completion

Send a standard chat request to the LLM backend.

Returns: JSON object containing standard chat completion response.

# Example Request
curl -X POST "http://localhost:8000/api/v1/chat/completions" \
  -H "Authorization: Bearer ${LEMONADE_API_KEY}" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "llama3",
    "messages": [
      {"role": "user", "content": "Hello, world!"}
    ]
  }'

Generate Image

Submit a stable-diffusion prompt for image generation.

Returns: JSON object containing base64 generated image data.

# Example Request
curl -X POST "http://localhost:8000/api/v1/images/generations" \
  -H "Authorization: Bearer ${LEMONADE_API_KEY}" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "sdxl",
    "prompt": "A majestic lion standing on a cliff, digital art"
  }'

Tips

The flm, ryzenai-llm, and whispercpp backends are mutually exclusive on an NPU. Always use the /api/v1/unload endpoint on conflicting LLMs before attempting to load a Whisper model on an NPU.
Check /api/v1/health first to assess available hardware VRAM before pulling or loading a multi-gigabyte LLM to avoid out-of-memory errors on the host.
Provide clear feedback to the user if an API request fails due to resource constraints and suggest unloading inactive background models.

Usage Guidance

This skill appears coherent for managing Lemonade servers. Before installing: (1) Be aware that any content you send through the skill will be forwarded to whatever server_url you (or the agent) supplies — avoid sending sensitive data to untrusted remote endpoints. (2) Prefer using local servers (http://localhost:8000) or HTTPS endpoints you control; verify TLS and host identity for remote servers. (3) The API key is optional — only set LEMONADE_API_KEY if you trust the target server. (4) The registry metadata has no homepage while SKILL.md lists a GitHub repo; if provenance matters, manually inspect the repository/owner to confirm trustworthiness before installing or exporting credentials. (5) Because the skill can be invoked by the agent autonomously (platform default), review agent autonomy settings if you want to restrict automatic calls to remote servers.

Capability Analysis

Type: OpenClaw Skill Name: lemonade-server-manager Version: 1.0.7 The lemonade-server-manager skill is a legitimate tool designed to manage local or remote AI servers via REST API calls using curl. It provides standard functionality for monitoring system health, managing model lifecycles (pull, load, unload), and performing inference (chat and image generation). The skill explicitly limits its scope to HTTP requests, uses a dedicated environment variable for authentication (LEMONADE_API_KEY), and contains no evidence of malicious execution, data exfiltration, or prompt injection.

Capability Assessment

✓ Purpose & Capability

Name/description match the declared operations (system info, health, list/pull/load/unload models, chat, image generation). The only runtime requirement (curl) and the optional LEMONADE_API_KEY are appropriate for an HTTP-based server-management skill.

ℹ Instruction Scope

SKILL.md confines behavior to making HTTP requests to a base server_url (default http://localhost:8000) and explicitly warns prompts/model requests will be transmitted to that URL. It does not instruct reading arbitrary host files or other env vars. Note: if a remote server_url is supplied, user prompts and model payloads will be sent to that remote endpoint — expected for this skill but important for user privacy.

✓ Install Mechanism

Instruction-only skill with no install spec or code files — lowest install risk. README suggests manual clone or ClawHub install, but nothing is automatically downloaded by the skill itself.

✓ Credentials

Only an optional LEMONADE_API_KEY is declared and used by the documented curl examples. No unrelated credentials, config paths, or high-privilege env vars are requested.

ℹ Persistence & Privilege

always:false and user-invocable:true (defaults). The skill can be invoked autonomously by the agent (platform default). Combined with the ability to send user prompts to arbitrary server_url values, this is a privacy consideration but not an incoherence or disproportionate privilege in itself.

How to Use

Make sure OpenClaw is installed (local or Docker)
Run the install command in chat: /install lemonade-server-manager
After installation, invoke the skill by name or use /lemonade-server-manager
Provide required inputs per the skill's parameter spec and get structured output

Version History

v1.0.7

Release v1.0.7

v1.0.6

Release v1.0.6

v1.0.5

Release v1.0.5

v1.0.4

Release v1.0.4

v1.0.3

Release v1.0.3

v1.0.2

Release v1.0.2

v1.0.0

Lemonade Master Manager 1.0.0 – Initial release. - Provides tools for managing local AI models and hardware resources across multiple Lemonade Servers. - Supports system info retrieval, health checks, model listing, pulling, loading, and unloading. - Enables chat completions and image generation inference. - Offers built-in support for multi-server management and hardware awareness, including NPU exclusivity rules. - Prompts users to free up memory by unloading models when hardware limitations are detected.

Metadata

Slug lemonade-server-manager

Version 1.0.7

License MIT-0

All-time Installs 0

Active Installs 0

Total Versions 7

Frequently Asked Questions

What is Lemonade Server Manager?

Manage Lemonade Servers natively. Use when checking system info, health status, listing available models, pulling or loading new models, completing LLM chats... It is an AI Agent Skill for Claude Code / OpenClaw, with 193 downloads so far.

How do I install Lemonade Server Manager?

Run "/install lemonade-server-manager" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Lemonade Server Manager free?

Yes, Lemonade Server Manager is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Lemonade Server Manager support?

Lemonade Server Manager is cross-platform and runs anywhere OpenClaw / Claude Code is available (linux, darwin, win32).

Who created Lemonade Server Manager?

It is built and maintained by James Martinez (@james-martinez); the current version is v1.0.7.

More Skills