← 返回 Skills 市场

Ollama Load Balancer

Name: Ollama Load Balancer
Author: twinsgeeks

作者 Twin Geeks · GitHub ↗ · v1.0.4 · MIT-0

darwinlinuxwindows ⚠ pending

256

总下载

当前安装

版本数

在 OpenClaw 中安装

/install ollama-load-balancer

功能描述

Ollama load balancer for Llama, Qwen, DeepSeek, and Mistral inference across multiple machines. Load balancing with auto-discovery via mDNS, health checks, q...

安全使用建议

This skill appears internally consistent for running a local Ollama load balancer, but it relies on you pip installing a third-party package and running local servers that manage model downloads and expose admin HTTP endpoints. Before installing: (1) review the ollama-herd PyPI package and GitHub repo to ensure the code matches expectations, (2) prefer installing in a sandbox/VM or isolated network, (3) disable or review auto-pull behavior to avoid unexpected large downloads, and (4) restrict access to the daemon's HTTP port (localhost-only or firewall) to prevent unauthorized remote control.

能力评估

✓ Purpose & Capability

Name/description (Ollama load balancer) matches the runtime instructions: auto-discovery, health checks, routing, and admin HTTP endpoints. Required binaries (curl/wget) and optional python/pip/sqlite3 are appropriate for a Python-based local service.

✓ Instruction Scope

SKILL.md instructs the agent to pip install ollama-herd and run local commands (herd / herd-node) and to call local HTTP endpoints on localhost:11435. It does not instruct reading unrelated system files or exfiltrating secrets. It does include administrative endpoints (pull/delete models) — expected for a load-balancer but potentially powerful if misused.

ℹ Install Mechanism

There is no registry install spec; the README tells the user to pip install ollama-herd from PyPI. pip installs are common but will execute third-party code on the host — moderate risk. The SKILL.md points to a PyPI project and GitHub repo (traceable), which is better than an arbitrary download, but users should review the package/source before installing.

✓ Credentials

The skill declares no required environment variables or credentials. The use of FLEET_MAX_RETRIES and runtime settings is plausible for a load balancer; no unexplained secret access is requested.

ℹ Persistence & Privilege

always:false (normal). The instructions create and use local config paths (~/.fleet-manager/...), run long-lived processes, and expose an admin HTTP interface. This is consistent with the stated functionality but means the package will persist on disk and run services — run with appropriate isolation and review.

如何使用

确保已安装 OpenClaw（本地或 Docker 部署）
在对话框中输入安装命令：/install ollama-load-balancer
安装完成后，直接呼叫该 Skill 的名称或使用 /ollama-load-balancer 触发
根据 Skill 的参数说明提供必要输入，即可获得结构化输出

版本历史

v1.0.4

Cross-platform support: macOS, Linux, and Windows. Updated OS metadata, descriptions, and hardware recommendations.

v1.0.3

Version 1.0.3 - Expanded internationalization in the description (added Chinese and Spanish). - Reworded across the documentation to consistently refer to "load balancer" for clarity. - Enhanced deployment and API examples for more explicit load balancer usage. - Updated feature and endpoint explanations to clarify their relation to the load balancer. - Incremented version metadata to 1.0.3.

v1.1.0

- Updated the description for improved clarity and to highlight support for Llama, Qwen, DeepSeek, and Mistral. - Reduced the zombie reaper’s request cleanup threshold from 15 minutes to 10 minutes. - No changes to functionality or API; documentation only.

v1.0.2

- Updated version to 1.0.2 in SKILL.md. - Clarified that auto-pull of missing models is now optional and disabled by default; enable via settings API. - Updated metadata for configPaths location. - No code or feature changes; documentation and metadata only.

v1.0.1

- Added "optionalBins" and "configPaths" fields to the metadata section in SKILL.md. - Now explicitly lists optional dependencies: python3, sqlite3, and pip. - Declares expected config and log file locations for fleet manager operation. - No changes to code or user-facing features.

v1.0.0

Initial release. - Load balances Ollama inference across multiple machines with automatic discovery, health checks, and real-time monitoring dashboard. - Built-in queue management, zero configuration setup, and automatic failover with retry logic. - Includes zombie request cleanup, VRAM-aware model routing, and operational analytics via SQL queries. - Web dashboard provides fleet status, trends, insights, and runtime toggles. - Exposes multiple REST API endpoints for fleet health, request traces, usage, and model management. - Designed for high availability and operational visibility in Ollama deployments.

元数据

Slug ollama-load-balancer

版本 1.0.4

许可证 MIT-0

累计安装 3

当前安装数 3

历史版本数 6

常见问题

Ollama Load Balancer 是什么？

Ollama load balancer for Llama, Qwen, DeepSeek, and Mistral inference across multiple machines. Load balancing with auto-discovery via mDNS, health checks, q... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件，目前累计下载 256 次。

如何安装 Ollama Load Balancer？

在 OpenClaw 或 Claude Code 对话框中运行命令「/install ollama-load-balancer」即可一键安装，无需额外配置。

Ollama Load Balancer 是免费的吗？

是的，Ollama Load Balancer 完全免费，采用 MIT-0 许可证，可自由下载、安装和使用。

Ollama Load Balancer 支持哪些平台？

Ollama Load Balancer 跨平台运行，可在任意部署了 OpenClaw / Claude Code 的环境中使用（darwin, linux, windows）。

谁开发了 Ollama Load Balancer？

由 Twin Geeks（@twinsgeeks）开发并维护，当前版本 v1.0.4。