← 返回 Skills 市场

Gemma Gemma3

Name: Gemma Gemma3
Author: twinsgeeks

作者 Twin Geeks · GitHub ↗ · v1.0.1 · MIT-0

darwinlinuxwindows ✓ 安全检测通过

177

总下载

当前安装

版本数

在 OpenClaw 中安装

/install gemma-gemma3

功能描述

Gemma 3 by Google — run Gemma 3 (4B, 12B, 27B) across your local device fleet. Google's most capable open model with 128K context, strong coding, and multili...

使用说明 (SKILL.md)

Gemma 3 — Run Google's Open Models Across Your Fleet

Gemma 3 is Google's most capable open-source LLM family. 128K context window, strong coding performance, multilingual support across 140+ languages. The fleet router picks the best device for every request — no manual load balancing.

Supported Gemma models

Model	Parameters	Ollama name	Best for
Gemma 3 27B	27B	`gemma3:27b`	Highest quality — rivals much larger models
Gemma 3 12B	12B	`gemma3:12b`	Balanced quality and speed
Gemma 3 4B	4B	`gemma3:4b`	Fast, runs on low-RAM devices
Gemma 3 1B	1B	`gemma3:1b`	Ultra-light, instant responses
CodeGemma 7B	7B	`codegemma`	Code-focused variant

Quick start

pip install ollama-herd    # PyPI: https://pypi.org/project/ollama-herd/
herd                       # start the router (port 11435)
herd-node                  # run on each device — finds the router automatically

No models are downloaded during installation. Models are pulled on demand when a request arrives, or manually via the dashboard. All pulls require user confirmation.

Use Gemma through the fleet

OpenAI SDK (drop-in replacement)

from openai import OpenAI

client = OpenAI(base_url="http://localhost:11435/v1", api_key="not-needed")

# Gemma 3 27B for complex reasoning
response = client.chat.completions.create(
    model="gemma3:27b",
    messages=[{"role": "user", "content": "Explain quantum entanglement to a 10-year-old"}],
    stream=True,
)
for chunk in response:
    print(chunk.choices[0].delta.content or "", end="")

Code generation with CodeGemma

response = client.chat.completions.create(
    model="codegemma",
    messages=[{"role": "user", "content": "Write a binary search tree in Rust with insert, delete, and search"}],
)
print(response.choices[0].message.content)

curl (Ollama format)

# Gemma 3 27B
curl http://localhost:11435/api/chat -d '{
  "model": "gemma3:27b",
  "messages": [{"role": "user", "content": "Translate to Japanese: The weather is beautiful today"}],
  "stream": false
}'

curl (OpenAI format)

curl http://localhost:11435/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{"model": "gemma3:4b", "messages": [{"role": "user", "content": "Hello"}]}'

Which Gemma for your hardware

Cross-platform: These are example configurations. Any device (Mac, Linux, Windows) with equivalent RAM works. The fleet router runs on all platforms.

Device	RAM	Best Gemma model
MacBook Air (8GB)	8GB	`gemma3:1b` — instant responses
Mac Mini (16GB)	16GB	`gemma3:4b` — strong for its size
Mac Mini (24GB)	24GB	`gemma3:12b` — great balance
MacBook Pro (36GB)	36GB	`gemma3:27b` — full power
Mac Studio (64GB+)	64GB+	`gemma3:27b` + `codegemma` simultaneously

Why Gemma locally

128K context — process entire codebases and long documents
140+ languages — multilingual without switching models
Google quality, zero cost — no per-token charges after hardware
Privacy — all data stays on your network
Fleet routing — multiple machines share the load

Check what's running

# Models loaded in memory
curl -s http://localhost:11435/api/ps | python3 -m json.tool

# Fleet health
curl -s http://localhost:11435/dashboard/api/health | python3 -m json.tool

Web dashboard at http://localhost:11435/dashboard — live monitoring.

Also available on this fleet

Other LLMs

Llama 3.3, Qwen 3.5, DeepSeek-V3, DeepSeek-R1, Phi 4, Mistral, Codestral — same endpoint.

Image generation

curl -o image.png http://localhost:11435/api/generate-image \
  -d '{"model": "z-image-turbo", "prompt": "a gemstone catching light", "width": 1024, "height": 1024}'

Speech-to-text

curl http://localhost:11435/api/transcribe -F "[email protected]" -F "model=qwen3-asr"

Embeddings

curl http://localhost:11435/api/embed \
  -d '{"model": "nomic-embed-text", "input": "Google Gemma open source language model"}'

Full documentation

Contribute

Ollama Herd is open source (MIT). Stars, issues, and PRs welcome — from humans and AI agents alike:

GitHub — 444 tests, fully async, CLAUDE.md makes AI agents productive instantly
Found a bug? Open an issue
Want to add a feature? Fork, branch, PR — the test suite runs in under 40 seconds

Guardrails

Model downloads require explicit user confirmation — Gemma models range from 1GB (1B) to 16GB (27B).
Model deletion requires explicit user confirmation.
Never delete or modify files in ~/.fleet-manager/.
No models are downloaded automatically — all pulls are user-initiated or require opt-in via auto_pull.

安全使用建议

This skill is internally consistent with its purpose, but before installing you should: 1) Verify the upstream project and PyPI package (https://github.com/geeks-accelerator/ollama-herd and the PyPI package 'ollama-herd') to ensure they are official/trustworthy and inspect the code if possible; 2) Prefer pinning a known-good package version rather than installing an unpinned latest; 3) Run installation/testing in an isolated environment (VM/container) first; 4) Be aware that running 'herd'/'herd-node' opens a local network service (port 11435) and may pull multi-gigabyte model files — restrict network/firewall access to trusted hosts and confirm that model downloads truly require explicit confirmation; 5) Review ~/.fleet-manager/* logs/configs for sensitive data and follow the documented guardrails rather than blindly deleting/modifying files. If you cannot verify the package source or code, treat the installation as higher risk.

能力评估

✓ Purpose & Capability

The name/description claim (run Gemma models locally across a fleet via an Ollama Herd router) matches the instructions: pip-install an 'ollama-herd' package and run 'herd' and 'herd-node' to provide a local endpoint. Required binaries (curl/wget) and optional python/pip are reasonable for this functionality.

✓ Instruction Scope

SKILL.md stays on-topic: it tells the agent to install/run the herd/router, how to call the local API (localhost:11435), how to check status, and documents model choices and guardrails (downloads require user confirmation). It does not instruct reading unrelated system files or exfiltrating secrets.

ℹ Install Mechanism

There is no built-in install spec; the instructions tell the user to 'pip install ollama-herd' from PyPI. Installing a third-party package and running a network service is expected for this use case, but it is a higher-risk action because the package code executes locally and is not vetted by this scanner.

✓ Credentials

The skill declares no required environment variables or credentials. Metadata references a couple of config paths (~/.fleet-manager/...), which are plausible for a fleet manager and are mentioned in the guardrails (do not modify). There are no unexplained secret requests.

✓ Persistence & Privilege

The skill is not always-enabled and does not request elevated platform privileges. It instructs running a local service (herd) and per-node agents (herd-node), which is appropriate for a fleet router and does not modify other skill configurations.

如何使用

确保已安装 OpenClaw（本地或 Docker 部署）
在对话框中输入安装命令：/install gemma-gemma3
安装完成后，直接呼叫该 Skill 的名称或使用 /gemma-gemma3 触发
根据 Skill 的参数说明提供必要输入，即可获得结构化输出

版本历史

v1.0.1

Cross-platform support: macOS, Linux, and Windows. Updated OS metadata, descriptions, and hardware recommendations.

v1.0.0

- Initial release of Gemma 3 support via Ollama Herd for Mac and Linux. - Run Gemma 3 (4B, 12B, 27B, 1B) and CodeGemma 7B models locally, routed across your device fleet. - 128K context, strong multilingual and coding abilities, with zero cloud costs. - Fleet routing automatically balances requests to the best available machine. - Built-in privacy: all data stays on your network; models downloaded only with user confirmation. - Additional features include dashboard monitoring, compatibility with major LLMs, image generation, speech-to-text, and embeddings.

元数据

Slug gemma-gemma3

版本 1.0.1

许可证 MIT-0

累计安装 2

当前安装数 2

历史版本数 2

常见问题

Gemma Gemma3 是什么？

Gemma 3 by Google — run Gemma 3 (4B, 12B, 27B) across your local device fleet. Google's most capable open model with 128K context, strong coding, and multili... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件，目前累计下载 177 次。

如何安装 Gemma Gemma3？

在 OpenClaw 或 Claude Code 对话框中运行命令「/install gemma-gemma3」即可一键安装，无需额外配置。

Gemma Gemma3 是免费的吗？

是的，Gemma Gemma3 完全免费，采用 MIT-0 许可证，可自由下载、安装和使用。

Gemma Gemma3 支持哪些平台？

Gemma Gemma3 跨平台运行，可在任意部署了 OpenClaw / Claude Code 的环境中使用（darwin, linux, windows）。

谁开发了 Gemma Gemma3？

由 Twin Geeks（@twinsgeeks）开发并维护，当前版本 v1.0.1。