← 返回 Skills 市场

Ollama Model Router

Name: Ollama Model Router
Author: tooled-app

作者 Tooled-app · GitHub ↗ · v1.0.0 · MIT-0

cross-platform ✓ 安全检测通过

总下载

当前安装

版本数

在 OpenClaw 中安装

/install ollama-model-router

功能描述

Route tasks to the optimal cloud or local model based on task characteristics — coding, analysis, reasoning, creative, or general.

使用说明 (SKILL.md)

Model Router Skill

Intelligent model matching for OpenClaw. Pairs task type with the best available model.

When to Use

Before starting any non-trivial task
When current model is struggling with task type
To optimise token usage and latency
When you want best quality for specific domains
To balance local vs cloud based on privacy/speed needs

Model Registry

Define available models in ~/.openclaw/model-registry.json:

{
  "models": [
    {
      "id": "kimi-k2.6",
      "provider": "ollama",
      "host": "cloud",
      "tags": ["reasoning", "analysis", "coding", "general"],
      "strengths": ["long-context", "instruction-following", "chinese"],
      "weaknesses": ["creative-writing"],
      "max_tokens": 128000,
      "speed": "medium",
      "cost_tier": "free"
    },
    {
      "id": "llama3.3-70b",
      "provider": "ollama",
      "host": "local",
      "tags": ["coding", "analysis", "general"],
      "strengths": ["code-generation", "structured-output"],
      "weaknesses": ["creative-writing", "long-context"],
      "max_tokens": 8192,
      "speed": "fast",
      "cost_tier": "free"
    },
    {
      "id": "qwen2.5-coder",
      "provider": "ollama",
      "host": "local",
      "tags": ["coding", "technical"],
      "strengths": ["code-completion", "bug-fixing", "refactoring"],
      "weaknesses": ["general-chat", "creative"],
      "max_tokens": 32768,
      "speed": "fast",
      "cost_tier": "free"
    },
    {
      "id": "mistral-nemo",
      "provider": "ollama",
      "host": "local",
      "tags": ["reasoning", "analysis", "general"],
      "strengths": ["reasoning", "math", "logic"],
      "weaknesses": ["long-context"],
      "max_tokens": 32768,
      "speed": "fast",
      "cost_tier": "free"
    },
    {
      "id": "phi4",
      "provider": "ollama",
      "host": "local",
      "tags": ["coding", "technical", "analysis"],
      "strengths": ["code-generation", "structured-output"],
      "weaknesses": ["creative", "long-context"],
      "max_tokens": 16384,
      "speed": "very-fast",
      "cost_tier": "free"
    }
  ]
}

Task Classification

The router classifies tasks using keyword matching and optional LLM-based classification:

Task Type	Keywords	Preferred Models	Fallback
coding	code, function, bug, refactor, syntax, error, debug, implement	qwen2.5-coder, phi4, llama3.3-70b	kimi-k2.6
reasoning	analyse, compare, evaluate, why, how, explain, logic	mistral-nemo, kimi-k2.6	llama3.3-70b
creative	write, story, poem, draft, design, creative, brainstorm	kimi-k2.6, llama3.3-70b	mistral-nemo
analysis	data, summary, extract, parse, compare, metrics	kimi-k2.6, mistral-nemo	llama3.3-70b
general	help, what, tell, describe, general	kimi-k2.6, llama3.3-70b	Any available
technical	config, setup, install, deploy, architecture	qwen2.5-coder, phi4	kimi-k2.6

Routing Decision Tree

1. Classify task type from user prompt
2. Filter models matching task tags
3. Score candidates by:
   - Tag match (exact = 3, related = 1)
   - Strength match (+2 per strength)
   - Speed preference (fast = +1 if user prefers speed)
   - Host preference (local = +1 if privacy needed)
4. Select highest score
5. Check availability (ping ollama)
6. If unavailable, go to next highest
7. Return model ID + reason

Usage

Manual Routing

# Before starting task, ask router
which-model "debug this Python function"
# → qwen2.5-coder (coding specialist, fast, local)

which-model "write a marketing email"
# → kimi-k2.6 (creative, long-context, cloud)

Automatic Routing

Set in OpenClaw config:

{
  "model_routing": {
    "enabled": true,
    "default": "kimi-k2.6",
    "auto_classify": true,
    "prefer_local": false,
    "prefer_speed": false
  }
}

Session Override

/model coding    # Force coding models
/model local     # Prefer local models
/model fast      # Prefer speed over quality
/model cloud     # Use cloud models only

Implementation

Check Available Models

curl -s http://localhost:11434/api/tags | jq '.models[].name'

Route Task

#!/bin/bash
TASK="$1"
REGISTRY="$HOME/.openclaw/model-registry.json"

# Classify task
if echo "$TASK" | grep -qiE "code|function|bug|refactor|syntax|error|debug|implement"; then
  TYPE="coding"
elif echo "$TASK" | grep -qiE "analyse|compare|evaluate|why|how.*does|explain|logic|reason"; then
  TYPE="reasoning"
elif echo "$TASK" | grep -qiE "write|story|poem|draft|design|creative|brainstorm"; then
  TYPE="creative"
elif echo "$TASK" | grep -qiE "data|summary|extract|parse|metrics|report"; then
  TYPE="analysis"
elif echo "$TASK" | grep -qiE "config|setup|install|deploy|architecture|build"; then
  TYPE="technical"
else
  TYPE="general"
fi

# Score models
echo "Task type: $TYPE"
echo "Recommended models:"
jq -r --arg type "$TYPE" '
  .models | map(
    . as $m |
    ($m.tags | index($type) // -1) as $tag_match |
    ($m.strengths | map(ascii_downcase) | index($type) // -1) as $strength_match |
    {
      model: $m.id,
      host: $m.host,
      score: (if $tag_match >= 0 then 3 else 0 end) + (if $strength_match >= 0 then 2 else 0 end),
      speed: $m.speed
    }
  ) | sort_by(-.score) | .[0:3] | .[]
' "$REGISTRY"

Fallback Chain

When preferred model is unavailable:

Same provider, next best match
Different provider, same capability tier
General-purpose model (kimi-k2.6, llama3.3-70b)
Default model as last resort

Integration with OpenClaw

Add to ~/.openclaw/config.json:

{
  "skills": {
    "model-router": {
      "enabled": true,
      "registry_path": "~/.openclaw/model-registry.json",
      "auto_route": true,
      "notify_on_switch": true
    }
  }
}

Benefits

Better results: Task-appropriate model = higher quality
Lower latency: Local models for simple tasks
Cost control: Use expensive/cloud models only when needed
Privacy: Route sensitive data to local models
Reliability: Automatic fallback when models fail

ollama-model-management
token-optimisation
local-vs-cloud
openclaw-configuration

Resources

IKKF: https://ikkf.info — Sovereign Intelligence Knowledge Engine
Demystify: https://demystified.website — Tech explainers and analysis
Tooled: https://tooled.pro — Personal productivity platform
Ollama: https://ollama.com — Local LLM management
OpenClaw: https://openclaw.ai — AI agent platform

安全使用建议

Install only if you are comfortable with model-routing decisions that may choose cloud models. Prefer local mode for confidential code, personal data, credentials, or proprietary material, and avoid passing secrets because the helper prints the task text to the terminal.

能力评估

✓ Purpose & Capability

The files consistently describe and implement model selection for Ollama-style local/cloud model entries using a registry, keyword classification, and scoring.

ℹ Instruction Scope

The skill encourages routing before non-trivial tasks and includes optional automatic routing; users should understand that cloud selections may expose prompt contents to the selected provider.

✓ Install Mechanism

The artifact contains SKILL.md, a shell helper, and an example JSON registry; there is no installer, package dependency, obfuscated setup, or automatic startup behavior.

ℹ Credentials

The shell helper reads a user-chosen registry path and queries localhost Ollama with curl; this is proportionate for the stated purpose, though the cloud-model availability check is only assumed.

ℹ Persistence & Privilege

The documentation asks users to add OpenClaw config and a model registry under ~/.openclaw, but the artifact itself does not create persistence or elevate privileges.

如何使用

确保已安装 OpenClaw（本地或 Docker 部署）
在对话框中输入安装命令：/install ollama-model-router
安装完成后，直接呼叫该 Skill 的名称或使用 /ollama-model-router 触发
根据 Skill 的参数说明提供必要输入，即可获得结构化输出

版本历史

v1.0.0

Initial release: intelligent model routing for OpenClaw. - Classifies tasks and routes them to the optimal cloud or local Ollama model based on type (coding, analysis, reasoning, creative, general, technical). - Uses a configurable model registry for scoring and selection, considering strengths, speed, privacy, and availability. - Supports both manual and automatic routing, with CLI examples and OpenClaw integration. - Provides fallback logic for high reliability when preferred models are unavailable. - Enables session override and user preferences (speed, privacy, host).

元数据

Slug ollama-model-router

版本 1.0.0

许可证 MIT-0

累计安装 0

当前安装数 0

历史版本数 1

常见问题