← 返回 Skills 市场
tooled-app

Ollama Model Router

作者 Tooled-app · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ✓ 安全检测通过
39
总下载
0
收藏
0
当前安装
1
版本数
在 OpenClaw 中安装
/install ollama-model-router
功能描述
Route tasks to the optimal cloud or local model based on task characteristics — coding, analysis, reasoning, creative, or general.
使用说明 (SKILL.md)

Model Router Skill

Intelligent model matching for OpenClaw. Pairs task type with the best available model.

When to Use

  • Before starting any non-trivial task
  • When current model is struggling with task type
  • To optimise token usage and latency
  • When you want best quality for specific domains
  • To balance local vs cloud based on privacy/speed needs

Model Registry

Define available models in ~/.openclaw/model-registry.json:

{
  "models": [
    {
      "id": "kimi-k2.6",
      "provider": "ollama",
      "host": "cloud",
      "tags": ["reasoning", "analysis", "coding", "general"],
      "strengths": ["long-context", "instruction-following", "chinese"],
      "weaknesses": ["creative-writing"],
      "max_tokens": 128000,
      "speed": "medium",
      "cost_tier": "free"
    },
    {
      "id": "llama3.3-70b",
      "provider": "ollama",
      "host": "local",
      "tags": ["coding", "analysis", "general"],
      "strengths": ["code-generation", "structured-output"],
      "weaknesses": ["creative-writing", "long-context"],
      "max_tokens": 8192,
      "speed": "fast",
      "cost_tier": "free"
    },
    {
      "id": "qwen2.5-coder",
      "provider": "ollama",
      "host": "local",
      "tags": ["coding", "technical"],
      "strengths": ["code-completion", "bug-fixing", "refactoring"],
      "weaknesses": ["general-chat", "creative"],
      "max_tokens": 32768,
      "speed": "fast",
      "cost_tier": "free"
    },
    {
      "id": "mistral-nemo",
      "provider": "ollama",
      "host": "local",
      "tags": ["reasoning", "analysis", "general"],
      "strengths": ["reasoning", "math", "logic"],
      "weaknesses": ["long-context"],
      "max_tokens": 32768,
      "speed": "fast",
      "cost_tier": "free"
    },
    {
      "id": "phi4",
      "provider": "ollama",
      "host": "local",
      "tags": ["coding", "technical", "analysis"],
      "strengths": ["code-generation", "structured-output"],
      "weaknesses": ["creative", "long-context"],
      "max_tokens": 16384,
      "speed": "very-fast",
      "cost_tier": "free"
    }
  ]
}

Task Classification

The router classifies tasks using keyword matching and optional LLM-based classification:

Task Type Keywords Preferred Models Fallback
coding code, function, bug, refactor, syntax, error, debug, implement qwen2.5-coder, phi4, llama3.3-70b kimi-k2.6
reasoning analyse, compare, evaluate, why, how, explain, logic mistral-nemo, kimi-k2.6 llama3.3-70b
creative write, story, poem, draft, design, creative, brainstorm kimi-k2.6, llama3.3-70b mistral-nemo
analysis data, summary, extract, parse, compare, metrics kimi-k2.6, mistral-nemo llama3.3-70b
general help, what, tell, describe, general kimi-k2.6, llama3.3-70b Any available
technical config, setup, install, deploy, architecture qwen2.5-coder, phi4 kimi-k2.6

Routing Decision Tree

1. Classify task type from user prompt
2. Filter models matching task tags
3. Score candidates by:
   - Tag match (exact = 3, related = 1)
   - Strength match (+2 per strength)
   - Speed preference (fast = +1 if user prefers speed)
   - Host preference (local = +1 if privacy needed)
4. Select highest score
5. Check availability (ping ollama)
6. If unavailable, go to next highest
7. Return model ID + reason

Usage

Manual Routing

# Before starting task, ask router
which-model "debug this Python function"
# → qwen2.5-coder (coding specialist, fast, local)

which-model "write a marketing email"
# → kimi-k2.6 (creative, long-context, cloud)

Automatic Routing

Set in OpenClaw config:

{
  "model_routing": {
    "enabled": true,
    "default": "kimi-k2.6",
    "auto_classify": true,
    "prefer_local": false,
    "prefer_speed": false
  }
}

Session Override

/model coding    # Force coding models
/model local     # Prefer local models
/model fast      # Prefer speed over quality
/model cloud     # Use cloud models only

Implementation

Check Available Models

curl -s http://localhost:11434/api/tags | jq '.models[].name'

Route Task

#!/bin/bash
TASK="$1"
REGISTRY="$HOME/.openclaw/model-registry.json"

# Classify task
if echo "$TASK" | grep -qiE "code|function|bug|refactor|syntax|error|debug|implement"; then
  TYPE="coding"
elif echo "$TASK" | grep -qiE "analyse|compare|evaluate|why|how.*does|explain|logic|reason"; then
  TYPE="reasoning"
elif echo "$TASK" | grep -qiE "write|story|poem|draft|design|creative|brainstorm"; then
  TYPE="creative"
elif echo "$TASK" | grep -qiE "data|summary|extract|parse|metrics|report"; then
  TYPE="analysis"
elif echo "$TASK" | grep -qiE "config|setup|install|deploy|architecture|build"; then
  TYPE="technical"
else
  TYPE="general"
fi

# Score models
echo "Task type: $TYPE"
echo "Recommended models:"
jq -r --arg type "$TYPE" '
  .models | map(
    . as $m |
    ($m.tags | index($type) // -1) as $tag_match |
    ($m.strengths | map(ascii_downcase) | index($type) // -1) as $strength_match |
    {
      model: $m.id,
      host: $m.host,
      score: (if $tag_match >= 0 then 3 else 0 end) + (if $strength_match >= 0 then 2 else 0 end),
      speed: $m.speed
    }
  ) | sort_by(-.score) | .[0:3] | .[]
' "$REGISTRY"

Fallback Chain

When preferred model is unavailable:

  1. Same provider, next best match
  2. Different provider, same capability tier
  3. General-purpose model (kimi-k2.6, llama3.3-70b)
  4. Default model as last resort

Integration with OpenClaw

Add to ~/.openclaw/config.json:

{
  "skills": {
    "model-router": {
      "enabled": true,
      "registry_path": "~/.openclaw/model-registry.json",
      "auto_route": true,
      "notify_on_switch": true
    }
  }
}

Benefits

  • Better results: Task-appropriate model = higher quality
  • Lower latency: Local models for simple tasks
  • Cost control: Use expensive/cloud models only when needed
  • Privacy: Route sensitive data to local models
  • Reliability: Automatic fallback when models fail

Related

  • ollama-model-management
  • token-optimisation
  • local-vs-cloud
  • openclaw-configuration

Resources

安全使用建议
Install only if you are comfortable with model-routing decisions that may choose cloud models. Prefer local mode for confidential code, personal data, credentials, or proprietary material, and avoid passing secrets because the helper prints the task text to the terminal.
能力评估
Purpose & Capability
The files consistently describe and implement model selection for Ollama-style local/cloud model entries using a registry, keyword classification, and scoring.
Instruction Scope
The skill encourages routing before non-trivial tasks and includes optional automatic routing; users should understand that cloud selections may expose prompt contents to the selected provider.
Install Mechanism
The artifact contains SKILL.md, a shell helper, and an example JSON registry; there is no installer, package dependency, obfuscated setup, or automatic startup behavior.
Credentials
The shell helper reads a user-chosen registry path and queries localhost Ollama with curl; this is proportionate for the stated purpose, though the cloud-model availability check is only assumed.
Persistence & Privilege
The documentation asks users to add OpenClaw config and a model registry under ~/.openclaw, but the artifact itself does not create persistence or elevate privileges.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install ollama-model-router
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /ollama-model-router 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.0
Initial release: intelligent model routing for OpenClaw. - Classifies tasks and routes them to the optimal cloud or local Ollama model based on type (coding, analysis, reasoning, creative, general, technical). - Uses a configurable model registry for scoring and selection, considering strengths, speed, privacy, and availability. - Supports both manual and automatic routing, with CLI examples and OpenClaw integration. - Provides fallback logic for high reliability when preferred models are unavailable. - Enables session override and user preferences (speed, privacy, host).
元数据
Slug ollama-model-router
版本 1.0.0
许可证 MIT-0
累计安装 0
当前安装数 0
历史版本数 1
常见问题

Ollama Model Router 是什么?

Route tasks to the optimal cloud or local model based on task characteristics — coding, analysis, reasoning, creative, or general. 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 39 次。

如何安装 Ollama Model Router?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install ollama-model-router」即可一键安装,无需额外配置。

Ollama Model Router 是免费的吗?

是的,Ollama Model Router 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。

Ollama Model Router 支持哪些平台?

Ollama Model Router 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 Ollama Model Router?

由 Tooled-app(@tooled-app)开发并维护,当前版本 v1.0.0。

💬 留言讨论