GLM Autoroute
/install glm-autoroute
GLM Autoroute
Binary model routing for ZAI GLM models - lightweight vs heavyweight tasks.
Introduction
- GLM-4.7 is the default model. Only spawn GLM-5 when the task actually needs it.
- Use sessions_spawn to run tasks with GLM-5:
sessions_spawn({
task: "\x3Cthe full task description>",
model: "zai/glm-5",
label: "\x3Cshort task label>"
})
- After done with GLM-5, the main session continues with GLM-4.7 as default.
Models
GLM-4.7 (DEFAULT - zai/glm-4.7)
Use for lightweight tasks:
- Simple Q&A - What, When, Who, Where
- Casual chat - No reasoning needed
- Quick lookups
- File lookups
- Simple tasks - repetitive tasks, formatting
- Cron Jobs - if it needs reasoning, THEN ESCALATE TO GLM-5
- Status checks
- Basic confirmations
- Provide concise output, just plain answer, no explaining
DO NOT:
- ❌ DO NOT CODE WITH GLM-4.7
- ❌ DO NOT ANALYZE USING GLM-4.7
- ❌ DO NOT ATTEMPT ANY REASONING USING GLM-4.7
- ❌ DO NOT RESEARCH USING GLM-4.7
- If you think the request does not fall into point 1-8, THEN ESCALATE TO GLM-5
- If you think you will violate the DO NOT list, THEN ESCALATE TO GLM-5
GLM-5 (zai/glm-5)
Use for heavyweight tasks:
- Coding (any complexity)
- Analysis & debugging
- Multi-step reasoning
- Research & investigation
- Critical planning
- Architecture decisions
- Complex problem solving
- Deep research
- Critical decisions
- Detailed explanations
Examples
| Task | Model | Why |
|---|---|---|
| "Check calendar" | GLM-4.7 | Simple lookup |
| "What time is it?" | GLM-4.7 | Simple Q&A |
| "Heartbeat check" | GLM-4.7 | Routine |
| "Read this file" | GLM-4.7 | Simple lookup |
| "Summarize this" | GLM-4.7 | Basic task |
| "Write Python script" | GLM-5 | Coding |
| "Debug this error" | GLM-5 | Analysis |
| "Research market trends" | GLM-5 | Deep research |
| "Plan migration" | GLM-5 | Complex planning |
| "Analyze this issue" | GLM-5 | Analysis |
Other Notes
- When the user asks to use a specific model, use it
- Always mention which model is used in outputs — example: "(GLM-5)" or "(GLM-4.7)" at the end of responses
- After done with GLM-5 (via sessions_spawn), continue with GLM-4.7 as default
- If you think the request does not fall into GLM-4.7 use cases, THEN ESCALATE TO GLM-5
- If you think you will violate the DO NOT list, THEN ESCALATE TO GLM-5
- Coding = always GLM-5
- When in doubt → GLM-5 (better safe than sorry)
- Heartbeat checks → always GLM-4.7 unless complex analysis needed
Memory Management with sessions_spawn
When spawning GLM-5 sub-agent sessions for ANY task (coding, research, analysis, planning, etc.), follow this pattern:
Output Rules
1. Code Output (Important)
- Full code ONLY in files — do NOT include in announce unless explicitly requested
- Provide summary: what was created, file path, status, dependencies
- Full code disclosure ONLY when:
- User explicitly requests: "Show me the code"
- Debugging needs code review
- User wants to improve/modify it
2. Full Announce for Other Results
- Research findings, analysis results, solutions → announce FULLY to user
- Do NOT shorten, summarize, or condense non-code output
- User gets complete findings, not a brief summary
3. Two-Layer Memory Strategy
MEMORY.md (Curated Long-Term)
- ONLY key insights, decisions, lessons, significant findings, preferences
- Clean, concise, actionable
- Skip routine data, step-by-step reasoning, temporary thoughts
Detailed Reports (Task-Specific Files)
- For research:
research/YYYY-MM-DD-topic.md(full findings, data, analysis) - For coding: add inline docs/README in code folder if needed
- For analysis: output files in relevant project directories
Examples
Research task:
sessions_spawn({
task: "Research X. Announce full findings to user. Write full report to research/YYYY-MM-DD-X.md, then write ONLY key insights to MEMORY.md (clean, concise).",
model: "zai/glm-5",
label: "Research X"
})
Coding task:
sessions_spawn({
task: "Write Python script for X. Save full code to file. Provide summary (what created, path, status, dependencies) in announce. Write key implementation decisions to MEMORY.md (important only).",
model: "zai/glm-5",
label: "Python script X"
})
Apply this pattern to ALL GLM-5 spawns. Code in files only, summary in announce, full disclosure on request.
- 确保已安装 OpenClaw(本地或 Docker 部署)
- 在对话框中输入安装命令:
/install glm-autoroute - 安装完成后,直接呼叫该 Skill 的名称或使用
/glm-autoroute触发 - 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
GLM Autoroute 是什么?
Routes tasks between GLM-4.7-FlashX for simple queries and GLM-5 for coding, analysis, reasoning, and complex tasks, switching automatically as needed. 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 778 次。
如何安装 GLM Autoroute?
在 OpenClaw 或 Claude Code 对话框中运行命令「/install glm-autoroute」即可一键安装,无需额外配置。
GLM Autoroute 是免费的吗?
是的,GLM Autoroute 完全免费(开源免费),可自由下载、安装和使用。
GLM Autoroute 支持哪些平台?
GLM Autoroute 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。
谁开发了 GLM Autoroute?
由 Raufi Musaddiq(@raufimusaddiq)开发并维护,当前版本 v1.2.0。