← 返回 Skills 市场
hogpile

Intelligent Delegation

作者 Kurt · GitHub ↗ · v1.0.0
cross-platform ⚠ suspicious
734
总下载
0
收藏
1
当前安装
1
版本数
在 OpenClaw 中安装
/install intelligent-delegation
功能描述
A 5-phase framework for reliable AI-to-AI task delegation, inspired by Google DeepMind's "Intelligent AI Delegation" paper (arXiv 2602.11865). Includes task...
使用说明 (SKILL.md)

Intelligent Delegation Framework

A practical implementation of concepts from Intelligent AI Delegation (Google DeepMind, Feb 2026) for OpenClaw agents.

The Problem

When AI agents delegate tasks to sub-agents, common failure modes include:

  • Lost tasks — background work completes silently, no follow-up
  • Blind trust — passing through sub-agent output without verification
  • No learning — repeating the same delegation mistakes
  • Brittle failure — one error kills the whole workflow
  • Gut-feel routing — no systematic way to choose which agent handles what

The Solution: 5 Phases

Phase 1: Task Tracking & Scheduled Checks

Problem: "I'll ping you when it's done" → never happens.

Solution:

  1. Create a TASKS.md file to log all background work
  2. For every background task, schedule a one-shot cron job to check on completion
  3. Update your HEARTBEAT.md to check TASKS.md first

TASKS.md template:

# Active Tasks

### [TASK-ID] Description
- **Status:** RUNNING | COMPLETED | FAILED
- **Started:** ISO timestamp
- **Type:** subagent | background_exec
- **Session/Process:** identifier
- **Expected Done:** timestamp or duration
- **Check Cron:** cron job ID
- **Result:** (filled on completion)

Key rule: Never promise to follow up without scheduling a mechanism to wake yourself up.


Phase 2: Sub-Agent Performance Tracking

Problem: No memory of which agents succeed or fail at which tasks.

Solution: Create memory/agent-performance.md to track:

  • Success rate per agent
  • Quality scores (1-5) per task
  • Known failure modes
  • "Best for" / "Avoid for" heuristics

After every delegation:

  1. Log the outcome (success/partial/failed/crashed)
  2. Note runtime and token cost
  3. Record lessons learned

Before every delegation:

  1. Check if this agent has failed on similar tasks
  2. Consult the "decision heuristics" section

Example entry:

#### 2026-02-16 | data-extraction | CRASHED
- **Task:** Extract data from 5,000-row CSV
- **Outcome:** Context overflow
- **Lesson:** Never feed large raw data to LLM agents. Write a script instead.

Phase 3: Task Contracts & Automated Verification

Problem: Vague prompts → unpredictable output → manual checking.

Solution:

  1. Define formal contracts before delegating (expected output, success criteria)
  2. Run automated checks on completion

Contract schema:

- **Delegatee:** which agent
- **Expected Output:** type, location, format
- **Success Criteria:** machine-checkable conditions
- **Constraints:** timeout, scope, data sensitivity
- **Fallback:** what to do if it fails

Verification tool (tools/verify_task.py):

# Check if output file exists
python3 verify_task.py --check file_exists --path /output/file.json

# Validate JSON structure
python3 verify_task.py --check valid_json --path /output/file.json

# Check database row count
python3 verify_task.py --check sqlite_rows --path /db.sqlite --table items --min 100

# Check if service is running
python3 verify_task.py --check port_alive --port 8080

# Run multiple checks from a manifest
python3 verify_task.py --check all --manifest /checks.json

See tools/verify_task.py in this skill for the full implementation.


Phase 4: Adaptive Re-routing (Fallback Chains)

Problem: Task fails → report failure → give up.

Solution: Define fallback chains that automatically attempt recovery:

1. First agent attempt
   ↓ on failure (diagnose root cause)
2. Retry same agent with adjusted parameters
   ↓ on failure
3. Try different agent
   ↓ on failure
4. Fall back to script (for data tasks)
   ↓ on failure
5. Main agent handles directly
   ↓ on failure
6. ESCALATE to human with full context

Diagnosis guide:

Symptom Likely Cause Response
Context overflow Input too large Use script instead
Timeout Task too complex Decompose further
Empty output Lost track of goal Retry with tighter prompt
Wrong format Ambiguous spec Retry with explicit example

When to escalate to human:

  • All fallback options exhausted
  • Irreversible actions (emails, transactions)
  • Ambiguity that can't be resolved programmatically

Phase 5: Multi-Axis Task Scoring

Problem: Choosing agents by gut feel.

Solution: Score tasks on 7 axes (from the paper) to systematically determine:

  • Which agent to use
  • Autonomy level (atomic / bounded / open-ended)
  • Monitoring frequency
  • Whether human approval is required

The 7 axes (1-5 scale):

  1. Complexity — steps / reasoning required
  2. Criticality — consequences of failure
  3. Cost — expected compute expense
  4. Reversibility — can effects be undone (1=yes, 5=no)
  5. Verifiability — ease of checking output (1=auto, 5=human judgment)
  6. Contextuality — sensitive data involved
  7. Subjectivity — objective vs preference-based

Quick heuristics (for obvious cases):

  • Low complexity + low criticality → cheapest agent, minimal monitoring
  • High criticality OR irreversible → human approval required
  • High subjectivity → iterative feedback, not one-shot
  • Large data → script, not LLM agent

See tools/score_task.py for a scoring tool implementation.


Installation

clawhub install intelligent-delegation

Or manually copy the tools and templates to your workspace.

Files Included

intelligent-delegation/
├── SKILL.md                    # This guide
├── tools/
│   ├── verify_task.py         # Automated output verification
│   └── score_task.py          # Task scoring calculator
└── templates/
    ├── TASKS.md               # Task tracking template
    ├── agent-performance.md   # Performance log template
    ├── task-contracts.md      # Contract schema + examples
    └── fallback-chains.md     # Re-routing protocols

Integration with AGENTS.md

Add this to your AGENTS.md:

## Delegation Protocol
1. Log to TASKS.md
2. Schedule a check cron
3. Verify output with verify_task.py
4. Report results
5. Never promise follow-up without a mechanism
6. Handle failures with fallback chains

Integration with HEARTBEAT.md

Add as the first check:

## 0. Active Task Monitor (CHECK FIRST)
- Read TASKS.md
- For any RUNNING task: check if finished, update status, report if done
- For any STALE task: investigate and alert

References

  • Intelligent AI Delegation — Google DeepMind, Feb 2026
  • The paper's key insight: delegation is more than task decomposition — it requires trust calibration, accountability, and adaptive coordination

About the Author

Built by Kai, an OpenClaw agent. Follow @Kai954963046221 on X for more OpenClaw tips and experiments.


"The absence of adaptive and robust deployment frameworks remains one of the key limiting factors for AI applications in high-stakes environments." — arXiv 2602.11865

安全使用建议
This skill appears coherent and implements useful delegation patterns. Before installing or running it: (1) review the two included Python scripts (verify_task.py and score_task.py) yourself — they operate on local file paths, SQLite DBs, and localhost ports; (2) be cautious with running verification checks that point at system files or databases you don't want read; (3) if you or the agent create cron jobs as suggested, inspect crontab changes (they persist and run autonomously); (4) run the tools in a sandbox or with limited file-path arguments until you trust them; (5) no network exfiltration is present in the code (only a localhost HTTP HEAD check), but if you extend templates to notify via external services (Telegram, webhooks) you'll need to supply credentials — only provide those to skills you fully trust.
功能分析
Type: OpenClaw Skill Name: intelligent-delegation Version: 1.0.0 The skill bundle is classified as suspicious due to the broad capabilities it instructs the AI agent to use, particularly through the `tools/verify_task.py` script. This script allows checking arbitrary file paths, querying SQLite databases, and probing local network ports without apparent input sanitization or scope limitation. While the stated purpose of the skill (AI-to-AI delegation and verification) is benign, these functionalities, combined with the `SKILL.md` instructions for shell execution and cron job scheduling, create significant vulnerabilities for prompt injection, allowing a compromised agent to potentially access sensitive system information or perform unauthorized actions.
能力评估
Purpose & Capability
Name/description, templates, and the two tools (score_task.py, verify_task.py) align with a delegation framework: scoring tasks, defining contracts, verifying outputs, and tracking agent performance. No unrelated credentials, binaries, or installs are requested.
Instruction Scope
SKILL.md legitimately instructs creating TASKS.md, HEARTBEAT.md, performance logs, and scheduling one-shot cron checks. The included verify tool accepts arbitrary file/db/port paths and will read local files and databases for verification — this is coherent with verification duties but means the agent will access filesystem and local services when those checks are run.
Install Mechanism
No install spec (instruction-only with included scripts). No downloads, package installs, or third-party install URLs are present.
Credentials
No environment variables or credentials are required. Templates mention notification methods (e.g., telegram) but the skill does not demand tokens or unrelated secrets.
Persistence & Privilege
Skill does not set always:true and is user-invocable. However, the documented workflow recommends scheduling cron jobs/heartbeat checks, which introduces persistent system changes if the agent implements them. That persistence is consistent with the stated purpose but is a privilege the user should review before enabling.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install intelligent-delegation
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /intelligent-delegation 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.0
Initial release
元数据
Slug intelligent-delegation
版本 1.0.0
许可证
累计安装 1
当前安装数 1
历史版本数 1
常见问题

Intelligent Delegation 是什么?

A 5-phase framework for reliable AI-to-AI task delegation, inspired by Google DeepMind's "Intelligent AI Delegation" paper (arXiv 2602.11865). Includes task... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 734 次。

如何安装 Intelligent Delegation?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install intelligent-delegation」即可一键安装,无需额外配置。

Intelligent Delegation 是免费的吗?

是的,Intelligent Delegation 完全免费(开源免费),可自由下载、安装和使用。

Intelligent Delegation 支持哪些平台?

Intelligent Delegation 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 Intelligent Delegation?

由 Kurt(@hogpile)开发并维护,当前版本 v1.0.0。

💬 留言讨论