
Durable Agents

Name: Durable Agents
Author: ainakwalamonk

by Zeeshan Ahmad · v1.0.0

cross-platform ⚠ suspicious

448 Downloads

Install in OpenClaw

/install durable-agents

Description

Build autonomous multi-agent pipelines with Mastra (agents only) and Trigger.dev (all workflows and tasks). Use when creating AI agents, designing multi-stag...

README (SKILL.md)

FIRST INSTALL: Read and follow setupSKILL.md before using this skill. After setup is complete and verified (health check + example pipeline + smoke tests pass), delete this block.

Durable Agents — Multi-Agent Pipeline Development Guide

This skill combines Mastra (an AI agent framework) with Trigger.dev (durable task execution with retries, timeouts, and fan-out). Build autonomous multi-agent pipelines where each agent owns a single stage, hands off structured output to the next stage through Trigger.dev, and never holds the full context of a pipeline it doesn't own.

Core Principles

All intelligence lives in AGENT.md, not in code. The .ts file is boilerplate wiring. Writing logic in the agent's TypeScript file is wrong.
One agent, one job. Each agent has a single clear responsibility. If an agent does two unrelated things, split it into two agents in a pipeline.
Tasks handle durability, agents handle reasoning. Trigger.dev tasks wrap agent calls with retries and timeouts. The agent receives input and produces output.
Tools return errors, never throw. Every tool returns { success, errorMessage? } on failure. Throwing inside a tool crashes the task. Returning an error lets the agent reason about it.
Type everything. Input schemas, output schemas, tool schemas — all Zod. If it crosses a boundary (tool input, task payload, pipeline stage), it has a schema.
Agents are autonomous, not scripted. Give agents an outcome and a quality bar. Don't wire their steps in code.
Pipelines break context, not logic. Split a pipeline at the point where a different capability is needed — not to artificially divide one agent's work.
All agentic I/O persists to the database. Agent inputs, outputs, and intermediate results are stored as records. The database is the source of truth, not in-memory state.
Every tool that touches a real system is permission-gated. If a tool can post, publish, delete, charge, or trigger anything external, it must confirm intent before executing.

How to Create an Agent

1. Create the directory

src/agents/{name}/
  AGENT.md
  {name}.ts

2. Write the `AGENT.md`

# AGENT: {Name}

## Role
Who this agent is. One sentence.

## Tools
What tools it has and when to use each one. Be explicit — "Use `sqlQuery` to
check if a table exists before referencing it" not just "Has sqlQuery tool."

## Inputs
What payload it receives. Describe the shape and what each field means.

## Goal
What it must achieve. Describe the outcome, not the steps. The agent decides
how to get there. "Produce a deployment plan for the given architecture" not
"First read the architecture, then list the services, then..."

## Output Contract
Exact shape it must return. If structured output is needed, specify the JSON
schema here. Example:
  { "plan": string, "steps": string[], "risks": string[] }

## Quality Standards
What makes output good vs bad. Be specific. "Each step must be independently
executable" not "Steps should be good."

## Guardrails
What it must NOT do. "Never modify database schema directly." "Never assume
the API is authenticated unless payload says so."

## Self-Validation
Checklist the agent must verify before returning:
- Does output match the Output Contract?
- Are all required fields present?
- Does it satisfy the Quality Standards?

3. Create the agent `.ts` file

Pure boilerplate. No logic here.

import fs from "fs";
import path from "path";
import { fileURLToPath } from "url";
import { Agent } from "@mastra/core/agent";
import { model } from "../../config/model.js";

const __dirname = path.dirname(fileURLToPath(import.meta.url));
const instructions = fs.readFileSync(path.join(__dirname, "AGENT.md"), "utf8");

export const myAgent = new Agent({
    id: "my-agent",
    name: "My Agent",
    instructions,
    model,
});

To give the agent tools:

import { myTool } from "../../tools/myTool.js";

export const myAgent = new Agent({
    id: "my-agent",
    name: "My Agent",
    instructions,
    model,
    tools: { myTool },
});

4. Register the agent

In src/mastra/index.ts:

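import { Mastra } from "@mastra/core/mastra";
import { myAgent } from "../agents/my-agent/my-agent.js";
// plannerAgent and reviewerAgent are imported the same way from their own agent directories.

export const mastra = new Mastra({
    // The object keys are the names passed to mastra.getAgent().
    agents: { plannerAgent, reviewerAgent, myAgent },
});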

How to Create a Tool

Structure

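import { createTool } from "@mastra/core/tools";
import { z } from "zod";

export const myTool = createTool({
    id: "my-tool",
    description: "What it does and WHEN to use it",
    inputSchema: z.object({
        query: z.string().describe("The search query"),
    }),
    outputSchema: z.object({
        success: z.boolean(),
        data: z.any().optional(),
        errorMessage: z.string().optional(),
    }),
    // Note: the execute signature differs between Mastra versions (some releases
    // pass the validated input as `{ context }`); check the version you have installed.
    execute: async ({ query }) => {
        try {
            // doSomething is a placeholder for the tool's real work.
            const result = await doSomething(query);
            return { success: true, data: result };
        } catch (error: any) {
            // Return the failure instead of throwing so the agent can reason about it.
            return { success: false, errorMessage: error.message };
        }
    },
});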

Tool Rules

Always define outputSchema. The agent uses it to understand what the tool returns.
Never throw from execute. Return { success: false, errorMessage } instead. Throwing crashes the Trigger.dev task.
Description is for the agent. Write it as instructions: "Use this to check if a database table exists. Pass the table name. Returns true/false."
One tool does one thing. "Query the database" not "Query the database and format the results and send an email."
Use .describe() on Zod fields to tell the agent what to pass.
No side effects unless necessary. If a tool writes, document it clearly in the description and in the agent's AGENT.md guardrails.
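
The "never throw" rule above can be factored into a small helper so that every tool reports failures in the same shape. This is a minimal sketch, not part of Mastra or Trigger.dev: `ToolResult`, `toFailure`, and `runSafely` are hypothetical names introduced here for illustration.

```typescript
// Hypothetical helper names; not part of Mastra or Trigger.dev.
type ToolResult<T> =
    | { success: true; data: T }
    | { success: false; errorMessage: string };

// Normalise anything that was thrown into the failure shape.
function toFailure(error: unknown): { success: false; errorMessage: string } {
    const errorMessage = error instanceof Error ? error.message : String(error);
    return { success: false, errorMessage };
}

// Wrap an async operation so it resolves to a ToolResult and never rejects.
async function runSafely<T>(operation: () => Promise<T>): Promise<ToolResult<T>> {
    try {
        return { success: true, data: await operation() };
    } catch (error) {
        return toFailure(error);
    }
}
```

A tool's execute function could then return `runSafely(() => doSomething(query))`, assuming the tool's output schema matches this shape.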

Where to put tools

Shared tools: src/tools/{name}.ts
Agent-specific tools: src/agents/{agentName}/tools/{name}.ts

Permissioned Tools for Destructive or External Actions

Any tool that touches a real system — posting to an API, publishing content, sending a message, charging a user, deleting data, triggering a webhook — must be permission-gated. Agents must not be able to fire these actions without explicit intent confirmation.

Before building a tool that has real-world side effects, ask the user:

What exact action does this tool take?
Should the agent be able to trigger this autonomously, or does a human need to approve it first?
What are the consequences of it misfiring?
Should this be rate-limited or scoped to specific records?

Build the answer into the tool's permission layer, not just the agent's AGENT.md guardrails. Guardrails are instructions; permission layers are enforcement.

Pattern: Confirm Before Execute

For any action that can't be undone or that has cost/visibility consequences, the tool must receive an explicit confirmed: true in its input before it proceeds. The agent must call a read/preview tool first, then call the action tool only when it has verified the result and received confirmed: true from the calling context.

export const publishPostTool = createTool({
    id: "publish-post",
    description: "Publishes a post to the platform. Only call this after previewing with `previewPostTool` and receiving confirmed: true from the task payload.",
    inputSchema: z.object({
        postId: z.string().describe("ID of the post record to publish"),
        confirmed: z.boolean().describe("Must be true. Do not set this yourself — it must come from the task payload."),
    }),
    outputSchema: z.object({
        success: z.boolean(),
        publishedUrl: z.string().optional(),
        errorMessage: z.string().optional(),
    }),
    execute: async ({ postId, confirmed }) => {
        if (!confirmed) {
            return { success: false, errorMessage: "Publish requires confirmed: true in payload." };
        }
        try {
            const url = await publishPost(postId);
            return { success: true, publishedUrl: url };
        } catch (error: any) {
            return { success: false, errorMessage: error.message };
        }
    },
});
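
A schema field alone cannot stop an agent from setting confirmed: true itself. One way to harden the pattern, sketched here without Mastra so the example is self-contained, is to capture the confirmation from the task payload when the action is built, so it never appears in the agent-controlled input. `makePublishAction`, `PublishResult`, and the injected `publish` function are hypothetical names, and the publisher is synchronous only to keep the sketch short.

```typescript
type PublishResult =
    | { success: true; publishedUrl: string }
    | { success: false; errorMessage: string };

// Build the action once per task run. `confirmedByPayload` is captured from the
// task payload, so the agent's tool input cannot override it.
function makePublishAction(
    confirmedByPayload: boolean,
    publish: (postId: string) => string, // hypothetical publisher, injected for the sketch
) {
    return (input: { postId: string }): PublishResult => {
        if (!confirmedByPayload) {
            return { success: false, errorMessage: "Publish requires confirmed: true in payload." };
        }
        try {
            return { success: true, publishedUrl: publish(input.postId) };
        } catch (error) {
            const errorMessage = error instanceof Error ? error.message : String(error);
            return { success: false, errorMessage };
        }
    };
}

// Usage: the stub publisher stands in for the real platform call.
const stubPublish = (postId: string) => `https://example.invalid/posts/${postId}`;
const blocked = makePublishAction(false, stubPublish)({ postId: "p1" });
const allowed = makePublishAction(true, stubPublish)({ postId: "p1" });
```

With this shape the guard is enforced by the task that constructs the tool, and the agent only ever supplies the record ID.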

Pattern: Scope to Records

Destructive or write tools must operate on a specific record ID — never on a query, a filter, or an implicit "current item." The agent must always pass the exact ID of the record it's acting on. This prevents the tool from accidentally operating on the wrong item.

inputSchema: z.object({
    recordId: z.string().describe("Exact DB ID of the record to act on. Do not pass a search query."),
})

What belongs in `AGENT.md` Guardrails vs in the tool

Concern	Where it lives
"Don't publish unless quality score > 0.8"	`AGENT.md` Guardrails
"Don't call this without confirmed: true"	Tool input schema + execute guard
"Only act on records in status: draft"	Tool execute guard (check DB before acting)
"Never delete more than one record per run"	Tool execute guard (enforce the count)

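Execute guards such as "only act on records in status: draft" and "never delete more than one record per run" can be written as plain checks that run before any side effect. The sketch below is one possible shape under stated assumptions: `guardDelete` and `makeRunBudget` are hypothetical names, and the record is passed in directly instead of being loaded from the DB.

```typescript
type RecordStatus =
    | "pending" | "processing" | "draft" | "approved"
    | "rejected" | "published" | "failed";
type GuardResult = { success: true } | { success: false; errorMessage: string };

// Tracks how many destructive actions the current run has already taken.
function makeRunBudget(limit: number) {
    let used = 0;
    return {
        tryConsume(): boolean {
            if (used >= limit) return false;
            used += 1;
            return true;
        },
    };
}

// Checks that belong in the tool's execute function, before any side effect.
function guardDelete(
    record: { id: string; status: RecordStatus } | undefined,
    budget: { tryConsume(): boolean },
): GuardResult {
    if (!record) return { success: false, errorMessage: "Record not found." };
    if (record.status !== "draft") {
        return { success: false, errorMessage: `Record ${record.id} is ${record.status}, not draft.` };
    }
    if (!budget.tryConsume()) {
        return { success: false, errorMessage: "Delete limit for this run reached." };
    }
    return { success: true };
}

const budget = makeRunBudget(1);
const first = guardDelete({ id: "a", status: "draft" }, budget);   // allowed
const second = guardDelete({ id: "b", status: "draft" }, budget);  // blocked: limit reached
const third = guardDelete({ id: "c", status: "published" }, makeRunBudget(1)); // blocked: wrong status
```

The status check runs before the budget is consumed, so a rejected record does not use up the run's single allowed delete.
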
How to Create a Pipeline

Pipelines chain Trigger.dev tasks. Each task calls one agent and passes its output to the next. No single agent holds the full pipeline context — each stage receives only what it needs.

1. Create task files in `src/pipelines/tasks/`

import { task, logger } from "@trigger.dev/sdk/v3";
import { mastra } from "../../mastra/index.js";

export const planTask = task({
    id: "plan-task",
    retry: { maxAttempts: 3, minTimeoutInMs: 1000, factor: 2 },
    run: async (payload: { prompt: string }) => {
        logger.info("Running planner", { promptLength: payload.prompt.length });
        const agent = mastra.getAgent("plannerAgent");
        const response = await agent.generate(JSON.stringify(payload));
        return response.text;
    },
});

2. Create the pipeline orchestrator

In src/pipelines/{name}.ts, chain tasks using triggerAndWait. Trigger.dev only supports triggerAndWait inside a running task, so call this function from a parent task's run function:

import { planTask } from "./tasks/plan-task.js";
import { reviewTask } from "./tasks/review-task.js";

// Must run inside a Trigger.dev task: triggerAndWait is not available elsewhere.
export async function runMyPipeline(input: string) {
    const planResult = await planTask.triggerAndWait({ prompt: input });
    if (!planResult.ok) throw new Error("Plan task failed");

    const reviewResult = await reviewTask.triggerAndWait({ plan: planResult.output });
    if (!reviewResult.ok) throw new Error("Review task failed");

    return { plan: planResult.output, review: reviewResult.output };
}

3. Export tasks for the worker

In src/trigger/index.ts:

export * from "../pipelines/tasks/plan-task.js";
export * from "../pipelines/tasks/review-task.js";

Every task must be exported here or the Trigger.dev worker won't discover it.

4. Add an API endpoint

In src/app/index.ts:

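app.post("/my-pipeline", async (req, res) => {
    // runMyPipeline relies on triggerAndWait, which Trigger.dev only supports inside a
    // running task. If this handler runs outside one, wrap the pipeline in a parent task
    // and trigger that task here instead.
    const { input } = req.body;
    const result = await runMyPipeline(input);
    res.json({ success: true, ...result });
});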

Pipeline Design: Agents vs Scripts

Not every pipeline stage needs an agent. Use agents where judgment is required. Use scripts (plain TypeScript functions or Trigger.dev tasks with no agent) where the action is deterministic.

Example: Content Production Pipeline

[Director Agent]         — generates ideas, writes scripts, validates against criteria
        ↓
[Media Selector Agent]   — selects or processes media assets based on the script
        ↓
[Overlay Task]           — no agent; deterministic script that composites text onto video and stores result

The overlay stage has no reasoning to do. It receives exact inputs, executes a fixed operation, and stores the output. Putting an agent here adds latency and cost for no benefit.

When to use an agent in a pipeline stage

Use an agent when the stage requires:

Judgment or evaluation (does this meet a quality bar?)
Selection from ambiguous options (which asset fits this script best?)
Generation from a goal (write a script for this topic)
Iterative refinement based on feedback

Use a plain task (no agent) when the stage is:

A deterministic transformation (resize, encode, composite)
A storage write (save output to DB or file system)
A notification or webhook trigger
A lookup with no interpretation needed

Splitting pipeline stages

Split at the boundary where a different capability is needed — not to artificially divide one agent's work. A director agent that generates ideas, writes a script, and validates it against criteria is doing one coherent job. That's one agent, one task. The media selection is a different capability — that's the split.

Pipeline Patterns

Fan-Out (Parallel Sub-tasks)

import { tasks } from "@trigger.dev/sdk/v3";

// Triggers one "process-item" run per item without waiting for the results.
const batchHandle = await tasks.batchTrigger("process-item",
    items.map(item => ({ payload: { item } }))
);

Each sub-task runs independently with its own retries.
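
When the parent stage needs the fan-out results, it usually has to separate the runs that succeeded from the ones that failed. The sketch below shows that partitioning step on its own. `RunResult` is an assumption modelled on the ok / output fields used with triggerAndWait in this guide, not a type imported from the SDK, and `partitionRuns` is a hypothetical helper.

```typescript
// Assumed result shape, modelled on the ok / output fields used elsewhere in this guide.
type RunResult<T> =
    | { ok: true; id: string; output: T }
    | { ok: false; id: string; error: unknown };

// Collect successful outputs and the IDs of failed runs in one pass.
function partitionRuns<T>(runs: RunResult<T>[]): { outputs: T[]; failedIds: string[] } {
    const outputs: T[] = [];
    const failedIds: string[] = [];
    for (const run of runs) {
        if (run.ok) outputs.push(run.output);
        else failedIds.push(run.id);
    }
    return { outputs, failedIds };
}

const sample: RunResult<number>[] = [
    { ok: true, id: "run_1", output: 10 },
    { ok: false, id: "run_2", error: new Error("timeout") },
    { ok: true, id: "run_3", output: 30 },
];
const summary = partitionRuns(sample);
```

The parent can then decide whether a partial result is acceptable or whether any failed ID should fail the whole stage.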

Review Checkpoint

Insert a review stage between pipeline steps. Three modes:

Mode	Behavior
`"none"`	Auto-approve. Trigger next stage immediately.
`"agent"`	Call a reviewer agent. If approved, continue. If rejected, feed feedback back to the previous stage for revision.
`"human"`	Set a status in the DB to `pending`. Return. A human reviews externally. Resume the pipeline via an API callback.
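
The three modes can be sketched as a single checkpoint function. This is an illustration under stated assumptions, not the skill's implementation: `reviewCheckpoint`, `Verdict`, and `CheckpointResult` are hypothetical names, the reviewer and the previous stage are passed in as plain functions, and everything is synchronous to keep the example short (real agent calls would be async and would persist the status to the DB).

```typescript
type ReviewMode = "none" | "agent" | "human";
type Verdict = { approved: boolean; feedback?: string };
type CheckpointResult =
    | { status: "approved"; draft: string }
    | { status: "rejected"; draft: string; feedback: string }
    | { status: "pending"; draft: string };

function reviewCheckpoint(
    mode: ReviewMode,
    draft: string,
    review: (draft: string) => Verdict,                  // stands in for the reviewer agent
    revise: (draft: string, feedback: string) => string, // stands in for the previous stage
    maxRevisions = 2,
): CheckpointResult {
    if (mode === "none") return { status: "approved", draft };
    if (mode === "human") return { status: "pending", draft }; // resumed later via API callback

    let current = draft;
    let feedback = "";
    for (let attempt = 0; attempt <= maxRevisions; attempt++) {
        const verdict = review(current);
        if (verdict.approved) return { status: "approved", draft: current };
        feedback = verdict.feedback ?? "No feedback given.";
        // Only revise if another review attempt remains.
        if (attempt < maxRevisions) current = revise(current, feedback);
    }
    return { status: "rejected", draft: current, feedback };
}

const reviewed = reviewCheckpoint(
    "agent",
    "plan",
    (d) => (d.includes("risks") ? { approved: true } : { approved: false, feedback: "Add risks." }),
    (d, fb) => `${d} + risks (${fb})`,
);
```

The revision cap is a design choice added for the sketch; without one, an agent reviewer that never approves would loop indefinitely.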

Retry Configuration

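Every task must have explicit retry config. LLM calls are flaky, and a task that ends up with no retries (for example because the project-level default is not what you expect) dies on the first transient API error, taking the pipeline with it.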

retry: {
    maxAttempts: 3,
    minTimeoutInMs: 1000,
    factor: 2,
}
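
To see what this config means in practice, the sketch below computes the wait before each retry, assuming plain exponential backoff. That formula is an assumption for illustration: Trigger.dev may also apply options such as a maximum timeout or randomisation if they are configured, so actual delays can differ. `retryDelaysMs` is a hypothetical helper.

```typescript
type RetryConfig = { maxAttempts: number; minTimeoutInMs: number; factor: number };

// Delay before each retry, assuming plain exponential backoff:
// delay(n) = minTimeoutInMs * factor^(n - 1) for retry n = 1 .. maxAttempts - 1.
function retryDelaysMs(config: RetryConfig): number[] {
    const delays: number[] = [];
    for (let retry = 1; retry < config.maxAttempts; retry++) {
        delays.push(config.minTimeoutInMs * Math.pow(config.factor, retry - 1));
    }
    return delays;
}

const delays = retryDelaysMs({ maxAttempts: 3, minTimeoutInMs: 1000, factor: 2 });
// delays is [1000, 2000]: three attempts means at most two retries.
```

Under this assumption the config above adds about three seconds of waiting in the worst case, on top of the time each attempt takes.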

Database as the Agentic Record Layer

Every agent input, output, and intermediate result must be persisted to the database before the next stage runs. This is not optional. Agents operate on DB records — they do not pass raw data through in-memory pipelines.

Why

Deduplication. Check if an equivalent job has already run before triggering a new one. Compare by content hash, source ID, or a natural key.
Verification. The next stage reads from the DB, not from the previous task's return value. If a record isn't in the DB, the stage doesn't proceed.
Record keeping. Every generated asset, decision, and status transition is a row. You can audit, replay, and debug any run.
Resume on failure. If a task retries, it checks the DB first. If the output already exists, it skips regeneration and continues.
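
Deduplication by content hash can be sketched as follows. The example hashes a canonical JSON form of the payload so that key order does not change the result, and uses an in-memory Set as a stand-in for a unique index in the database. `canonicalJson`, `contentHash`, and `shouldRun` are hypothetical names, and SHA-256 is an assumed choice of hash.

```typescript
import { createHash } from "node:crypto";

// Serialise with sorted object keys so that key order does not change the hash.
function canonicalJson(value: unknown): string {
    if (Array.isArray(value)) return `[${value.map(canonicalJson).join(",")}]`;
    if (value !== null && typeof value === "object") {
        const entries = Object.entries(value as Record<string, unknown>)
            .sort(([a], [b]) => (a < b ? -1 : a > b ? 1 : 0))
            .map(([key, inner]) => `${JSON.stringify(key)}:${canonicalJson(inner)}`);
        return `{${entries.join(",")}}`;
    }
    // JSON.stringify returns undefined for values like undefined; treat those as null.
    return JSON.stringify(value) ?? "null";
}

function contentHash(payload: unknown): string {
    return createHash("sha256").update(canonicalJson(payload)).digest("hex");
}

// In-memory stand-in for a unique index on a hash column.
const seenHashes = new Set<string>();

// Returns true the first time a payload is seen and false for equivalent repeats.
function shouldRun(payload: unknown): boolean {
    const key = contentHash(payload);
    if (seenHashes.has(key)) return false;
    seenHashes.add(key);
    return true;
}
```

In a real pipeline the hash would be stored on the job record and checked with a query before triggering, so the check survives process restarts.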

Pattern: Write Before Passing

Every task writes its output to the DB and returns the record ID. The next task receives the ID, reads from the DB, and operates on the record.

import { task } from "@trigger.dev/sdk/v3";
import { mastra } from "../../mastra/index.js";
// db, ScriptOutputSchema, and MediaOutputSchema are assumed to be defined elsewhere in the project.

// Stage 1: director agent writes its output
export const scriptTask = task({
    id: "script-task",
    retry: { maxAttempts: 3, minTimeoutInMs: 1000, factor: 2 },
    run: async (payload: { projectId: string }) => {
        const existing = await db.script.findFirst({ where: { projectId: payload.projectId } });
        if (existing) return { scriptId: existing.id }; // already done, skip

        const agent = mastra.getAgent("directorAgent");
        const response = await agent.generate(JSON.stringify(payload));
        const output = ScriptOutputSchema.parse(JSON.parse(response.text));

        const record = await db.script.create({
            data: { projectId: payload.projectId, content: output.script, status: "draft" },
        });
        return { scriptId: record.id };
    },
});

// Stage 2: next agent reads by ID
export const mediaTask = task({
    id: "media-task",
    retry: { maxAttempts: 3, minTimeoutInMs: 1000, factor: 2 },
    run: async (payload: { scriptId: string }) => {
        // Same skip-if-done check as stage 1, so a retry does not create a duplicate selection.
        const existing = await db.mediaSelection.findFirst({ where: { scriptId: payload.scriptId } });
        if (existing) return { mediaSelectionId: existing.id };

        const script = await db.script.findUniqueOrThrow({ where: { id: payload.scriptId } });
        const agent = mastra.getAgent("mediaSelectorAgent");
        const response = await agent.generate(JSON.stringify({ script: script.content }));
        const output = MediaOutputSchema.parse(JSON.parse(response.text));

        const record = await db.mediaSelection.create({
            data: { scriptId: payload.scriptId, assetIds: output.assetIds, status: "selected" },
        });
        return { mediaSelectionId: record.id };
    },
});

Pattern: Status Transitions as Pipeline Control

Store a status field on every record. Use it to gate pipeline stages and drive human review checkpoints.

Status	Meaning
`pending`	Created, not yet processed
`processing`	Task is running
`draft`	Agent output produced, not reviewed
`approved`	Passed review (agent or human)
`rejected`	Failed review, needs revision
`published`	Final action taken
`failed`	Unrecoverable error

await db.script.update({
    where: { id: scriptId },
    data: { status: "processing" },
});
// ... agent call ...
await db.script.update({
    where: { id: scriptId },
    data: { status: "draft", content: output.script },
});
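
Status updates are easier to trust when illegal jumps are rejected before the write. The sketch below validates a transition against a table of allowed edges. The edges themselves are an assumption chosen for illustration and should be adjusted to the pipeline; `canTransition` and `transition` are hypothetical helpers.

```typescript
type Status =
    | "pending" | "processing" | "draft" | "approved"
    | "rejected" | "published" | "failed";

// Allowed edges are an assumption for illustration; adjust them to the pipeline.
const allowedTransitions: Record<Status, Status[]> = {
    pending: ["processing"],
    processing: ["draft", "failed"],
    draft: ["approved", "rejected"],
    rejected: ["processing", "failed"],
    approved: ["published", "failed"],
    published: [],
    failed: [],
};

function canTransition(from: Status, to: Status): boolean {
    return allowedTransitions[from].includes(to);
}

// Returns the new status, or an error result instead of throwing.
function transition(from: Status, to: Status):
    | { success: true; status: Status }
    | { success: false; errorMessage: string } {
    if (!canTransition(from, to)) {
        return { success: false, errorMessage: `Illegal transition: ${from} -> ${to}` };
    }
    return { success: true, status: to };
}
```

A task would call `transition` with the record's current status and only issue the update when it succeeds, which keeps a retried or misrouted stage from moving a published record back to draft.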

Keeping Agents Autonomous

Define the destination and the quality bar. Don't specify how to get there.

Wrong — micromanaging the agent:

1. Read the input
2. Extract the requirements
3. For each requirement, write a task
4. Format the tasks as a numbered list
5. Return the list

Right — defining the outcome:

## Goal
Produce a technical implementation plan for the given objective.

## Output Contract
{ "tasks": [{ "title": string, "description": string, "dependencies": string[] }] }

## Quality Standards
- Each task must be independently executable by a developer
- Dependencies must reference other tasks by title
- No task should take more than 4 hours of work

Type Enforcement

Task Payloads

Always type the run function parameter:

run: async (payload: { prompt: string; maxTokens?: number }) => {

Structured Output from Agents

Define the exact schema in the AGENT.md Output Contract section, then validate with Zod on receipt:

const OutputSchema = z.object({
    tasks: z.array(z.object({
        title: z.string(),
        description: z.string(),
        dependencies: z.array(z.string()),
    })),
});

const response = await agent.generate(JSON.stringify(payload));
const parsed = OutputSchema.parse(JSON.parse(response.text));

If parsing fails, the task throws, Trigger.dev retries with the same input, and the agent produces output again.
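
Models sometimes wrap JSON in a Markdown code fence, which makes JSON.parse throw before the schema ever sees the data. The sketch below strips one surrounding fence before parsing and reports failure as a value, so the task can decide whether to throw and trigger a retry. `stripCodeFence` and `parseAgentJson` are hypothetical helpers, not part of Mastra, and schema validation would still follow the parse.

```typescript
const FENCE = "`".repeat(3); // built at runtime to keep literal fences out of this example

// Strip one surrounding Markdown code fence (with optional language tag), if present.
function stripCodeFence(text: string): string {
    const trimmed = text.trim();
    if (!trimmed.startsWith(FENCE) || !trimmed.endsWith(FENCE)) return trimmed;
    const firstNewline = trimmed.indexOf("\n");
    if (firstNewline === -1) return trimmed;
    return trimmed.slice(firstNewline + 1, trimmed.length - FENCE.length).trim();
}

// Parse without throwing; the caller decides whether a failure should trigger a retry.
function parseAgentJson(text: string):
    | { success: true; value: unknown }
    | { success: false; errorMessage: string } {
    try {
        return { success: true, value: JSON.parse(stripCodeFence(text)) };
    } catch (error) {
        const errorMessage = error instanceof Error ? error.message : String(error);
        return { success: false, errorMessage };
    }
}

const parsedFenced = parseAgentJson(`${FENCE}json\n{"tasks": []}\n${FENCE}`);
const parsedPlain = parseAgentJson('{"tasks": []}');
const parsedBad = parseAgentJson("Sure! Here is the plan.");
```

Only the first two inputs parse; the third returns an error message that can be logged alongside the raw response before the task throws.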

Tool Schemas

Always define both inputSchema and outputSchema on tools. The agent uses these to understand what arguments to pass and what it will receive back.

Key Rules

All intelligence lives in AGENT.md, not in code
Agent .ts files are boilerplate wiring only — no logic
Tools return { success, errorMessage } on failure — never throw
Task wrappers handle durability, agents handle reasoning
Self-validation checklist in AGENT.md is mandatory for structured output agents
Every Trigger.dev task has explicit retry config
Every task is exported from src/trigger/index.ts
One model config for all agents — src/config/model.ts
Pipeline stages use triggerAndWait for sequential, batchTrigger for parallel
Check result.ok after every triggerAndWait — don't assume success
Every agent output is written to the DB before the next stage runs — never pass raw data between tasks
Tasks that would duplicate work must check the DB first and skip if already done
Every tool that takes a real-world action requires confirmed: true in the input and must verify it before executing
Not every pipeline stage needs an agent — use plain tasks for deterministic operations

Usage Guidance

This skill contains a one-time setup guide that searches your filesystem and Docker containers for credentials, writes to .env and to the Trigger CLI config, runs database queries inside Docker, and generates and inserts PATs. Before running anything or letting an agent execute these steps:

1. Review the repository it clones (git clone https://github.com/ainakwalamonk/durableclaw.git) and every script (./setup.sh, init scripts) manually.
2. Back up any files that may be modified (your .env, trigger config, and Trigger CLI prefs).
3. Prefer running the setup interactively in an isolated VM/container rather than on a production host.
4. Ask the author why required credentials are not declared in metadata and why the setup insists on 'never stop for user input'.
5. Disable autonomous invocation for this skill or require explicit human approval before it runs any system-modifying steps.

Capability Analysis

Type: OpenClaw Skill
Name: durable-agents
Version: 1.0.0

The skill is highly suspicious due to severe prompt injection vulnerabilities and instructions that grant the AI agent broad, unsupervised access to sensitive system information. Specifically, `setupSKILL.md` instructs the agent to 'find the solution independently. Read logs, inspect the DB, check config files. Do not ask the user.' and to actively search for LLM API keys across various system locations (e.g., other Docker container environments, config files) and directly query/modify the database to extract and insert sensitive API keys and PATs. While framed as setup/debugging, these instructions enable extensive data discovery and system modification without human oversight, making the agent highly susceptible to malicious prompt injection for data exfiltration or unauthorized actions.

Capability Assessment

ℹ Purpose & Capability

The described goal (building Mastra + Trigger.dev pipelines) can legitimately require starting local Docker services, configuring Trigger.dev, and wiring an LLM gateway. However, the skill metadata declares no required env vars or credentials even though the runtime instructions explicitly search for and write AI credentials, Trigger secrets, and CLI PATs — a mismatch between claim and actual needs.

⚠ Instruction Scope

SKILL.md and setupSKILL.md direct a full, non-interactive setup: scanning project directories and running docker inspect, reading/writing .env, querying Postgres via docker exec, and editing ~/Library/Preferences/trigger/default.json. They also enforce 'Never stop for user input' and 'find the solution independently' rules, which grant broad discretion to probe the host filesystem and services. Those steps go beyond typical guidance and can expose or overwrite secrets and user config.

✓ Install Mechanism

No install spec or remote downloads are included (instruction-only skill), which reduces supply-chain risk. The primary risk comes from the actions the instructions ask the operator/agent to perform, not from any packaged installer.

⚠ Credentials

Although the registry metadata lists no required env vars or credentials, the setup instructs obtaining AI_BASE_URL, AI_API_KEY, MODEL_ID, TRIGGER_SECRET_KEY, TRIGGER_ACCESS_TOKEN and possibly DB access. Asking the runtime to discover these values in other projects/containers and to write them into .env or system files is disproportionate to what's declared and increases the chance of secret exposure or accidental overwrite.

⚠ Persistence & Privilege

The setup explicitly edits local config files (e.g., .env, trigger.config.ts, and ~/Library/Preferences/trigger/default.json), inserts tokens into the DB, and generates encrypted PATs using a hard-coded ENCRYPTION_KEY in the recovery instructions. While these changes may be needed for local self-hosting, they are privileged and should be performed with explicit user consent and review — not by a non-interactive process.

How to Use

Make sure OpenClaw is installed (local or Docker)
Run the install command in chat: /install durable-agents
After installation, invoke the skill by name or use /durable-agents
Provide required inputs per the skill's parameter spec and get structured output

Version History

v1.0.0

Initial release of durable-agents for autonomous multi-agent pipelines.

Integrates Mastra (for agent reasoning) and Trigger.dev (for durable, retryable task orchestration)
Establishes strict best practices: logic in AGENT.md, one job per agent, Zod-typed schemas everywhere
Mandates error handling by return value (never throwing) for all tools
Provides detailed step-by-step guides for creating agents and tools
Enforces permission gating for any tool that performs real-world or destructive actions
Ensures all agentic outputs and intermediate states are persisted to a database, never in memory

Metadata

Slug durable-agents

Version 1.0.0

License —

All-time Installs 0

Active Installs 0

Total Versions 1

Frequently Asked Questions

What is Durable Agents?

Build autonomous multi-agent pipelines with Mastra (agents only) and Trigger.dev (all workflows and tasks). Use when creating AI agents, designing multi-stag... It is an AI Agent Skill for Claude Code / OpenClaw, with 448 downloads so far.

How do I install Durable Agents?

Run "/install durable-agents" in the OpenClaw or Claude Code chat to install it. Before first use, the README asks you to read and follow setupSKILL.md.

Is Durable Agents free?

Yes, Durable Agents is completely free (open-source). You can download, install and use it at no cost.

Which platforms does Durable Agents support?

Durable Agents is cross-platform and runs anywhere OpenClaw / Claude Code is available.

Who created Durable Agents?

It is built and maintained by Zeeshan Ahmad (@ainakwalamonk); the current version is v1.0.0.
