Description

Guides architectural decisions for LangGraph applications. Use when deciding between LangGraph vs alternatives, choosing state management strategies, designi...

README (SKILL.md)

LangGraph Architecture Decisions

Name: Langgraph Architecture
Author: anderskev

When to Use LangGraph

Use LangGraph When You Need:

Stateful conversations - Multi-turn interactions with memory
Human-in-the-loop - Approval gates, corrections, interventions
Complex control flow - Loops, branches, conditional routing
Multi-agent coordination - Multiple LLMs working together
Persistence - Resume from checkpoints, time travel debugging
Streaming - Real-time token streaming, progress updates
Reliability - Retries, error recovery, durability guarantees

Consider Alternatives When:

Scenario	Alternative	Why
Single LLM call	Direct API call	Overhead not justified
Linear pipeline	LangChain LCEL	Simpler abstraction
Stateless tool use	Function calling	No persistence needed
Simple RAG	LangChain retrievers	Built-in patterns
Batch processing	Async tasks	Different execution model

State Schema Decisions

TypedDict vs Pydantic

TypedDict	Pydantic
Lightweight, faster	Runtime validation
Dict-like access	Attribute access
No validation overhead	Type coercion
Simpler serialization	Complex nested models

Recommendation: Use TypedDict for most cases. Use Pydantic when you need validation or complex nested structures.

Reducer Selection

Use Case	Reducer	Example
Chat messages	`add_messages`	Handles IDs, RemoveMessage
Simple append	`operator.add`	`Annotated[list, operator.add]`
Keep latest	None (LastValue)	`field: str`
Custom merge	Lambda	`Annotated[list, lambda a, b: ...]`
Overwrite list	`Overwrite`	Bypass reducer

State Size Considerations

# SMALL STATE (\x3C 1MB) - Put in state
class State(TypedDict):
    messages: Annotated[list, add_messages]
    context: str

# LARGE DATA - Use Store
class State(TypedDict):
    messages: Annotated[list, add_messages]
    document_ref: str  # Reference to store

def node(state, *, store: BaseStore):
    doc = store.get(namespace, state["document_ref"])
    # Process without bloating checkpoints

Graph Structure Decisions

Single Graph vs Subgraphs

Single Graph when:

All nodes share the same state schema
Simple linear or branching flow
\x3C 10 nodes

Subgraphs when:

Different state schemas needed
Reusable components across graphs
Team separation of concerns
Complex hierarchical workflows

Conditional Edges vs Command

Conditional Edges	Command
Routing based on state	Routing + state update
Separate router function	Decision in node
Clearer visualization	More flexible
Standard patterns	Dynamic destinations

# Conditional Edge - when routing is the focus
def router(state) -> Literal["a", "b"]:
    return "a" if condition else "b"
builder.add_conditional_edges("node", router)

# Command - when combining routing with updates
def node(state) -> Command:
    return Command(goto="next", update={"step": state["step"] + 1})

Static vs Dynamic Routing

Static Edges (add_edge):

Fixed flow known at build time
Clearer graph visualization
Easier to reason about

Dynamic Routing (add_conditional_edges, Command, Send):

Runtime decisions based on state
Agent-driven navigation
Fan-out patterns

Persistence Strategy

Checkpointer Selection

Checkpointer	Use Case	Characteristics
`InMemorySaver`	Testing only	Lost on restart
`SqliteSaver`	Development	Single file, local
`PostgresSaver`	Production	Scalable, concurrent
Custom	Special needs	Implement BaseCheckpointSaver

Checkpointing Scope

# Full persistence (default)
graph = builder.compile(checkpointer=checkpointer)

# Subgraph options
subgraph = sub_builder.compile(
    checkpointer=None,   # Inherit from parent
    checkpointer=True,   # Independent checkpointing
    checkpointer=False,  # No checkpointing (runs atomically)
)

When to Disable Checkpointing

Short-lived subgraphs that should be atomic
Subgraphs with incompatible state schemas
Performance-critical paths without need for resume

Multi-Agent Architecture

Supervisor Pattern

Best for:

Clear hierarchy
Centralized decision making
Different agent specializations

          ┌─────────────┐
          │  Supervisor │
          └──────┬──────┘
    ┌────────┬───┴───┬────────┐
    ▼        ▼       ▼        ▼
┌──────┐ ┌──────┐ ┌──────┐ ┌──────┐
│Agent1│ │Agent2│ │Agent3│ │Agent4│
└──────┘ └──────┘ └──────┘ └──────┘

Peer-to-Peer Pattern

Best for:

Collaborative agents
No clear hierarchy
Flexible communication

┌──────┐     ┌──────┐
│Agent1│◄───►│Agent2│
└──┬───┘     └───┬──┘
   │             │
   ▼             ▼
┌──────┐     ┌──────┐
│Agent3│◄───►│Agent4│
└──────┘     └──────┘

Handoff Pattern

Best for:

Sequential specialization
Clear stage transitions
Different capabilities per stage

┌────────┐    ┌────────┐    ┌────────┐
│Research│───►│Planning│───►│Execute │
└────────┘    └────────┘    └────────┘

Streaming Strategy

Stream Mode Selection

Mode	Use Case	Data
`updates`	UI updates	Node outputs only
`values`	State inspection	Full state each step
`messages`	Chat UX	LLM tokens
`custom`	Progress/logs	Your data via StreamWriter
`debug`	Debugging	Tasks + checkpoints

Subgraph Streaming

# Stream from subgraphs
async for chunk in graph.astream(
    input,
    stream_mode="updates",
    subgraphs=True  # Include subgraph events
):
    namespace, data = chunk  # namespace indicates depth

Human-in-the-Loop Design

Interrupt Placement

Strategy	Use Case
`interrupt_before`	Approval before action
`interrupt_after`	Review after completion
`interrupt()` in node	Dynamic, contextual pauses

Resume Patterns

# Simple resume (same thread)
graph.invoke(None, config)

# Resume with value
graph.invoke(Command(resume="approved"), config)

# Resume specific interrupt
graph.invoke(Command(resume={interrupt_id: value}), config)

# Modify state and resume
graph.update_state(config, {"field": "new_value"})
graph.invoke(None, config)

Gates (sequenced)

Complete in order before treating a LangGraph design as locked in. Each step has an objective pass condition (artifact or explicit “none”), not an honor-system “we considered it.”

Alternatives — Pass: For the workload, either (a) at least one row from Consider Alternatives When was evaluated and rejected with a one-line reason, or (b) the use case clearly matches Use LangGraph When You Need and does not fit a “consider alternative” row.
State contract — Pass: Every state field has an assigned reducer (or default/LastValue) documented in the same place as the schema; large payloads are references or Store-backed, not inlined blobs (see State Size Considerations).
Checkpointer — Pass: The saver type is chosen for the target environment per Checkpointer Selection (e.g. production is not InMemorySaver unless explicitly test-only).
Loops and flaky nodes — Pass: recursion_limit (or equivalent) is set for any graph that can cycle; per-node RetryPolicy or a documented “no retries” choice exists for external calls (see Retry Configuration).

Error Handling Strategy

Retry Configuration

# Per-node retry
RetryPolicy(
    initial_interval=0.5,
    backoff_factor=2.0,
    max_interval=60.0,
    max_attempts=3,
    retry_on=lambda e: isinstance(e, (APIError, TimeoutError))
)

# Multiple policies (first match wins)
builder.add_node("node", fn, retry_policy=[
    RetryPolicy(retry_on=RateLimitError, max_attempts=5),
    RetryPolicy(retry_on=Exception, max_attempts=2),
])

Fallback Patterns

def node_with_fallback(state):
    try:
        return primary_operation(state)
    except PrimaryError:
        return fallback_operation(state)

# Or use conditional edges for complex fallback routing
def route_on_error(state) -> Literal["retry", "fallback", "__end__"]:
    if state.get("error") and state["attempts"] \x3C 3:
        return "retry"
    elif state.get("error"):
        return "fallback"
    return END

Scaling Considerations

Horizontal Scaling

Use PostgresSaver for shared state
Consider LangGraph Platform for managed infrastructure
Use stores for large data outside checkpoints

Performance Optimization

Minimize state size - Use references for large data
Parallel nodes - Fan out when possible
Cache expensive operations - Use CachePolicy
Async everywhere - Use ainvoke, astream

Resource Limits

# Set recursion limit
config = {"recursion_limit": 50}
graph.invoke(input, config)

# Track remaining steps in state
class State(TypedDict):
    remaining_steps: RemainingSteps

def check_budget(state):
    if state["remaining_steps"] \x3C 5:
        return "wrap_up"
    return "continue"

Decision Checklist

After Gates (sequenced), before implementing:

Is LangGraph the right tool? (vs simpler alternatives)
State schema defined with appropriate reducers?
Persistence strategy chosen? (dev vs prod checkpointer)
Streaming needs identified?
Human-in-the-loop points defined?
Error handling and retry strategy?
Multi-agent coordination pattern? (if applicable)
Resource limits configured?

Usage Guidance

This skill appears to be pure guidance and low-risk: it asks for no credentials and installs nothing. You can safely use it for design advice, but treat the code examples as templates — review and test them before copying into production. Also prefer skills from known sources if you need long-term maintenance or security guarantees.

Capability Analysis

Type: OpenClaw Skill Name: langgraph-architecture Version: 1.0.1 The skill bundle 'langgraph-architecture' is a documentation-only resource providing architectural guidance for LangGraph applications. It contains no executable code, only markdown instructions and Python code snippets for reference in SKILL.md. There are no indicators of malicious intent, data exfiltration, or prompt injection attacks.

Capability Assessment

✓ Purpose & Capability

The skill's name and description match the SKILL.md content: architectural guidance for LangGraph (state, routing, persistence, multi-agent patterns). Nothing requested (no env vars, no binaries, no config paths) is disproportionate to that purpose.

✓ Instruction Scope

SKILL.md contains recommendations and example code snippets (Python types, reducers, checkpointer choices, streaming modes). The instructions do not direct the agent to read local files, access secrets, call external endpoints, or perform actions beyond giving guidance and sample code.

✓ Install Mechanism

No install spec and no code files — the skill is instruction-only, so nothing will be written to disk or fetched during install.

✓ Credentials

The skill declares no required environment variables, no primary credential, and no config paths. This is proportionate for a documentation/architecture guidance skill.

✓ Persistence & Privilege

always is false and there are no signs the skill modifies agent configs or requests persistent presence. The default ability for the agent to invoke the skill autonomously is normal and not a concern here given the skill's benign footprint.

Version History

v1.0.1

- Added a new "Gates (sequenced)" section specifying required architectural review steps before finalizing a LangGraph design, each with explicit pass conditions and references to documentation. - No other content was changed; all other architectural guidance and patterns remain the same.

v1.0.0

langgraph-architecture 1.0.0 - Initial release providing architectural guidance for LangGraph applications. - Covers state management, reducers, and persistence strategies. - Outlines choices for graph structure, multi-agent design patterns, and streaming options. - Details human-in-the-loop architectures, error handling, and scaling practices. - Helps compare LangGraph to alternatives for common scenarios.

Metadata

Slug langgraph-architecture

Version 1.0.1

License MIT-0

All-time Installs 1

Active Installs 1

Total Versions 2

Frequently Asked Questions

What is Langgraph Architecture?

Guides architectural decisions for LangGraph applications. Use when deciding between LangGraph vs alternatives, choosing state management strategies, designi... It is an AI Agent Skill for Claude Code / OpenClaw, with 183 downloads so far.

How do I install Langgraph Architecture?

Run "/install langgraph-architecture" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Langgraph Architecture free?

Yes, Langgraph Architecture is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Langgraph Architecture support?

Langgraph Architecture is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Langgraph Architecture?

It is built and maintained by Kevin Anderson (@anderskev); the current version is v1.0.1.

More Skills

Langgraph Architecture

LangGraph Architecture Decisions

When to Use LangGraph

Use LangGraph When You Need:

Consider Alternatives When:

State Schema Decisions

TypedDict vs Pydantic

Reducer Selection

State Size Considerations

Graph Structure Decisions

Single Graph vs Subgraphs

Conditional Edges vs Command

Static vs Dynamic Routing

Persistence Strategy

Checkpointer Selection

Checkpointing Scope

When to Disable Checkpointing

Multi-Agent Architecture

Supervisor Pattern

Peer-to-Peer Pattern

Handoff Pattern

Streaming Strategy

Stream Mode Selection

Subgraph Streaming

Human-in-the-Loop Design

Interrupt Placement

Resume Patterns

Gates (sequenced)

Error Handling Strategy

Retry Configuration

Fallback Patterns

Scaling Considerations

Horizontal Scaling

Performance Optimization

Resource Limits

Decision Checklist

What is Langgraph Architecture?

How do I install Langgraph Architecture?

Is Langgraph Architecture free?

Which platforms does Langgraph Architecture support?

Who created Langgraph Architecture?

💬 Comments