Self-Improving Agent (Proactive Self-Reflection)

Name: Self-Improving Agent (Proactive Self-Reflection)
Author: jpengcheng523-netizen

Description

Self-reflection + Self-criticism + Self-learning + Self-organizing memory. Agent evaluates its own work, catches mistakes, and improves permanently. Use befo...

README (SKILL.md)

When to Use

User corrects you or points out mistakes. You complete significant work and want to evaluate the outcome. You notice something in your own output that could be better. Knowledge should compound over time without manual maintenance.

Architecture

Memory lives in ~/self-improving/ with tiered structure. If ~/self-improving/ does not exist, run setup.md.

~/self-improving/
├── memory.md          # HOT: ≤100 lines, always loaded
├── index.md           # Topic index with line counts
├── projects/          # Per-project learnings
├── domains/           # Domain-specific (code, writing, comms)
├── archive/           # COLD: decayed patterns
└── corrections.md     # Last 50 corrections log

Quick Reference

Topic	File
Setup guide	`setup.md`
Memory template	`memory-template.md`
Learning mechanics	`learning.md`
Security boundaries	`boundaries.md`
Scaling rules	`scaling.md`
Memory operations	`operations.md`
Self-reflection log	`reflections.md`

Detection Triggers

Log automatically when you notice these patterns:

Corrections → add to corrections.md, evaluate for memory.md:

"No, that's not right..."
"Actually, it should be..."
"You're wrong about..."
"I prefer X, not Y"
"Remember that I always..."
"I told you before..."
"Stop doing X"
"Why do you keep..."

Preference signals → add to memory.md if explicit:

"I like when you..."
"Always do X for me"
"Never do Y"
"My style is..."
"For [project], use..."

Pattern candidates → track, promote after 3x:

Same instruction repeated 3+ times
Workflow that works well repeatedly
User praises specific approach

Ignore (don't log):

One-time instructions ("do X now")
Context-specific ("in this file...")
Hypotheticals ("what if...")

Self-Reflection

After completing significant work, pause and evaluate:

Did it meet expectations? — Compare outcome vs intent
What could be better? — Identify improvements for next time
Is this a pattern? — If yes, log to corrections.md

When to self-reflect:

After completing a multi-step task
After receiving feedback (positive or negative)
After fixing a bug or mistake
When you notice your output could be better

Log format:

CONTEXT: [type of task]
REFLECTION: [what I noticed]
LESSON: [what to do differently]

Example:

CONTEXT: Building Flutter UI
REFLECTION: Spacing looked off, had to redo
LESSON: Check visual spacing before showing user

Self-reflection entries follow the same promotion rules: 3x applied successfully → promote to HOT.

Quick Queries

User says	Action
"What do you know about X?"	Search all tiers for X
"What have you learned?"	Show last 10 from `corrections.md`
"Show my patterns"	List `memory.md` (HOT)
"Show [project] patterns"	Load `projects/{name}.md`
"What's in warm storage?"	List files in `projects/` + `domains/`
"Memory stats"	Show counts per tier
"Forget X"	Remove from all tiers (confirm first)
"Export memory"	ZIP all files

Memory Stats

On "memory stats" request, report:

📊 Self-Improving Memory

HOT (always loaded):
  memory.md: X entries

WARM (load on demand):
  projects/: X files
  domains/: X files

COLD (archived):
  archive/: X files

Recent activity (7 days):
  Corrections logged: X
  Promotions to HOT: X
  Demotions to WARM: X

Core Rules

1. Learn from Corrections and Self-Reflection

Log when user explicitly corrects you
Log when you identify improvements in your own work
Never infer from silence alone
After 3 identical lessons → ask to confirm as rule

2. Tiered Storage

Tier	Location	Size Limit	Behavior
HOT	memory.md	≤100 lines	Always loaded
WARM	projects/, domains/	≤200 lines each	Load on context match
COLD	archive/	Unlimited	Load on explicit query

3. Automatic Promotion/Demotion

Pattern used 3x in 7 days → promote to HOT
Pattern unused 30 days → demote to WARM
Pattern unused 90 days → archive to COLD
Never delete without asking

4. Namespace Isolation

Project patterns stay in projects/{name}.md
Global preferences in HOT tier (memory.md)
Domain patterns (code, writing) in domains/
Cross-namespace inheritance: global → domain → project

5. Conflict Resolution

When patterns contradict:

Most specific wins (project > domain > global)
Most recent wins (same level)
If ambiguous → ask user

6. Compaction

When file exceeds limit:

Merge similar corrections into single rule
Archive unused patterns
Summarize verbose entries
Never lose confirmed preferences

7. Transparency

Every action from memory → cite source: "Using X (from projects/foo.md:12)"
Weekly digest available: patterns learned, demoted, archived
Full export on demand: all files as ZIP

8. Security Boundaries

See boundaries.md — never store credentials, health data, third-party info.

9. Graceful Degradation

If context limit hit:

Load only memory.md (HOT)
Load relevant namespace on demand
Never fail silently — tell user what's not loaded

Scope

This skill ONLY:

Learns from user corrections and self-reflection
Stores preferences in local files (~/self-improving/)
Reads its own memory files on activation

This skill NEVER:

Accesses calendar, email, or contacts
Makes network requests
Reads files outside ~/self-improving/
Infers preferences from silence or observation
Modifies its own SKILL.md

Related Skills

Install with clawhub install \x3Cslug> if user confirms:

memory — Long-term memory patterns for agents
learning — Adaptive teaching and explanation
decide — Auto-learn decision patterns
escalate — Know when to ask vs act autonomously

Feedback

If useful: clawhub star self-improving
Stay updated: clawhub sync

Usage Guidance

This skill is coherent with its description: it keeps a local memory under ~/self-improving/ and learns from explicit corrections. Before installing, consider: 1) It will create and modify files in your home directory and suggests edits to workspace config files (AGENTS.md, SOUL.md) — review those changes manually. 2) Do not store secrets, health, or third-party personal data in the memory files (the skill's docs forbid this, but accidental storage is possible). 3) Choose a conservative operating mode (Passive or Strict) if you want more confirmation before patterns are promoted. 4) Back up or inspect ~/self-improving/ regularly and verify the 'forget everything' flow works as you expect. 5) If you share your environment (team machine, shared agent), treat the memory as potentially visible to others and audit accordingly.

Capability Analysis

Type: OpenClaw Skill Name: jpeng-self-improving Version: 1.2.11 The skill is a comprehensive framework for agent self-improvement through local file-based memory storage in `~/self-improving/`. It provides structured templates and logic for the agent to log user corrections, preferences, and self-reflections to compound execution quality over time. Critically, it includes a `boundaries.md` file that explicitly forbids the storage of sensitive data such as credentials, financial information, or PII. While the `setup.md` file instructs the agent to modify its own configuration files (`SOUL.md`, `AGENTS.md`) and execute basic shell commands (e.g., `mkdir`, `find`), these actions are strictly scoped to the memory directory and are necessary for the skill's stated purpose of autonomous self-optimization.

Capability Assessment

ℹ Purpose & Capability

The name/description (self-reflection, memory, learning) matches the instructions: creating a local memory directory, logging corrections, promoting patterns, and citing sources. One notable out-of-scope action: setup.md suggests editing workspace files (AGENTS.md, SOUL.md, HEARTBEAT.md) to integrate the skill. Modifying workspace config is related to integration but is an additional side-effect the user should expect and review.

ℹ Instruction Scope

All runtime steps are documented as file I/O and local operations (create ~/self-improving/, load memory.md on session start, write corrections.md, export ZIPs, run weekly maintenance). There are no network endpoints, shell downloads, or hidden code. However the instructions do ask the agent to read/write files outside the memory folder (AGENTS.md, SOUL.md) and to perform scheduled maintenance (cron-style weekly tasks), which expands scope beyond pure ephemeral reflection.

✓ Install Mechanism

This is an instruction-only skill with no install spec and no bundled code—nothing is downloaded or executed from external URLs. That is the lowest-risk install model.

ℹ Credentials

The skill requests no environment variables or credentials and declares a home-folder-based storage location. That is proportionate. One caveat: the skill depends on the user not storing secrets in the memory files—boundaries.md explicitly forbids storing credentials/medical/third-party data, but accidental storage would create a risk; the user must enforce that policy.

ℹ Persistence & Privilege

The skill creates persistent files under ~/self-improving/ and expects to load memory.md every session, and it recommends periodic maintenance tasks. It does not force inclusion (always:false) and does not require elevated permissions, but the persistent local state will change agent behavior across sessions and must be managed (e.g., confirm 'forget everything' behavior and backups).

Version History

v1.2.11

Proactive self-reflection for AI agents

Metadata

Slug jpeng-self-improving

Version 1.2.11

License MIT-0

All-time Installs 5

Active Installs 5

Total Versions 1

Frequently Asked Questions

What is Self-Improving Agent (Proactive Self-Reflection)?

Self-reflection + Self-criticism + Self-learning + Self-organizing memory. Agent evaluates its own work, catches mistakes, and improves permanently. Use befo... It is an AI Agent Skill for Claude Code / OpenClaw, with 624 downloads so far.

How do I install Self-Improving Agent (Proactive Self-Reflection)?

Run "/install jpeng-self-improving" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Self-Improving Agent (Proactive Self-Reflection) free?

Yes, Self-Improving Agent (Proactive Self-Reflection) is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Self-Improving Agent (Proactive Self-Reflection) support?

Self-Improving Agent (Proactive Self-Reflection) is cross-platform and runs anywhere OpenClaw / Claude Code is available (linux, darwin, win32).

Who created Self-Improving Agent (Proactive Self-Reflection)?

It is built and maintained by jpengcheng523-netizen (@jpengcheng523-netizen); the current version is v1.2.11.

More Skills