← Back to Skills Marketplace
mercury7353

Live Evo: Online Evolution with verified experiences

by YaolunZhang · GitHub ↗ · v0.1.0
cross-platform ✓ Security Clean
317
Downloads
1
Stars
1
Active Installs
1
Versions
Install in OpenClaw
/install live-evo
Description
Self-evolving memory system that learns from verifiable tasks. Use when completing tasks where you can verify the outcome (coding, predictions, analysis). Au...
README (SKILL.md)

Live-Evo: Online Self-Evolving Memory

You are using the Live-Evo memory system that learns from past mistakes through experience accumulation and adaptive evaluation.

IMPORTANT — Script location: All scripts are in the scripts/ subdirectory next to this SKILL.md file. When running scripts, use the absolute path to the scripts/ directory relative to where this file is located. For example, if this SKILL.md is at /path/to/live-evo/SKILL.md, the scripts are at /path/to/live-evo/scripts/.

Experience data is stored persistently at ~/.live-evo/experience_db.jsonl (independent of skill installation location).

Core Workflow

1. Retrieve & Compile (Before Acting)

Run the experience retrieval script to find relevant past experiences:

python \x3Cscripts-dir>/retrieve.py --query "YOUR_TASK_DESCRIPTION"

If experiences are found, they will be compiled into a task-specific guideline. Use this guideline to inform your approach.

2. Decide: Verify or Direct Apply

You must judge whether contrastive verification (two attempts) is worthwhile based on:

Factor Do Contrastive Eval Skip, Direct Apply
Cost of re-running Low (e.g. run a test) High (e.g. long build, API costs, heavy computation)
Verifiability Clear ground truth exists (tests, known answer) No easy way to verify programmatically
Task complexity Simple enough to attempt twice Too complex/large to reasonably duplicate
Guideline relevance Retrieved guideline is highly relevant Guideline is loosely related or no guideline found

If contrastive eval IS worthwhile → Go to Step 2A If contrastive eval is NOT worthwhile → Go to Step 2B

Step 2A: Contrastive Evaluation (Two Attempts)

Make two independent attempts:

Attempt A (Without Memory):

  • Solve the task using only your base knowledge
  • Record your answer/approach

Attempt B (With Guideline):

  • Apply the retrieved guideline
  • Solve the task with this informed approach
  • Record your answer/approach

Then verify and update weights:

python \x3Cscripts-dir>/update.py \
  --task "TASK_DESCRIPTION" \
  --result-a "RESULT_WITHOUT_MEMORY" \
  --result-b "RESULT_WITH_GUIDELINE" \
  --correct "CORRECT_ANSWER" \
  --experience-ids "id1,id2,..."

Step 2B: Direct Apply with Feedback-Based Learning

When contrastive evaluation is not feasible:

  1. Apply the guideline directly (if one was retrieved) and complete the task
  2. Observe feedback from any of these sources:
    • User feedback (corrections, complaints, approval)
    • Environment signals (test results, error messages, build output)
    • Outcome observation (did the result work as expected?)
  3. Store experience directly if feedback reveals a lesson:
python \x3Cscripts-dir>/add_experience.py \
  --question "THE_TASK_QUESTION" \
  --failure-reason "What went wrong (from feedback)" \
  --improvement "Key lesson learned" \
  --category "coding|analysis|prediction|debugging|other"

No contrastive comparison needed — just learn from what happened.

3. Add New Experience (On Any Failure)

Whenever a task fails or feedback reveals a learnable lesson — regardless of which path you took — store the experience:

python \x3Cscripts-dir>/add_experience.py \
  --question "THE_TASK_QUESTION" \
  --failure-reason "What went wrong" \
  --improvement "Key lesson learned" \
  --category "coding|analysis|prediction|debugging|other"

4. Update Weights (When Possible)

If you used a retrieved guideline and can determine whether it helped:

python \x3Cscripts-dir>/update.py \
  --task "TASK_DESCRIPTION" \
  --result-a "WHAT_WOULD_HAVE_HAPPENED" \
  --result-b "WHAT_ACTUALLY_HAPPENED" \
  --correct "CORRECT_OUTCOME" \
  --experience-ids "id1,id2,..."

If you cannot determine whether the guideline helped, skip weight updates — no update is better than a wrong update.

When to Use Live-Evo

Use this system for:

  • Coding tasks: Bug fixes, implementations where tests can verify
  • Analysis tasks: Where ground truth can be checked
  • Predictions: Forecasting with eventual verification
  • Problem solving: Tasks with objectively correct answers
  • Any task with user feedback: Even without formal verification, user corrections are valuable signals

Experience Format

Each experience contains:

  • question: The original task/question
  • failure_reason: What went wrong in the original attempt
  • improvement: Key lesson or approach that would have helped
  • missed_information: Information sources or considerations that were missed
  • weight: Quality score (0.1-2.0) updated based on usefulness
  • category: Domain category for filtering

Key Principles

  1. Cost-Aware Verification: Only do contrastive evaluation when the cost is justified — don't waste tokens/time on expensive double-runs
  2. Feedback is Gold: User corrections, test failures, and error messages are direct learning signals — always store these
  3. Selective Acquisition: Only store experiences that contain a genuine, actionable lesson
  4. Weight-based Retrieval: Good experiences rise, bad ones fade
  5. Task-Specific Guidelines: Don't apply raw experiences — synthesize them into actionable guidance
  6. When in Doubt, Store: It's better to store a potentially useful experience than to miss a lesson; low-quality experiences will naturally decay via weight updates

Manual Commands

View all experiences:

python \x3Cscripts-dir>/list_experiences.py

Search experiences:

python \x3Cscripts-dir>/retrieve.py --query "your search query" --top-k 5

Get statistics:

python \x3Cscripts-dir>/stats.py
Usage Guidance
This skill appears to do what it claims: it stores and retrieves 'experiences' locally to help produce task-specific guidelines and adjust weights based on verification. Before installing/using: (1) review the included Python files (they are small and local) — no network/exfiltration code is present; (2) avoid passing secrets or private data (API keys, passwords, private messages, proprietary code) into the add/update/retrieve commands because those strings are stored in plaintext under ~/.live-evo; (3) note that a referenced bundled seed path (experiences/experience_db.jsonl) is looked up on first run — if present it will be copied into ~/.live-evo, otherwise nothing is copied; (4) if you want less persistence, run the scripts with a separate working directory or periodically delete/rotate ~/.live-evo; (5) if you plan to share outputs produced using retrieved experiences, review them first to ensure they don't leak stored sensitive inputs.
Capability Analysis
Type: OpenClaw Skill Name: live-evo Version: 0.1.0 The 'live-evo' skill is a self-evolving memory system designed to help an AI agent learn from past tasks and mistakes. It stores experience data locally in `~/.live-evo/` and provides Python scripts (`experience_manager.py`, `retrieve.py`, `update.py`) to manage, search, and weight these experiences. The code uses standard libraries, contains no network activity or data exfiltration logic, and the instructions in `SKILL.md` are strictly aligned with the stated purpose of task improvement and verification.
Capability Assessment
Purpose & Capability
The skill name/description (self-evolving memory for verifiable tasks) matches what the code and SKILL.md do: add, list, retrieve, generate guidelines from, and reweight past experiences. There are no unrelated requirements (no credentials, no unrelated binaries).
Instruction Scope
SKILL.md instructs the agent to run the included scripts and explicitly tells how to call them; those scripts only read/write the experience DB and local weight history. They do not read other system configuration or environment variables. Note: SKILL.md and the scripts insist on persistent storage at ~/.live-evo, so any task text, user feedback, or answers you pass to these scripts will be stored on disk.
Install Mechanism
There is no install spec — this is instruction-only plus bundled Python scripts. No packages are downloaded or executed from external URLs. Risk from installation is low; executing the provided Python scripts runs local code included in the skill package.
Credentials
The skill requests no environment variables or external credentials (proportional). However, it persistently stores user-provided inputs (questions, failure reasons, improvements) under ~/.live-evo/experience_db.jsonl and writes weight history to ~/.live-evo/weight_history.jsonl. This is expected for a memory system but may capture sensitive data if you pass secrets or private content into the scripts.
Persistence & Privilege
always is false and model invocation is allowed (normal). The skill writes its own data under the user's home directory (~/.live-evo) which is a reasonable level of persistence for a memory skill, but it is persistent across agent runs and not encrypted or access-controlled by the skill. The skill does not modify other skills or system-wide settings.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install live-evo
  3. After installation, invoke the skill by name or use /live-evo
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v0.1.0
Initial release of live-evo: a self-evolving, task-aware memory system - Introduces an adaptive memory system that learns from verifiable tasks and user feedback. - Supports automatic retrieval of relevant past experiences and synthesizes them into actionable, task-specific guidelines. - Provides workflows for both contrastive (two-attempt) evaluation and direct application with feedback-driven learning. - Persists experience data and maintains a weight-based system for prioritizing high-quality lessons. - Includes command-line scripts for retrieval, updating, and managing experiences.
Metadata
Slug live-evo
Version 0.1.0
License
All-time Installs 1
Active Installs 1
Total Versions 1
Frequently Asked Questions

What is Live Evo: Online Evolution with verified experiences?

Self-evolving memory system that learns from verifiable tasks. Use when completing tasks where you can verify the outcome (coding, predictions, analysis). Au... It is an AI Agent Skill for Claude Code / OpenClaw, with 317 downloads so far.

How do I install Live Evo: Online Evolution with verified experiences?

Run "/install live-evo" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Live Evo: Online Evolution with verified experiences free?

Yes, Live Evo: Online Evolution with verified experiences is completely free (open-source). You can download, install and use it at no cost.

Which platforms does Live Evo: Online Evolution with verified experiences support?

Live Evo: Online Evolution with verified experiences is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Live Evo: Online Evolution with verified experiences?

It is built and maintained by YaolunZhang (@mercury7353); the current version is v0.1.0.

💬 Comments