← Back to Skills Marketplace

Improvement Generator

Name: Improvement Generator
Author: lanyasheng

by _silhouette · GitHub ↗ · v1.0.0 · MIT-0

cross-platform ⚠ suspicious

Downloads

Stars

Active Installs

Versions

Install in OpenClaw

/install auto-improvement-generator

Description

当需要为目标 skill 生成改进候选、把上次失败信息注入下一轮生成、或分析历史记忆模式来避免重复失败时使用。支持 --trace 注入失败上下文。不用于打分（用 improvement-discriminator）或评估（用 improvement-learner）。

README (SKILL.md)

Improvement Generator

Produces ranked improvement candidates from target analysis, feedback signals, and failure traces.

When to Use

为目标 skill 生成结构化改进候选
把上次失败的 trace 注入下一轮（GEPA trace-aware）
根据记忆模式避开已经失败过 >=3 次的策略

When NOT to Use

给候选打分 → use improvement-discriminator
评估 skill 结构 → use improvement-learner
全流程 → use improvement-orchestrator

CLI

python3 scripts/propose.py \
  --target /path/to/skill \        # REQUIRED: skill directory or single file
  --state-root /path/to/state \    # default: lib/state_machine.DEFAULT_STATE_ROOT
  --source memory.json \           # repeatable: feedback/memory/baseline-failures sources
  --max-candidates 4 \             # default 4: max candidates to generate
  --trace failure_trace.json \     # inject prior failure trace for retry prioritization
  --run-id custom-run-id \         # default: auto-generated from target
  --output candidates.json \       # default: {state-root}/candidate_versions/{run-id}.json
  --lane generic-skill             # default: generic-skill

Param	Default	When to change
`--max-candidates`	4	Lower to 2 for fast iteration; raise for diverse exploration
`--trace`	None	Pass when retrying after gate revert — deprioritizes failed category
`--source`	[]	Add feedback.jsonl, memory files, or evaluator baseline-failures.json
`--run-id`	auto	Set explicitly when integrating with external tracking

6 Candidate Categories

Category	Risk	Executor Support	Description
`docs`	low	Yes (`append_markdown_section`)	Append operator notes/limitations to Markdown docs
`reference`	low	Yes (`append_markdown_section`)	Add control-plane-friendly notes to reference files
`guardrail`	low	Yes (`append_markdown_section`)	Add conservative auto-promote rules to guardrail docs
`prompt`	medium	No	SKILL.md prompt restructure (requires manual review)
`workflow`	medium	No	Workflow adapter/orchestration hook changes
`tests`	medium	No	Smoke-check/validation test cases

Trace-Aware Generation

When --trace is provided, adjust_candidates_from_trace() deprioritizes the category that failed in the prior run and boosts alternatives:

failure_trace.json: {"candidate_id": "cand-01-docs", "reason": "gate rejected"}
→ docs candidates moved to end, reference/guardrail candidates boosted to front

Evaluator-Driven Fix (`_find_evaluator_failures` + `_llm_propose_skill_fix`)

When --source includes a baseline-failures.json (type=evaluator_baseline_failures), the generator:

Reads failed task details (task_id, score, error)
Sends current SKILL.md + failures to claude -p to get a targeted fix
Returns an eval-fix candidate as highest priority (risk_level=low, executor_support=True)

Correction Hotspots (`_find_correction_hotspots`)

Scans feedback.jsonl sources for user correction events (outcome=correction|partial). Returns dimension_hint → count mapping used to prioritize candidates that address the most-corrected dimensions.

\x3Cexample> 正确: 第一次生成 + 有 evaluator baseline failures $ python3 scripts/propose.py --target /path/to/skill --source baseline-failures.json --state-root ./state → 候选 1: LLM-proposed SKILL.md fix targeting failed tasks (category=prompt, risk=low) → 候选 2-4: template candidates (docs, reference, guardrail) → stdout: /state/candidate_versions/run-001.json \x3C/example>

\x3Canti-example> 错误: 同一个 category 失败 3 次后还继续重试 → 应该用 --trace 注入失败信息让 generator 自动切换到其他 category \x3C/anti-example>

Output Artifact

{"schema_version": "1.0", "run_id": "...", "stage": "proposed",
 "candidates": [{"id": "cand-01-docs", "category": "docs", "risk_level": "low",
   "execution_plan": {"action": "append_markdown_section", "section_heading": "## Operator Notes",
     "content_lines": ["..."]}, ...}],
 "failure_trace_used": false, "truth_anchor": "/state/candidate_versions/run-001.json"}

Related Skills

improvement-discriminator: Scores the candidates this skill produces → called by orchestrator as stage 2
improvement-orchestrator: Calls generator as stage 1, passes --source with failure traces
improvement-evaluator: Baseline failures fed back as --source to inform candidate generation

Usage Guidance

This skill appears to do what it says (generate candidate improvements) and the included Python implements that logic. However, SKILL.md states that when a baseline-failures source is present it will send the SKILL.md plus failures to "claude -p" to propose fixes — but the skill manifest does not declare any binary or API key requirements. Before installing or running: 1) Inspect the full scripts/propose.py (search for any subprocess/requests calls or literal 'claude' usage) to confirm whether it invokes an external CLI or network endpoint. 2) If it does call an external LLM, verify where credentials would be provided and whether any sensitive files (SKILL.md, state, or feedback) would be transmitted; require explicit consent and a dedicated API key. 3) Run the tool in a sandboxed environment or on non-sensitive test data first. 4) Ask the author/maintainer to update the manifest to declare required binaries and env vars (e.g., CLAUDE_API_KEY or required CLI) and to document exactly what data is sent externally. If you cannot confirm the external-call behavior, treat the skill as potentially exfiltrating contextual files and avoid running it on private/production skill directories.

Capability Analysis

Type: OpenClaw Skill Name: auto-improvement-generator Version: 1.0.0 The improvement-generator skill bundle is a legitimate tool designed to analyze skill performance and propose structured improvements. The core logic in `scripts/propose.py` generates candidates for documentation updates, guardrail additions, and prompt refinements based on feedback and failure traces. While it utilizes an external LLM via the `claude` CLI to suggest targeted fixes for failed tasks, it does so using safe subprocess handling and restricts automated execution to low-risk markdown modifications. The skill demonstrates security awareness by marking complex workflow and prompt changes as requiring manual review, and no indicators of malicious intent, data exfiltration, or unauthorized persistence were found.

Capability Assessment

⚠ Purpose & Capability

Name/description match the included code: the tool generates improvement candidates from a target skill, feedback, and failure traces. However, SKILL.md documents an evaluator-driven fix path that 'sends current SKILL.md + failures to `claude -p`' which implies use of an external LLM CLI or service. The skill declares no required binaries or credentials, so either the external LLM call is optional/undeclared or the manifest is incomplete.

⚠ Instruction Scope

Runtime instructions and the script read target skill files, state roots, and feedback/failure JSONs (expected). But SKILL.md explicitly describes sending SKILL.md and failures to an external LLM (Claude) for automated fixes; that behavior can transmit contextual files to an external endpoint and is not reflected in the skill's declared requirements. The instructions otherwise stay within the stated purpose (generate candidates and adjust based on trace).

✓ Install Mechanism

No install spec is present and the skill is instruction-only with local Python scripts. There is no external download or archive extraction. This is low-risk from an installation perspective.

⚠ Credentials

The skill declares no required environment variables or credentials, yet SKILL.md implies invoking an external LLM CLI/service (Claude). Calling such a service typically requires either a CLI binary or API key(s). The absence of declared binaries/env vars is a mismatch and could hide a requirement for potentially sensitive credentials or an undeclared dependency.

✓ Persistence & Privilege

Flags show always:false and no config paths or system-wide changes are requested. The skill does not request persistent system privileges or automatic always-on installation.

How to Use

Make sure OpenClaw is installed (local or Docker)
Run the install command in chat: /install auto-improvement-generator
After installation, invoke the skill by name or use /auto-improvement-generator
Provide required inputs per the skill's parameter spec and get structured output

Version History

v1.0.0

Initial release: closed-loop skill improvement pipeline

Metadata

Slug auto-improvement-generator

Version 1.0.0

License MIT-0

All-time Installs 0

Active Installs 0

Total Versions 1

Frequently Asked Questions

What is Improvement Generator?

当需要为目标 skill 生成改进候选、把上次失败信息注入下一轮生成、或分析历史记忆模式来避免重复失败时使用。支持 --trace 注入失败上下文。不用于打分（用 improvement-discriminator）或评估（用 improvement-learner）。 It is an AI Agent Skill for Claude Code / OpenClaw, with 84 downloads so far.

How do I install Improvement Generator?

Run "/install auto-improvement-generator" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Improvement Generator free?

Yes, Improvement Generator is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Improvement Generator support?

Improvement Generator is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Improvement Generator?

It is built and maintained by _silhouette (@lanyasheng); the current version is v1.0.0.

More Skills

Improvement Generator

Improvement Generator

When to Use

When NOT to Use

CLI

6 Candidate Categories

Trace-Aware Generation

Evaluator-Driven Fix (_find_evaluator_failures + _llm_propose_skill_fix)

Correction Hotspots (_find_correction_hotspots)

Output Artifact

Related Skills

What is Improvement Generator?

How do I install Improvement Generator?

Is Improvement Generator free?

Which platforms does Improvement Generator support?

Who created Improvement Generator?

💬 Comments

Evaluator-Driven Fix (`_find_evaluator_failures` + `_llm_propose_skill_fix`)

Correction Hotspots (`_find_correction_hotspots`)