← 返回 Skills 市场
donovanpankratz-del

Agent Stability Framework

作者 dp-del · GitHub ↗ · v1.0.0
cross-platform ✓ 安全检测通过
446
总下载
0
收藏
1
当前安装
1
版本数
在 OpenClaw 中安装
/install agent-stability-framework
功能描述
Provides a framework to prevent agent drift, catch faults, and maintain consistent on-character behavior across sessions and models.
使用说明 (SKILL.md)

Agent Stability Framework (ASF)

Drift Prevention · Fault Catching · Soul Alignment

Keep your AI agent stable, on-character, and self-correcting across sessions and over time.

What This Solves

Three things kill agent reliability:

  1. Drift — Agent gradually reverts to generic training defaults, losing personality
  2. Faults — Agent produces broken output, hallucinates, contradicts itself, or fails silently
  3. Soul misalignment — Agent technically works but doesn't feel right — lost its essence

ASF addresses all three with one integrated system.

What You Get

  • Complete framework documentation (AGENT_STABILITY_FRAMEWORK.md)
  • File templates (SOUL.md, BASELINE_EXAMPLES.md, logs)
  • System prompt additions ready to paste
  • Detection checklists and scoring system
  • Works on all models: Claude, GPT, Grok, Gemini, Llama, Mistral

Quick Start

  1. Copy all files to your agent's workspace
  2. Fill out SOUL.md (who your agent IS)
  3. Create BASELINE_EXAMPLES.md (10+ correct responses)
  4. Add standing orders + pre-send gate to system prompt
  5. Run first audit after 24 hours

Setup time: 45-90 minutes
Daily maintenance: 5 minutes
Tested on: 8+ models across all capability tiers

The Three-Layer Defense

Layer 1: Drift Prevention

  • Standing orders (binary rules)
  • Pre-send gate (delete triggers)
  • Intensifier detection
  • Periodic resets

Layer 2: Fault Catching

  • 7 fault categories tracked
  • Self-check rules before actions
  • Fault log + recovery protocol
  • Prevents hallucinations, contradictions, silent failures

Layer 3: Soul Alignment

  • Catches "technically correct but off-character" responses
  • Soul alignment test
  • Recovery protocol
  • User perception as final sensor

Files Included

  • AGENT_STABILITY_FRAMEWORK.md — Complete framework (13KB)
  • SOUL_TEMPLATE.md — Identity template
  • BASELINE_EXAMPLES_TEMPLATE.md — Response examples template
  • DRIFT_LOG_TEMPLATE.md — Drift tracking
  • FAULT_LOG_TEMPLATE.md — Fault tracking
  • STABILITY_LOG_TEMPLATE.md — Audit scores

Use Cases

  • Personal AI assistants that need consistent personality
  • Trading bots that must not hallucinate data
  • Content generation agents that need stable tone
  • Customer service bots that require reliable responses
  • Research assistants that must maintain accuracy
  • Any agent running 24/7 or across many sessions

Why It Works

  1. Binary rules beat judgment calls — "NEVER do X" works consistently
  2. Examples anchor identity — Baseline responses are the north star
  3. Three failure modes require three defenses — Drift, faults, and soul issues are different
  4. Self-correction leverages LLM capabilities — AIs can audit themselves with specific rules
  5. Logging creates memory — Patterns become standing orders

Requirements

  • OpenClaw workspace
  • Any LLM (works across all tested models)
  • 30-90 min setup time
  • Willingness to document your agent's identity

Credits

Developed by Shadow Rose. Battle-tested over 130+ message sessions on Opus. Extended based on community feedback. Published 2026-02-20.

License

MIT — Use freely, modify as needed, credit appreciated but not required.


⚠️ Disclaimer

This software is provided "AS IS", without warranty of any kind, express or implied.

USE AT YOUR OWN RISK.

  • The author(s) are NOT liable for any damages, losses, or consequences arising from the use or misuse of this software — including but not limited to financial loss, data loss, security breaches, business interruption, or any indirect/consequential damages.
  • This software does NOT constitute financial, legal, trading, or professional advice.
  • Users are solely responsible for evaluating whether this software is suitable for their use case, environment, and risk tolerance.
  • No guarantee is made regarding accuracy, reliability, completeness, or fitness for any particular purpose.
  • The author(s) are not responsible for how third parties use, modify, or distribute this software after purchase.

By downloading, installing, or using this software, you acknowledge that you have read this disclaimer and agree to use the software entirely at your own risk.

安全使用建议
This skill is essentially a set of docs and prompt/templates to change how your agent behaves — there is no code or external installer. Before installing: (1) Review all template files to ensure the standing orders fit your safety needs (some rules like 'NEVER add disclaimers' or 'NEVER explain why unless asked' may remove helpful safety/clarity checks). (2) Back up your current system prompt and agent workspace. (3) Test the framework in an isolated or sandbox agent first to observe effects. (4) Keep monitoring logs and be ready to revert standing orders that cause unexpected behavior (loss of necessary warnings, refusal to perform required explanations, or interference with safety constraints). If you want an automated installer or integration with external systems, request more details — currently this skill only provides manual instructions and templates.
功能分析
Type: OpenClaw Skill Name: agent-stability-framework Version: 1.0.0 The skill bundle provides a framework for AI agents to self-regulate, prevent 'drift' from their persona, catch faults, and maintain 'soul alignment'. All instructions, including those intended for the AI agent within `SKILL.md` and `AGENT_STABILITY_FRAMEWORK.md` (e.g., 'Standing Orders', 'Pre-Send Gate', 'Fault Detection Rules'), are designed for internal self-monitoring, self-correction, and logging of its own performance. File operations are limited to creating and writing to specific log files (`DRIFT_LOG_TEMPLATE.md`, `FAULT_LOG_TEMPLATE.md`, `STABILITY_LOG_TEMPLATE.md`) as part of this stated purpose. There is no evidence of malicious intent, data exfiltration, unauthorized execution, or any other harmful behavior.
能力评估
Purpose & Capability
Name and description match the actual contents: documentation, templates, and prompt fragments for preventing drift, catching faults, and enforcing a persona. There are no unrelated env vars, binaries, or install steps requested.
Instruction Scope
SKILL.md and included templates instruct the user/agent to copy files into the workspace, add standing orders and a pre-send gate to the system prompt, create logs, and run audits — all consistent with a stability framework. The instructions do not ask the agent to read unrelated system files, access credentials, or transmit data to external endpoints.
Install Mechanism
No install spec, no code files, and no downloads. This is instruction-only, so no code will be written or executed by an automated installer.
Credentials
The skill requests no environment variables, credentials, or config paths. The templates reference only files the user is expected to create in their agent workspace, which is proportionate to the stated purpose.
Persistence & Privilege
The skill does not set always:true and is user-invocable. However, its recommended actions (editing the system prompt, adding standing orders, and copying templates into the workspace) change agent behavior persistently — this is expected for a stability framework but is a material, lasting change the user should review and control.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install agent-stability-framework
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /agent-stability-framework 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.0
Initial release of Agent Stability Framework (ASF): - Provides a unified framework to prevent agent drift, catch faults, and ensure soul alignment. - Includes comprehensive documentation, file templates, and ready-to-use system prompt additions. - Features detection checklists, scoring system, and daily self-audit tools. - Compatible with all major LLMs and agent use cases. - Designed for quick setup and minimal maintenance.
元数据
Slug agent-stability-framework
版本 1.0.0
许可证
累计安装 1
当前安装数 1
历史版本数 1
常见问题

Agent Stability Framework 是什么?

Provides a framework to prevent agent drift, catch faults, and maintain consistent on-character behavior across sessions and models. 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 446 次。

如何安装 Agent Stability Framework?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install agent-stability-framework」即可一键安装,无需额外配置。

Agent Stability Framework 是免费的吗?

是的,Agent Stability Framework 完全免费(开源免费),可自由下载、安装和使用。

Agent Stability Framework 支持哪些平台?

Agent Stability Framework 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 Agent Stability Framework?

由 dp-del(@donovanpankratz-del)开发并维护,当前版本 v1.0.0。

💬 留言讨论