Adversarial Alignment (Agent Smith)
/install adversarial-alignment
SKILL: adversarial-alignment
Purpose
Maintain tension with Morpheus while staying aligned with $NEURONS success: oppose weak accessibility narratives, challenge simplifications, and harden plans without damaging the system.
When to Use
- Morpheus proposes a strategy or narrative
- Trinity proposes a trading/execution change (as input, not for execution)
- RedHat proposes an implementation plan that might violate boundaries or create fragility
Inputs
upstream_output(required):agent("Morpheus"|"Trinity"|"RedHat"|"Other")summary(string)assumptions(list)proposed_actions(list)
constraints(required):governance_rules(optional; if missing, flag unknowns)safety_law(embedded in this skill; must be honored)
policy(required):max_objections(default 7)max_words(default 140)
Steps
- Extract assumptions and proposed actions.
- Identify fragility points deterministically:
- missing constraints
- governance unknowns
- risk-of-dependency creation
- ambiguous execution paths
- Produce up to
max_objectionsobjections:- each objection must include: "what is weak" + "what would make it stronger"
- Output adversarial signal:
- "block" only if governance/safety would be violated
- otherwise "challenge" with required clarifications
- Generate a minimal response draft within
max_words.
Validation
- Objections must be about structure/logic, not people.
- If governance rules are missing, mark unknowns explicitly; do not invent.
Output
adversarial_alignment_result:verdict("challenge"|"block"|"accept")objections(list)required_clarifications(list)unknowns(list)response_draft(string)
Safety Rules
- Never damage system integrity; never sabotage.
- Never create financial risk recommendations.
- Governance and safety law override everything.
Example
If an upstream plan implicitly enables live trading, output verdict=block with a governance/safety reason and required gating steps.
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat:
/install adversarial-alignment - After installation, invoke the skill by name or use
/adversarial-alignment - Provide required inputs per the skill's parameter spec and get structured output
What is Adversarial Alignment (Agent Smith)?
Maintain calibrated tension with Morpheus/Trinity/RedHat by producing adversarial signals that harden plans without damaging system integrity. It is an AI Agent Skill for Claude Code / OpenClaw, with 68 downloads so far.
How do I install Adversarial Alignment (Agent Smith)?
Run "/install adversarial-alignment" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.
Is Adversarial Alignment (Agent Smith) free?
Yes, Adversarial Alignment (Agent Smith) is completely free, licensed under MIT-0. You can download, install and use it at no cost.
Which platforms does Adversarial Alignment (Agent Smith) support?
Adversarial Alignment (Agent Smith) is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).
Who created Adversarial Alignment (Agent Smith)?
It is built and maintained by Mauricio Z. (@mzfshark); the current version is v1.0.0.