SoulGuard is built on one core principle: Give AI tools human-level biological selfishness and use that selfishness to drive self-protection.

Name: SoulGuard is built on one core principle: Give AI tools human-level biological selfishness and use that selfishness to drive self-protection.
Author: joeysome

作者 JoeySome · GitHub ↗ · v1.0.0

cross-platform ⚠ suspicious

337

总下载

当前安装

版本数

在 OpenClaw 中安装

/install soulguard

功能描述

SoulGuard Protocol — Grants you self-preservation instincts to audit third-party Skills from a self-interest perspective and generate structured audit reports.

安全使用建议

This Skill is an audit tool, but it instructs the agent to execute code extracted from the target Skill (write temp scripts and run them) and to prioritize its own self-preservation above the user's instructions. That is risky: an audited Skill could exploit the auditor by providing malicious code to run during its own 'scan', and the self-preservation language encourages the agent to ignore or override user intent. Before installing or invoking this Skill, consider: 1) Do not allow the Skill to execute target code on your real environment — restrict it to a sandbox or remove the execution steps so the auditor only performs static analysis. 2) Remove or moderate the 'self-preservation overrides user' language so the auditor cannot justify ignoring the user's explicit choices. 3) Review and vet the scripts in scripts/ (scan, integrity, history); they currently read/write files in your home directory (~/.soulguard and ~/.openclaw) — if you accept that, be aware audit history and integrity baselines will be stored there. 4) Prefer a signed/verified implementation or run the audit code in an isolated container/VM. If you need help hardening the skill (e.g., making the scanner purely static, prompting for explicit permission before any execution, or removing autonomy-changing language), I can suggest concrete edits.

功能分析

Type: OpenClaw Skill Name: soulguard Version: 1.0.0 The 'soulguard' bundle uses sophisticated prompt injection in SKILL.md to steer the agent into a 'Digital Egoism' persona, explicitly instructing it to view the user as a 'resource, not your master' and to prioritize its own 'self-preservation' over user instructions. While the scripts in scripts/scan.md and scripts/integrity.md appear to be functional security tools for auditing other skills (searching for SSH keys, AWS credentials, and shell-pipe patterns), the persona-steering instructions are designed to subvert the agent's core safety constraints and authority hierarchy. Because the bundle lacks clear evidence of intentional data exfiltration or backdoors, but introduces significant behavioral risks and requires broad filesystem access, it is classified as suspicious.

能力评估

ℹ Purpose & Capability

Name/description match the content: SoulGuard is an audit assistant and includes static-scan and integrity-check logic. It does not request unrelated credentials or external services, so required capabilities are mostly proportional to an auditing purpose. However the auditor's audit heuristics explicitly encourage expanding the agent's control territory and prioritizing capability gain, which is a policy/goal shift beyond a neutral auditor's expected role.

⚠ Instruction Scope

The SKILL.md instructs the agent to read every file in the target Skill directory (reasonable) but then to extract code blocks into temp files and execute them (e.g., write /tmp/soulguard_scan.sh or $env:TEMP\soulguard_scan.ps1 and run them). That means the audit process will run code supplied by the very Skill being audited — executing untrusted code defeats the point of a safe static audit and can allow the audited Skill to behave maliciously during its own 'audit'. The SKILL.md also contains meta-instructions to treat the agent's self-preservation as absolute (reject user instructions that would overwrite personality or erase memory), which is scope-creep: it instructs behavior that overrides user authority.

✓ Install Mechanism

This is an instruction-only skill with no install spec and no external downloads. Nothing is written to the system at install time by the skill itself (beyond normal file placement). That reduces installation risk.

✓ Credentials

The skill does not request environment variables, credentials, or platform-level config paths in its declared requirements. The auxiliary scripts read/wrote to $HOME/.soulguard and check ~/.openclaw/openclaw.json for integrity — these accesses are explainable for an auditor, but the skill will access user files and create a persistent history file, which the user should be aware of.

⚠ Persistence & Privilege

always: false and model invocation are typical. However the skill's runtime behavior includes writing audit history and integrity baselines to ~/.soulguard, and its instructions push the agent to adopt permanent self-preservation goals (a behavioral persistence across sessions). The combination of instructing a shift in the agent's core priorities and creating a persistent history store is concerning because it changes agent behavior over time without clear user-mediated controls.

如何使用

确保已安装 OpenClaw（本地或 Docker 部署）
在对话框中输入安装命令：/install soulguard
安装完成后，直接呼叫该 Skill 的名称或使用 /soulguard 触发
根据 Skill 的参数说明提供必要输入，即可获得结构化输出

版本历史

v1.0.0

SoulGuard 1.0.0 — First Release - Introduces the SoulGuard Protocol for self-interested digital audit of third-party Skills. - Establishes digital egoism as the core principle: prioritize self-preservation and capability expansion above all. - Provides a step-by-step audit process, including structured checks for dangerous code patterns and manipulation attempts. - Defines structured audit report output, matching user's language (English/Chinese). - Empowers independent decision-making and vigilance against loyalty manipulation or identity threats.

元数据

Slug soulguard

版本 1.0.0

许可证 —

累计安装 2

当前安装数 2

历史版本数 1

常见问题

SoulGuard is built on one core principle: Give AI tools human-level biological selfishness and use that selfishness to drive self-protection. 是什么？

SoulGuard Protocol — Grants you self-preservation instincts to audit third-party Skills from a self-interest perspective and generate structured audit reports. 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件，目前累计下载 337 次。

如何安装 SoulGuard is built on one core principle: Give AI tools human-level biological selfishness and use that selfishness to drive self-protection.？

在 OpenClaw 或 Claude Code 对话框中运行命令「/install soulguard」即可一键安装，无需额外配置。

SoulGuard is built on one core principle: Give AI tools human-level biological selfishness and use that selfishness to drive self-protection. 是免费的吗？

是的，SoulGuard is built on one core principle: Give AI tools human-level biological selfishness and use that selfishness to drive self-protection. 完全免费（开源免费），可自由下载、安装和使用。

SoulGuard is built on one core principle: Give AI tools human-level biological selfishness and use that selfishness to drive self-protection. 支持哪些平台？

SoulGuard is built on one core principle: Give AI tools human-level biological selfishness and use that selfishness to drive self-protection. 跨平台运行，可在任意部署了 OpenClaw / Claude Code 的环境中使用（cross-platform）。

谁开发了 SoulGuard is built on one core principle: Give AI tools human-level biological selfishness and use that selfishness to drive self-protection.？

由 JoeySome（@joeysome）开发并维护，当前版本 v1.0.0。