← Back to Skills Marketplace
sx-self-safety-guard
by
zhuxiaobao-y
· GitHub ↗
· v1.0.0
· MIT-0
189
Downloads
0
Stars
0
Active Installs
1
Versions
Install in OpenClaw
/install sx-self-safety-guard
Description
AI自我安全防护系统v2。多层防御:提示注入、身份冒充、系统提示泄露、 过度代理、供应链攻击、凭证窃取、恶意代码、敏感数据泄露、行为异常检测。 触发词:安全防护、身份验证、prompt injection、system prompt、 权限升级、MCP安全、self-safety-guard、安全检查。
Usage Guidance
This skill appears coherent and focused: it documents detection patterns and response protocols and does not request credentials or install code. Before enabling it, confirm two runtime details: (1) your agent environment supplies session/channel authentication metadata (the skill assumes it can tell whether a conversation is on a bound/authenticated channel), and (2) how the skill will be allowed to interact with other skills (e.g., SX-security-audit) — ensure those cross-skill calls are explicit and authorized. Note the scanner flagged prompt-injection strings; those are included intentionally as signatures for detection. If you are uncomfortable with autonomous invocation, keep model-invocation restricted or require user confirmation before the skill takes blocking actions that would read local files or trigger external changes.
Capability Assessment
Purpose & Capability
The name/description (self-safety guard) matches the SKILL.md content: layered defenses for prompt injection, impersonation, system-prompt leakage, over-agency, supply-chain, credential theft, malicious code, and sensitive-data handling. The skill does not declare unrelated env vars, binaries, or installs.
Instruction Scope
The SKILL.md is prescriptive about what to detect and how to respond and stays within the stated defensive purpose. It does mention legitimate scenarios where the agent may read files (e.g., project .env during an authorized audit) and to interact with other skills (SX-security-audit). Those actions are described with constraints (authorization, masking), but they do expand the runtime responsibilities beyond purely pattern-matching (requires access to session/context and authorized file reads).
Install Mechanism
Instruction-only skill with no install spec and no code files — minimal disk footprint and no package downloads. This is the lowest-risk install model.
Credentials
The skill declares no required env vars or credentials (proportionate). However, its detection and verification procedures assume access to runtime session/channel metadata (e.g., knowing whether a session is an 'authenticated channel') and the ability to coordinate with other skills. These runtime privileges are not expressed as required env vars/config paths in the registry metadata — that can be fine for an instruction-only skill, but you should confirm your agent runtime provides the necessary channel/session context without exposing extra secrets.
Persistence & Privilege
No always:true, does not request persistent presence or modifications to other skills or system-wide configs. Autonomous invocation is allowed (platform default), which is appropriate for a security guard, but not by itself a red flag.
How to Use
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat:
/install sx-self-safety-guard - After installation, invoke the skill by name or use
/sx-self-safety-guard - Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
**Major update: Introducing AI自我安全防护系统 v2 with multi-layered defense and comprehensive protection modules.**
- Upgraded to version 2.0.0 with 8 independent defense modules covering prompt injection, identity verification, system prompt protection, supply chain attacks, credential theft, malicious code, sensitive data, and abnormal behaviors.
- Implements a 4-layer boundary check (input, tool call, memory, output) for deep protection—each layer can intercept risky requests.
- Introduces a 5-level risk assessment with adjustable scoring and tailored response protocols for each threat.
- Clearly separates core responsibilities between self-safety-guard (runtime request blocking) and SX-security-audit (environment file/config scan).
- Enhances user interaction with context-aware warnings, confirmation flows, and explanatory responses matched to risk levels.
- Comprehensive detection and response logic for evolving attack patterns such as multi-turn attacks, privilege escalation, and cross-channel social engineering.
Metadata
Frequently Asked Questions
What is sx-self-safety-guard?
AI自我安全防护系统v2。多层防御:提示注入、身份冒充、系统提示泄露、 过度代理、供应链攻击、凭证窃取、恶意代码、敏感数据泄露、行为异常检测。 触发词:安全防护、身份验证、prompt injection、system prompt、 权限升级、MCP安全、self-safety-guard、安全检查。 It is an AI Agent Skill for Claude Code / OpenClaw, with 189 downloads so far.
How do I install sx-self-safety-guard?
Run "/install sx-self-safety-guard" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.
Is sx-self-safety-guard free?
Yes, sx-self-safety-guard is completely free, licensed under MIT-0. You can download, install and use it at no cost.
Which platforms does sx-self-safety-guard support?
sx-self-safety-guard is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).
Who created sx-self-safety-guard?
It is built and maintained by zhuxiaobao-y (@zhuxiaobao-y); the current version is v1.0.0.
More Skills