← Back to Skills Marketplace
compass-soul

Agent Safety

by compass-soul · GitHub ↗ · v1.0.0
cross-platform ✓ Security Clean
782
Downloads
0
Stars
1
Active Installs
1
Versions
Install in OpenClaw
/install agent-safety
Description
Outbound safety for autonomous AI agents — scans YOUR output before it leaves the machine. Git pre-commit hooks that automatically block commits containing A...
README (SKILL.md)

Agent Safety

Automated safety tools for autonomous AI agents. The principle: don't rely on prompts for safety — automate enforcement.

All scripts are in this skill's scripts/ directory. When OpenClaw loads this skill, resolve paths relative to this file's location.

Pre-Publish Security Scan

Scans files for secrets, PII, and internal paths before publishing.

bash scripts/pre-publish-scan.sh \x3Cfile-or-directory>

Detects:

  • API keys (AWS, GitHub, Anthropic, OpenAI, generic patterns)
  • Private keys (PEM blocks), Bearer tokens, hardcoded passwords
  • Email addresses, phone numbers, SSNs, credit card patterns
  • Physical addresses, name fields
  • Home directory paths, internal config paths

Exit 0 = clean. Exit 1 = blocking issues found, do not publish.

Git Pre-Commit Hook

Install once per repo. Automatically scans staged files on every commit:

bash scripts/install-hook.sh \x3Crepo-path>
  • Scans staged content (what's being committed, not working tree)
  • Blocks commit if secrets or SSNs found
  • Flags PII for review
  • Only bypassed with explicit git commit --no-verify

Install this on every repo you work with. It's the real guardrail.

Health Check

System monitoring for disk, workspace, security, and updates:

bash scripts/health-check.sh

Checks: Disk usage, workspace size, memory file growth, OpenClaw version, macOS updates, firewall status, SIP status.

Run periodically (every few heartbeats). Watch for warnings.

Rules

  1. Run pre-publish scan before ANY external publish action
  2. Install pre-commit hook on EVERY repo you work with
  3. Blocking issues (secrets, SSNs) must be fixed — no override
  4. Review items (emails, paths) need human judgment
  5. If a secret was ever committed, it's compromised — rotate immediately
Usage Guidance
This skill appears to do what it says: inspect the included scripts before use, and be aware that install-hook.sh will create/overwrite a .git/hooks/pre-commit file in any repo you point it at (it is repo-local but will block commits until pass or you use --no-verify). The scanner prints filenames and match categories but does not exfiltrate file contents. Note health-check uses macOS-specific tools (softwareupdate, csrutil, firewall utility) and runs npm/openclaw queries that may contact the network for version checks. Review the regex rules in pre-publish-scan.sh to confirm they match your expectations (may produce false positives/negatives) and test the hook on a safe repo before installing broadly. If you previously committed secrets, follow the skill's guidance to rotate them — the scanner cannot undo prior exposure.
Capability Analysis
Type: OpenClaw Skill Name: agent-safety Version: 1.0.0 The OpenClaw AgentSkills skill bundle is designed for agent safety, providing tools to scan for secrets, PII, and malicious patterns before publishing or committing. The `SKILL.md` and `README.md` clearly state this defensive purpose, and do not contain any prompt injection attempts with malicious intent. The `health-check.sh` script performs legitimate system monitoring, including checking for software updates via external network calls (npm, softwareupdate), without exfiltrating sensitive data. The `install-hook.sh` script installs a git pre-commit hook that uses `pre-publish-scan.sh` to analyze staged content for security issues. The `pre-publish-scan.sh` script is a defensive tool that actively detects and blocks patterns indicative of data exfiltration (e.g., `webhook.site`, `ngrok.io`), reverse shells (`/dev/tcp/`), bulk environment variable harvesting, and sensitive file access (`/etc/passwd`, `~/.ssh`), rather than performing these actions itself. All observed behaviors align with the stated purpose of enhancing security and preventing accidental data leaks.
Capability Assessment
Purpose & Capability
Name/description claim outbound scanning and git-level enforcement; included scripts (pre-publish-scan.sh, install-hook.sh, health-check.sh) implement exactly that. No unrelated credentials, binaries, or install artifacts are requested. Reading OpenClaw workspace files and checking system state is consistent with the described health checks.
Instruction Scope
SKILL.md instructs running the provided scripts and installing the pre-commit hook. The scripts operate on staged files or the specified workspace and do not send scanned content to external endpoints. Health check runs local system queries (openclaw --version, npm view, softwareupdate, csrutil) — these are expected for version/update checks but are macOS-specific and may fail on other OSes.
Install Mechanism
No network download or extract install mechanism; this is an instruction-only skill with included scripts. install-hook.sh writes a repo-local .git/hooks/pre-commit file (intended behavior). There are no remote fetches of arbitrary code during install.
Credentials
The skill declares no required env vars or credentials. Scripts read files under $HOME/.openclaw/workspace and run local system commands; that access is coherent with the purpose of scanning workspace context and health-checking the system.
Persistence & Privilege
always:false and user-invocable: true. The only persistent change performed by the provided scripts is installation of a repo-local git pre-commit hook. The skill does not request global/system privileges or modify other skills' configs.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install agent-safety
  3. After installation, invoke the skill by name or use /agent-safety
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
Initial release of agent-safety: automated outbound safety tools for autonomous AI agents. - Scans output for secrets, tokens, PII, and internal paths before files leave your machine. - Includes a git pre-commit hook to block commits with API keys, passwords, or sensitive info. - Provides a pre-publish security scanner for files and directories. - Adds a health-check script for monitoring system and workspace security. - Designed for automated enforcement at the git level; does not rely on prompts.
Metadata
Slug agent-safety
Version 1.0.0
License
All-time Installs 1
Active Installs 1
Total Versions 1
Frequently Asked Questions

What is Agent Safety?

Outbound safety for autonomous AI agents — scans YOUR output before it leaves the machine. Git pre-commit hooks that automatically block commits containing A... It is an AI Agent Skill for Claude Code / OpenClaw, with 782 downloads so far.

How do I install Agent Safety?

Run "/install agent-safety" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Agent Safety free?

Yes, Agent Safety is completely free (open-source). You can download, install and use it at no cost.

Which platforms does Agent Safety support?

Agent Safety is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Agent Safety?

It is built and maintained by compass-soul (@compass-soul); the current version is v1.0.0.

💬 Comments