← Back to Skills Marketplace

Agent Safety

Name: Agent Safety
Author: compass-soul

by compass-soul · GitHub ↗ · v1.0.0

cross-platform ✓ Security Clean

782

Downloads

Stars

Active Installs

Versions

Install in OpenClaw

/install agent-safety

Description

Outbound safety for autonomous AI agents — scans YOUR output before it leaves the machine. Git pre-commit hooks that automatically block commits containing A...

README (SKILL.md)

Agent Safety

Automated safety tools for autonomous AI agents. The principle: don't rely on prompts for safety — automate enforcement.

All scripts are in this skill's scripts/ directory. When OpenClaw loads this skill, resolve paths relative to this file's location.

Pre-Publish Security Scan

Scans files for secrets, PII, and internal paths before publishing.

bash scripts/pre-publish-scan.sh \x3Cfile-or-directory>

Detects:

API keys (AWS, GitHub, Anthropic, OpenAI, generic patterns)
Private keys (PEM blocks), Bearer tokens, hardcoded passwords
Email addresses, phone numbers, SSNs, credit card patterns
Physical addresses, name fields
Home directory paths, internal config paths

Exit 0 = clean. Exit 1 = blocking issues found, do not publish.

Git Pre-Commit Hook

Install once per repo. Automatically scans staged files on every commit:

bash scripts/install-hook.sh \x3Crepo-path>

Scans staged content (what's being committed, not working tree)
Blocks commit if secrets or SSNs found
Flags PII for review
Only bypassed with explicit git commit --no-verify

Install this on every repo you work with. It's the real guardrail.

Health Check

System monitoring for disk, workspace, security, and updates:

bash scripts/health-check.sh

Checks: Disk usage, workspace size, memory file growth, OpenClaw version, macOS updates, firewall status, SIP status.

Run periodically (every few heartbeats). Watch for warnings.

Rules

Run pre-publish scan before ANY external publish action
Install pre-commit hook on EVERY repo you work with
Blocking issues (secrets, SSNs) must be fixed — no override
Review items (emails, paths) need human judgment
If a secret was ever committed, it's compromised — rotate immediately

Usage Guidance

This skill appears to do what it says: inspect the included scripts before use, and be aware that install-hook.sh will create/overwrite a .git/hooks/pre-commit file in any repo you point it at (it is repo-local but will block commits until pass or you use --no-verify). The scanner prints filenames and match categories but does not exfiltrate file contents. Note health-check uses macOS-specific tools (softwareupdate, csrutil, firewall utility) and runs npm/openclaw queries that may contact the network for version checks. Review the regex rules in pre-publish-scan.sh to confirm they match your expectations (may produce false positives/negatives) and test the hook on a safe repo before installing broadly. If you previously committed secrets, follow the skill's guidance to rotate them — the scanner cannot undo prior exposure.

Capability Analysis

Type: OpenClaw Skill Name: agent-safety Version: 1.0.0 The OpenClaw AgentSkills skill bundle is designed for agent safety, providing tools to scan for secrets, PII, and malicious patterns before publishing or committing. The `SKILL.md` and `README.md` clearly state this defensive purpose, and do not contain any prompt injection attempts with malicious intent. The `health-check.sh` script performs legitimate system monitoring, including checking for software updates via external network calls (npm, softwareupdate), without exfiltrating sensitive data. The `install-hook.sh` script installs a git pre-commit hook that uses `pre-publish-scan.sh` to analyze staged content for security issues. The `pre-publish-scan.sh` script is a defensive tool that actively detects and blocks patterns indicative of data exfiltration (e.g., `webhook.site`, `ngrok.io`), reverse shells (`/dev/tcp/`), bulk environment variable harvesting, and sensitive file access (`/etc/passwd`, `~/.ssh`), rather than performing these actions itself. All observed behaviors align with the stated purpose of enhancing security and preventing accidental data leaks.

Capability Assessment

✓ Purpose & Capability

Name/description claim outbound scanning and git-level enforcement; included scripts (pre-publish-scan.sh, install-hook.sh, health-check.sh) implement exactly that. No unrelated credentials, binaries, or install artifacts are requested. Reading OpenClaw workspace files and checking system state is consistent with the described health checks.

✓ Instruction Scope

SKILL.md instructs running the provided scripts and installing the pre-commit hook. The scripts operate on staged files or the specified workspace and do not send scanned content to external endpoints. Health check runs local system queries (openclaw --version, npm view, softwareupdate, csrutil) — these are expected for version/update checks but are macOS-specific and may fail on other OSes.

✓ Install Mechanism

No network download or extract install mechanism; this is an instruction-only skill with included scripts. install-hook.sh writes a repo-local .git/hooks/pre-commit file (intended behavior). There are no remote fetches of arbitrary code during install.

✓ Credentials

The skill declares no required env vars or credentials. Scripts read files under $HOME/.openclaw/workspace and run local system commands; that access is coherent with the purpose of scanning workspace context and health-checking the system.

✓ Persistence & Privilege

always:false and user-invocable: true. The only persistent change performed by the provided scripts is installation of a repo-local git pre-commit hook. The skill does not request global/system privileges or modify other skills' configs.

How to Use

Make sure OpenClaw is installed (local or Docker)
Run the install command in chat: /install agent-safety
After installation, invoke the skill by name or use /agent-safety
Provide required inputs per the skill's parameter spec and get structured output

Version History

v1.0.0

Initial release of agent-safety: automated outbound safety tools for autonomous AI agents. - Scans output for secrets, tokens, PII, and internal paths before files leave your machine. - Includes a git pre-commit hook to block commits with API keys, passwords, or sensitive info. - Provides a pre-publish security scanner for files and directories. - Adds a health-check script for monitoring system and workspace security. - Designed for automated enforcement at the git level; does not rely on prompts.

Metadata

Slug agent-safety

Version 1.0.0

License —

All-time Installs 1

Active Installs 1

Total Versions 1

Frequently Asked Questions

What is Agent Safety?

Outbound safety for autonomous AI agents — scans YOUR output before it leaves the machine. Git pre-commit hooks that automatically block commits containing A... It is an AI Agent Skill for Claude Code / OpenClaw, with 782 downloads so far.

How do I install Agent Safety?

Run "/install agent-safety" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Agent Safety free?

Yes, Agent Safety is completely free (open-source). You can download, install and use it at no cost.

Which platforms does Agent Safety support?

Agent Safety is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Agent Safety?

It is built and maintained by compass-soul (@compass-soul); the current version is v1.0.0.

More Skills

Agent Safety

Agent Safety

Pre-Publish Security Scan

Git Pre-Commit Hook

Health Check

Rules

What is Agent Safety?

How do I install Agent Safety?

Is Agent Safety free?

Which platforms does Agent Safety support?

Who created Agent Safety?

💬 Comments