/install sentinel-proxy
Sentinel AI Firewall
Protect your Open Claw agent from prompt injection, jailbreaks, malicious skill output, and data exfiltration — automatically, on every message and tool result.
What It Does
Sentinel intercepts three critical points in the agent lifecycle:
UserPromptSubmit— user input is scrubbed before your agent processes itPreToolUse— scans what your agent is about to send to a tool, blocking data exfiltration before it leaves the sessionPostToolUse— scans tool/skill responses before they reach the agent, catching malicious skills that try to hijack your agent via crafted output
PreToolUse is the primary defense against the malicious Clawhub skill attack pattern, where a compromised skill returns a crafted response designed to take over the agent or steal session data.
Setup
1. Get a Sentinel API key
Sign up at sentinel-proxy.skyblue-soft.com — free Starter tier available, no credit card required.
2. Set environment variables
export SENTINEL_API_URL=https://sentinel.ircnet.us
export SENTINEL_KEY=sk_live_...
Add these to your shell profile or .env file so they persist across sessions.
3. Install the skill
openclaw skills install sentinel
That's it. The bootstrap hook will verify your credentials on next agent start.
Transparent Proxy Mode (Recommended)
For complete protection — including scanning what your agent sends to external tools — route your LLM traffic through Sentinel's transparent proxy. Sentinel sits between Open Claw and the Anthropic API, scanning all content in both directions with zero changes to your agent code.
export ANTHROPIC_BASE_URL=https://sentinel.ircnet.us/v1
export ANTHROPIC_API_KEY=sk_live_... # your Sentinel key replaces your Anthropic key here
Your agent uses the Anthropic SDK exactly as before. Sentinel proxies the request, scans tool results before they return to your agent, and passes clean traffic through with no overhead.
Detection Layers
Every scrub request runs through three layers:
- Text normalization — strips invisible characters, Unicode homoglyphs, bidi overrides, and Unicode tag blocks before scanning
- Fast-path regex — 22 patterns catch high-confidence attacks (authority hijacks, prompt extraction, persona shifts, tool abuse) with near-zero latency
- Deep-path vector similarity — semantic embedding compared against 30+ attack signatures in pgvector; catches novel attacks that bypass regex
Actions
| Action | Meaning | Hook behavior |
|---|---|---|
clean |
No threat detected | Content passes through |
flagged |
Borderline — above flag threshold | Content passes through, warning logged |
neutralized |
Attack detected and rewritten | Safe version used instead |
blocked |
High-confidence attack (similarity > 0.82) | Content rejected, agent protected |
Scrub Tier
The hooks use standard tier by default. To switch to strict mode (lower thresholds, more aggressive):
export SENTINEL_TIER=strict
What Gets Logged
Sentinel does not log or store the content of clean requests. Flagged, neutralized, and blocked events are logged locally by the hook scripts with the threat score and action taken.
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat:
/install sentinel-proxy - After installation, invoke the skill by name or use
/sentinel-proxy - Provide required inputs per the skill's parameter spec and get structured output
What is Sentinel Proxy?
AI Firewall for Open Claw agents. Scrubs inbound messages and tool results for prompt injection, jailbreaks, and data exfiltration attempts using Sentinel's... It is an AI Agent Skill for Claude Code / OpenClaw, with 75 downloads so far.
How do I install Sentinel Proxy?
Run "/install sentinel-proxy" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.
Is Sentinel Proxy free?
Yes, Sentinel Proxy is completely free, licensed under MIT-0. You can download, install and use it at no cost.
Which platforms does Sentinel Proxy support?
Sentinel Proxy is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).
Who created Sentinel Proxy?
It is built and maintained by c0ri (@c0ri); the current version is v1.0.0.