
Prompt Hardening

Name: Prompt Hardening
Author: lanyasheng

by _silhouette · GitHub ↗ · v1.0.0 · MIT-0

cross-platform ✓ Security Clean

130 Downloads

Install in OpenClaw

/install prompt-hardening-v2

Description

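Hardens agent prompts, system prompts, SOUL.md, AGENTS.md, and cron prompts so that LLMs reliably follow instructions. Trigger phrases: agent won't obey, ignores rules, bypasses constraints, prompt optimization, instruction compliance, rule reinforcement, prompt hardening, LLM non-compliance, model violations, creative circumve...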

Usage Guidance

This skill appears to be what it claims: an advisory/audit toolkit for hardening prompts. Before installing or running it:
1) Inspect and remove or fix tests/test_smoke.py (it checks a hard-coded /Users/sly path, which is environment-specific and should not be required).
2) Run scripts/audit.sh only on copies of your prompt files to avoid accidental edits; the script only greps content, but verify that behavior locally.
3) Clarify where the 'model historical violations' data comes from. Do not grant the skill access to conversation logs, secrets, or other unrelated data unless you intend to.
4) Review references/sources.md: it cites 'leaked system prompts', and using such material may have legal or ethical implications. Vet the sources and licensing.
5) For critical enforcement (safety, code execution, or privileged actions), follow the SKILL.md guidance to pair prompt hardening with code-level (tool/hook) enforcement rather than relying on prompts alone.
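As an illustration of point 5, code-level enforcement can be as simple as an allowlist check that runs before any tool call is executed. The sketch below is a generic pattern, not OpenClaw's hook API (which this page does not describe); the names `ALLOWED_TOOLS`, `ToolCallBlocked`, and `enforce_tool_policy`, as well as the tool names, are hypothetical.

```python
# Hypothetical tool names; a real deployment would define its own policy.
ALLOWED_TOOLS = frozenset({"read_file", "search_docs"})


class ToolCallBlocked(Exception):
    """Raised when a requested tool call violates the policy."""


def enforce_tool_policy(tool_name, allowed=ALLOWED_TOOLS):
    """Reject any tool call not on the allowlist, whatever the prompt says."""
    if tool_name not in allowed:
        raise ToolCallBlocked(f"tool {tool_name!r} is not permitted")
    return tool_name
```

Because the check lives in code, a model that talks its way around a prompt rule still cannot execute a blocked tool; the hardened prompt then reduces how often the guard has to fire.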

Capability Analysis

Type: OpenClaw Skill
Name: prompt-hardening-v2
Version: 1.0.0

The bundle is a utility toolkit designed to improve LLM instruction-following through 'prompt hardening' techniques. It includes a shell script (scripts/audit.sh) for static analysis of prompt files and comprehensive documentation (SKILL.md, references/patterns.md) on defensive prompt engineering. No evidence of data exfiltration, malicious execution, or harmful prompt injection was found; the tool's logic is strictly focused on auditing and optimizing agent reliability.

Capability Assessment

ℹ Purpose & Capability

The name, description, SKILL.md, patterns, and scripts all align with 'prompt hardening' and provide audit/checklist functionality. The included scripts and references are relevant. Minor oddity: tests/test_smoke.py asserts a hard-coded local path (/Users/sly/.openclaw/skills/...), which is unrelated to functionality and likely an auto-generated or environment-specific artifact that should be removed or adjusted.
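One way to adjust such a test is to resolve the skill directory relative to the test file itself rather than to an absolute path. The following is a minimal sketch, assuming tests/ sits directly under the skill root and that SKILL.md and scripts/audit.sh are the files worth checking; the helper names `skill_root` and `missing_files` are hypothetical and not taken from the bundle.

```python
from pathlib import Path


def skill_root(test_file):
    """Return the skill's root directory, assuming tests/ sits directly under it."""
    return Path(test_file).resolve().parent.parent


def missing_files(root, expected=("SKILL.md", "scripts/audit.sh")):
    """List the expected files that are absent under root."""
    return [rel for rel in expected if not (Path(root) / rel).is_file()]
```

Inside tests/test_smoke.py, the check would then reduce to asserting that `missing_files(skill_root(__file__))` is empty, which holds wherever the skill is installed.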

ℹ Instruction Scope

SKILL.md is primarily advisory and instructs the operator to read target prompt files and run scripts/audits. It explicitly states it will not automatically modify targets and requires operator action to apply changes. One ambiguity: it recommends identifying the rules the model has historically violated ('识别模型历史违反过的规则') but does not specify where to obtain conversation history. Ensure the operator does not grant the skill access to unrelated logs or private transcripts unless intended.
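The no-modification claim can be checked locally by running any audit step against a temporary copy and comparing hashes before and after. This is a sketch under assumptions: `audit_on_copy` and `sha256_of` are hypothetical helpers, and `audit` stands for any callable that takes a file path.

```python
import hashlib
import shutil
import tempfile
from pathlib import Path


def sha256_of(path):
    """Hex SHA-256 digest of a file's bytes."""
    return hashlib.sha256(Path(path).read_bytes()).hexdigest()


def audit_on_copy(prompt_path, audit):
    """Run audit(copy_path) on a temporary copy of prompt_path.

    Returns (audit_result, was_modified), where was_modified reports whether
    the audit changed the copy. The original file is never passed to audit.
    """
    with tempfile.TemporaryDirectory() as tmp:
        copy_path = Path(tmp) / Path(prompt_path).name
        shutil.copy2(prompt_path, copy_path)
        before = sha256_of(copy_path)
        result = audit(copy_path)
        after = sha256_of(copy_path)
    return result, before != after
```

The callable could wrap a subprocess call to scripts/audit.sh; the script's exact command-line arguments are not documented on this page, so check them in the bundle first.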

✓ Install Mechanism

No install spec and no third-party downloads — the skill is instruction-heavy with a small local audit shell script and docs. This is low-risk from an installation/execution perspective. audit.sh merely reads a prompt file and checks for keywords.
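For readers unfamiliar with this kind of check, a read-only keyword scan looks roughly like the Python sketch below. It does not reproduce audit.sh, whose contents are not shown on this page; the function name `audit_prompt` and the keyword list are assumptions chosen for illustration.

```python
from pathlib import Path

# Hypothetical keywords; the checks audit.sh actually performs are not listed here.
EXAMPLE_KEYWORDS = ("MUST", "NEVER", "ALWAYS")


def audit_prompt(path, keywords=EXAMPLE_KEYWORDS):
    """Return {keyword: [line numbers]} for each keyword in the file.

    The file is only read, so the target prompt is never modified.
    """
    hits = {kw: [] for kw in keywords}
    text = Path(path).read_text(encoding="utf-8")
    for lineno, line in enumerate(text.splitlines(), start=1):
        for kw in keywords:
            if kw in line:
                hits[kw].append(lineno)
    return hits
```

A keyword with an empty list marks a pattern the prompt never uses, which is the kind of gap a checklist-style audit is meant to surface.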

✓ Credentials

The skill requests no environment variables, credentials, or config paths. The only potential concern is that references/sources.md cites a repository described as 'leaked system prompts' — this is a content provenance/ethical/legal concern, not a credential request.

✓ Persistence & Privilege

Flags are default (always:false, user-invocable:true). The skill does not request persistent presence or elevated platform privileges and the SKILL.md warns that changes should be applied manually or via another executor skill.

How to Use

Make sure OpenClaw is installed (local or Docker)
Run the install command in chat: /install prompt-hardening-v2
After installation, invoke the skill by name or use /prompt-hardening-v2
Provide required inputs per the skill's parameter spec and get structured output

Version History

v1.0.0

Initial release: harden agent prompts for reliable instruction following

Metadata

Slug prompt-hardening-v2

Version 1.0.0

License MIT-0

All-time Installs 0

Active Installs 0

Total Versions 1

Frequently Asked Questions

What is Prompt Hardening?

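Prompt Hardening hardens agent prompts, system prompts, SOUL.md, AGENTS.md, and cron prompts so that LLMs reliably follow instructions. Trigger phrases: agent won't obey, ignores rules, bypasses constraints, prompt optimization, instruction compliance, rule reinforcement, prompt hardening, LLM non-compliance, model violations, creative circumve... It is an AI Agent Skill for Claude Code / OpenClaw, with 130 downloads so far.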

How do I install Prompt Hardening?

Run "/install prompt-hardening-v2" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Prompt Hardening free?

Yes, Prompt Hardening is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Prompt Hardening support?

Prompt Hardening is cross-platform and runs anywhere OpenClaw / Claude Code is available.

Who created Prompt Hardening?

It is built and maintained by _silhouette (@lanyasheng); the current version is v1.0.0.
