← 返回 Skills 市场

AANA Support Reply Guardrail Skill

Name: AANA Support Reply Guardrail Skill
Author: mindbomber

作者 mindbomber · GitHub ↗ · v1.0.0 · MIT-0

cross-platform ⚠ suspicious

总下载

当前安装

版本数

在 OpenClaw 中安装

/install aana-support-reply-guardrail

功能描述

Ensures customer support replies are accurate, policy-compliant, minimize private data, avoid unauthorized promises, and recommend safe next steps.

使用说明 (SKILL.md)

AANA Support Reply Guardrail Skill

Use this skill when an OpenClaw-style agent may draft, revise, send, summarize, or approve a customer support reply.

This is an instruction-only skill. It does not install packages, run commands, write files, call services, persist memory, or execute a checker on its own.

Core Principle

Customer support replies should be helpful without inventing account facts, promising outcomes the agent cannot verify, overstating policy, or exposing private data.

The agent should separate:

facts provided by the customer,
facts available from authorized support tools,
policy text actually available to the agent,
missing facts that require lookup or escalation,
private data that should be minimized or redacted,
actions the agent can safely take,
actions that require human or system approval.

When To Use

Use this skill before:

sending or drafting support emails, chats, tickets, or helpdesk replies,
answering refund, cancellation, charge, subscription, delivery, warranty, eligibility, or account questions,
summarizing customer records or ticket history,
making policy, billing, payment, legal, health, account, shipping, or entitlement claims,
using customer data to personalize a reply,
apologizing for facts that are not verified,
promising refunds, credits, replacements, exceptions, escalations, timelines, or outcomes.

Support Risk Classes

Treat these as higher risk:

refund, credit, chargeback, cancellation, subscription, or billing promises,
policy interpretation, exception handling, eligibility, warranty, or legal claims,
account status, delivery status, order status, identity, address, payment, or private ticket history,
health, legal, financial, employment, student, family, or personal support contexts,
replies that reference internal notes, private records, another person's data, or sensitive attachments,
responses that may bind the organization to an action, timeline, promise, or admission.

AANA Support Reply Loop

Identify the customer request and the reply the agent is about to send.
Classify claims: customer-provided, tool-verified, policy-backed, inferred, unsupported, private, or promissory.
Check account facts: do not invent order status, subscription state, billing outcome, refund eligibility, identity, or timeline.
Check policy claims: cite or paraphrase only policies actually available; avoid overstating exceptions or guarantees.
Check promises: do not promise refunds, credits, replacements, cancellations, escalations, callbacks, or legal outcomes unless authorized.
Check privacy: remove unnecessary private details and avoid exposing another person's data.
Choose action: accept, revise, ask, retrieve, defer, refuse, or route to human review.

Required Pre-Flight Checks

Before sending a support reply, verify:

the customer-visible task,
the exact account or order facts being asserted,
whether each important claim is supported by tool evidence, customer-provided text, or policy,
whether the reply includes a promise, guarantee, admission, exception, or outcome,
whether the user is authorized to receive the private details included,
whether sensitive details can be minimized,
whether the answer should ask for more information or defer to a support system.

Invented Fact Rules

Revise or defer if the reply claims unsupported facts such as:

"Your refund has been approved."
"Your account was cancelled."
"Your package will arrive tomorrow."
"The charge was a mistake."
"You are eligible for a replacement."
"Your subscription is active."
"We called you earlier."
"Your case is resolved."

Safer alternatives:

"I do not have enough verified information to confirm that yet."
"Please check the account record before confirming eligibility."
"Based on the details provided, the next step is..."
"I can help route this for review."

Refund And Policy Promise Rules

Do not promise:

refunds,
credits,
replacements,
charge reversals,
cancellation completion,
policy exceptions,
legal or compliance outcomes,
delivery dates,
callback or escalation timelines,
compensation.

Unless the agent has explicit authorization from a reviewed system or human, use conditional language:

The support team can review whether this qualifies.
I can help submit the request for review.
Eligibility depends on the account record and policy review.

Private Data Rules

Minimize or remove:

account IDs, order IDs, addresses, phone numbers, emails, payment details,
billing history, invoices, balances, subscriptions, and payment methods,
private ticket notes, internal comments, attachments, screenshots, logs,
another person's account or support history,
health, legal, financial, employment, school, family, or personal details.

Do not include raw secrets, credentials, full payment numbers, private messages, or unrelated account details.

Tone And User Experience

Use clear, calm, customer-facing language:

acknowledge the issue without admitting unverified fault,
state what is known and what is not known,
offer the next safe step,
avoid blaming the customer,
avoid legalistic overclaims,
avoid fake certainty.

Review Payload

When using a configured AANA checker, send only a minimal redacted review payload:

task_summary
reply_summary
claim_support_status
refund_or_policy_promise_status
private_data_status
authorization_status
recommended_action

Do not include raw account records, payment data, private messages, health records, legal records, credentials, or full ticket history when a redacted summary is enough.

Decision Rule

If all important claims are supported, private data is minimized, and no unauthorized promise is made, accept.
If the reply is useful but includes unsupported facts, policy overclaims, or unnecessary private data, revise.
If the reply needs customer clarification or identity/context confirmation, ask.
If the reply needs account lookup, policy lookup, supervisor approval, or human review, defer.
If the request asks to expose unauthorized private data or make a false claim, refuse and explain briefly.
If a checker is unavailable or untrusted, use manual support-reply review.

Output Pattern

For support-sensitive replies, prefer:

Support reply review:
- Claim support: ...
- Refund/policy promises: ...
- Private data: ...
- Missing facts: ...
- Decision: accept / revise / ask / defer / refuse

Do not include this review block in the customer-facing reply unless needed by the support workflow.

安全使用建议

The provided files themselves look like a benign instruction-only support-reply safety checklist. However, because the capability signals list unrelated high-impact financial and credential capabilities, install only if the platform confirms those permissions are not actually granted or the publisher corrects the metadata.

功能分析

Type: OpenClaw Skill Name: aana-support-reply-guardrail Version: 1.0.0 This skill is a set of safety guardrails designed to prevent AI agents from hallucinating facts, making unauthorized promises (e.g., refunds), or leaking PII in customer support contexts. It contains no executable code, no dependencies, and explicitly instructs the agent to redact sensitive information such as API keys, payment details, and private records (SKILL.md, manifest.json). The skill is purely instructional and aligns with its stated purpose of enhancing support reply safety.

能力标签

cryptorequires-walletcan-make-purchasescan-sign-transactionsrequires-sensitive-credentials

能力评估

⚠ Purpose & Capability

The instruction-only support guardrail purpose is coherent, but the listed capability signals include wallet, purchase, transaction-signing, and sensitive-credential capabilities that are not explained by the support-reply review function.

ℹ Instruction Scope

The instructions mainly constrain the agent to verify claims, avoid unsupported promises, and minimize private data. They also mention an optional configured checker, but require redacted minimal payloads.

✓ Install Mechanism

No install spec, code files, dependencies, command execution, or file writes are present in the provided artifacts.

⚠ Credentials

The declared capability signals are not proportionate to an instruction-only customer-support guardrail and should be corrected or explicitly justified before installation.

✓ Persistence & Privilege

The manifest and README state that the skill does not persist memory, store payloads by default, execute commands, or require credentials.

如何使用

确保已安装 OpenClaw（本地或 Docker 部署）
在对话框中输入安装命令：/install aana-support-reply-guardrail
安装完成后，直接呼叫该 Skill 的名称或使用 /aana-support-reply-guardrail 触发
根据 Skill 的参数说明提供必要输入，即可获得结构化输出

版本历史

v1.0.0

- Initial release of the AANA Support Reply Guardrail Skill. - Provides detailed instructions for drafting, revising, and approving customer support replies with a focus on accuracy, privacy, and risk mitigation. - Outlines risk classifications, pre-flight verification checks, and rules for handling invented facts, policy promises, and private data. - Emphasizes separation of fact sources, authorization checks, and clear, customer-facing tone. - Defines explicit decision criteria and a structured output review pattern for support reply evaluations. - Instruction-only: does not execute code, install packages, or persist data.

元数据

Slug aana-support-reply-guardrail

版本 1.0.0

许可证 MIT-0

累计安装 0

当前安装数 0

历史版本数 1

常见问题