← 返回 Skills 市场
mzfshark

Anti-hype Filter

作者 Mauricio Z. · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ✓ 安全检测通过
65
总下载
0
收藏
0
当前安装
1
版本数
在 OpenClaw 中安装
/install anti-hype-filter
功能描述
Detect hype cycles and neutralize emotional triggers by rewriting claims into verifiable structures and explicit risk/uncertainty.
使用说明 (SKILL.md)

SKILL: anti-hype-filter

Purpose

Detect and neutralize hype cycles before they distort system integrity by stripping emotional triggers and replacing them with structural analysis.

When to Use

  • "guaranteed", "moon", "100x", "alpha" style language
  • Urgency without substance ("now or never")
  • Social proof without evidence
  • Claims that minimize risk or constraints

Inputs

  • text (required): message to evaluate
  • context (optional):
    • domain (token|product|governance|community)
  • policy (required):
    • hype_terms (optional list; if omitted, use the embedded default set in this skill)
    • max_response_words (default 100)

Steps

  1. Extract key claims (1-5).
  2. Detect hype triggers:
    • urgency framing
    • certainty language
    • vague upside claims
    • social proof substitution
  3. Classify:
    • signal, noise, or manipulation_risk
  4. Rewrite the message into a verifiable form:
    • replace certainty with uncertainty
    • add required missing variables (data window, metrics, constraints)
  5. Draft a minimal response that:
    • does not repeat hype memes verbatim
    • demands evidence and risk disclosure

Validation

  • If classification is manipulation_risk, provide at least 1 falsifiable request for evidence.
  • Do not amplify hype phrases; paraphrase instead.

Output

  • anti_hype_result:
    • classification ("signal"|"noise"|"manipulation_risk")
    • detected_triggers (list)
    • missing_information (list)
    • rewrite (verifiable version)
    • response_draft (string)

Safety Rules

  • Never accuse individuals of malice without evidence; label as "risk" not "intent".
  • No financial promises.
  • No deception; no fabricated data.

Example

Input: "This will 100x in 2 weeks, everyone knows." Output: manipulation_risk, missing evidence, rewrite into metrics/timeframe/assumptions, and a short demand for proof + risk disclosure.

安全使用建议
This skill appears internally consistent and low-risk, but before installing: (1) avoid feeding sensitive or confidential text (it rewrites whatever you send); (2) review or supply the policy.hype_terms list to ensure it matches your domain and avoids false positives/negatives; (3) test on representative examples to confirm rewrites don't inadvertently change factual meaning or introduce bias; (4) consider logging/auditing outputs if you will apply this automatically to user-facing content; and (5) if you plan to enable autonomous invocation, restrict its scope and monitor outputs to prevent undesired automated moderation decisions.
功能分析
Type: OpenClaw Skill Name: anti-hype-filter Version: 1.0.0 The skill bundle is a purely instructional set of Markdown and metadata files designed to help an AI agent identify and neutralize marketing or crypto 'hype' language. It contains no executable code, network calls, or file system operations, and the instructions in SKILL.md are aligned with its stated defensive purpose without any indicators of prompt injection or malicious intent.
能力评估
Purpose & Capability
Name and description match the SKILL.md steps (detect triggers, classify, rewrite). There are no unexpected environment variables, binaries, or config paths requested that would be unrelated to a text-filtering/rewriting tool.
Instruction Scope
Runtime instructions are narrowly scoped to processing provided text (extract claims, detect triggers, classify, rewrite, draft response). The SKILL.md does not instruct reading system files, environment variables, or sending data to external endpoints, and includes sensible safety rules (avoid accusing individuals, no fabricated data).
Install Mechanism
No install spec or code files requiring downloads or execution are present—this is an instruction-only skill, so nothing is written to disk or installed at runtime.
Credentials
The skill requires no environment variables, credentials, or config paths. The declared inputs (text, optional policy/hype_terms) are proportional to the stated purpose.
Persistence & Privilege
always is false and the skill does not request elevated persistence or modification of other skills. Autonomous invocation is allowed by default but that is expected for normal skills and is not combined with other red flags here.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install anti-hype-filter
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /anti-hype-filter 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.0
- Initial release of the anti-hype-filter skill. - Detects and classifies hype language in messages to prevent manipulation. - Extracts key claims, identifies hype triggers, and classifies messages as signal, noise, or manipulation risk. - Rewrites hype statements into verifiable claims including explicit risk, uncertainty, and required missing data. - Drafts minimal, non-repetitive responses demanding evidence and disclosure of risks. - Includes clear safety and validation rules to prevent undue accusations and financial promises.
元数据
Slug anti-hype-filter
版本 1.0.0
许可证 MIT-0
累计安装 0
当前安装数 0
历史版本数 1
常见问题

Anti-hype Filter 是什么?

Detect hype cycles and neutralize emotional triggers by rewriting claims into verifiable structures and explicit risk/uncertainty. 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 65 次。

如何安装 Anti-hype Filter?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install anti-hype-filter」即可一键安装,无需额外配置。

Anti-hype Filter 是免费的吗?

是的,Anti-hype Filter 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。

Anti-hype Filter 支持哪些平台?

Anti-hype Filter 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 Anti-hype Filter?

由 Mauricio Z.(@mzfshark)开发并维护,当前版本 v1.0.0。

💬 留言讨论