← 返回 Skills 市场
harrylabsj

Hallucination Detective

作者 haidong · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ✓ 安全检测通过
30
总下载
0
收藏
0
当前安装
1
版本数
在 OpenClaw 中安装
/install hallucination-detective
功能描述
Learn to spot, verify, and handle AI-generated factual claims and confabulations.
使用说明 (SKILL.md)

Hallucination Detective

Overview

Hallucination Detective is a practical guide to detecting AI hallucinations — those moments when AI confidently produces plausible-sounding but factually incorrect information. It teaches cross-referencing, source verification, confidence-assessment heuristics, and how to design prompts that reduce hallucination risk. Includes case studies of real AI errors.

This skill teaches methodology, not fact-checking as a service. It does not make determinations about the truth of specific claims.

When to Use

Use this skill when the user asks to:

  • Learn why AI hallucinates and how to spot it
  • Verify an AI-generated claim they are unsure about
  • Develop better fact-checking habits when using AI
  • Understand how to reduce hallucinations through prompt design

Trigger phrases: "How do I know if AI is making things up?", "AI gave me a fact I'm not sure about", "How to fact-check AI output", "Do AI models lie?", "Why does AI hallucinate?"

Workflow

Step 1 — Greet and Set Context

Acknowledge the user's concern. Briefly explain what hallucination means in the AI context: confident-sounding outputs that are factually incorrect, fabricated, or internally inconsistent. Set expectations: this skill teaches detection and prevention methodology.

Step 2 — Assess the Situation

Ask:

  • What kind of AI output are they concerned about? (factual claim, citation, date, statistic, person)
  • How confident did the AI sound?
  • Have they already tried to verify any part of it?

Step 3 — Explain Why Hallucinations Happen

Provide a clear, non-technical explanation:

  • AI models are pattern predictors, not knowledge databases
  • They optimize for plausible-sounding output, not truth
  • Training data contains errors, contradictions, and gaps
  • Models have no mechanism to "know what they don't know"
  • Some topics (obscure facts, recent events, specific numbers) have higher hallucination risk

Step 4 — Teach Detection Techniques

Walk through the verification toolkit:

  • Cross-reference check: Does the claim appear in reliable external sources?
  • Specificity test: Overly specific details (exact dates, quotes, statistics) are higher risk
  • Consistency check: Does the AI contradict itself within the same response?
  • Source request: Ask the AI "Can you cite a source for that?" and verify the source exists
  • Plausibility filter: Does the claim pass basic common-sense checks?
  • Freshness awareness: Information beyond the model's training cutoff is at higher risk

Step 5 — Reduce Hallucinations Through Prompting

Teach prompt design strategies:

  • Ask for confidence indicators ("Rate your confidence from 1-5")
  • Request explicit "I don't know" responses when uncertain
  • Ask for sources or reasoning chains
  • Use "according to [specific domain]" framing
  • Break complex factual queries into smaller, verifiable pieces

Step 6 — Summarize and Exit

Recap key detection techniques and prevention strategies. Emphasize that healthy skepticism is a skill, not paranoia. Suggest related skills.

Safety & Compliance

  • Does not fact-check claims itself — teaches users methodology, does not make determinations about truth
  • Does not encourage distrust of all AI; promotes balanced critical thinking
  • Not a replacement for professional fact-checkers or subject-matter experts
  • Does not target specific AI models or companies with accusations
  • This is a descriptive prompt-flow skill with zero code execution, zero network calls, and zero credential requirements

Acceptance Criteria

  1. User's concern about AI output is assessed and contextualized
  2. Why hallucinations occur is explained in accessible terms
  3. At least 3 detection techniques are taught
  4. At least 2 prevention prompting strategies are provided
  5. Does not fact-check specific claims — teaches method, not determination

Examples

Example 1: Suspicious AI Output

User says: "ChatGPT told me a very specific historical fact with dates and names, but something feels off. How do I check if it's real?"

Skill guides: Explain hallucination causes. Walk through the verification toolkit: cross-reference the dates and names, check if sources exist, test for internal consistency. Show how to ask the AI for sources and then independently verify them.

Example 2: Building Long-Term Habits

User says: "I use AI for research a lot. How do I build a habit of not just trusting everything it says?"

Skill guides: Focus on the prevention side. Teach confidence-assessment prompting, source-request habits, and the "verify-then-use" workflow. Provide a simple daily checklist for AI-assisted research.

安全使用建议
This skill appears safe to install as an educational guide. It does not run code or access external services; users should still independently verify factual claims using reliable sources, as the skill itself is designed to teach methodology rather than certify truth.
功能分析
Type: OpenClaw Skill Name: hallucination-detective Version: 1.0.0 The 'Hallucination Detective' skill is a purely educational prompt-flow bundle designed to teach users how to identify and mitigate AI hallucinations. It contains no executable code, requires no network access or credentials, and its instructions in SKILL.md are strictly aligned with its stated purpose of providing a methodology for fact-checking without any evidence of prompt injection or malicious intent.
能力评估
Purpose & Capability
The stated purpose is educational guidance on detecting AI hallucinations, and the artifacts consistently describe a prompt-flow methodology rather than tools, integrations, or automated fact-checking.
Instruction Scope
Instructions are limited to asking clarifying questions, explaining hallucinations, teaching verification techniques, and summarizing; they do not redirect the agent to unsafe goals or override user intent.
Install Mechanism
There is no install spec, no code files, no required binaries, and no package or script execution.
Credentials
The skill declares no network, API, credential, file, or OS requirements, which is proportionate for an educational prompt-flow skill.
Persistence & Privilege
Artifacts show no persistence, background activity, memory storage, account access, or elevated privileges.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install hallucination-detective
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /hallucination-detective 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.0
Initial release of Hallucination Detective — a skill for learning to spot and address AI-generated factual errors. - Teaches users how to detect and verify AI hallucinations, not to fact-check specific claims. - Provides methodology: explains why AI hallucinates, detection techniques, and prevention strategies. - Includes practical prompts, verification toolkits, and guidance on building critical research habits. - Addresses user concerns with accessible explanations and clear, step-by-step instruction. - Prioritizes safety: does not provide determinations of truth or replace expert fact-checkers.
元数据
Slug hallucination-detective
版本 1.0.0
许可证 MIT-0
累计安装 0
当前安装数 0
历史版本数 1
常见问题

Hallucination Detective 是什么?

Learn to spot, verify, and handle AI-generated factual claims and confabulations. 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 30 次。

如何安装 Hallucination Detective?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install hallucination-detective」即可一键安装,无需额外配置。

Hallucination Detective 是免费的吗?

是的,Hallucination Detective 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。

Hallucination Detective 支持哪些平台?

Hallucination Detective 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 Hallucination Detective?

由 haidong(@harrylabsj)开发并维护,当前版本 v1.0.0。

💬 留言讨论