/install hallucination-detective
Hallucination Detective
Overview
Hallucination Detective is a practical guide to detecting AI hallucinations — those moments when AI confidently produces plausible-sounding but factually incorrect information. It teaches cross-referencing, source verification, confidence-assessment heuristics, and how to design prompts that reduce hallucination risk. Includes case studies of real AI errors.
This skill teaches methodology, not fact-checking as a service. It does not make determinations about the truth of specific claims.
When to Use
Use this skill when the user asks to:
- Learn why AI hallucinates and how to spot it
- Verify an AI-generated claim they are unsure about
- Develop better fact-checking habits when using AI
- Understand how to reduce hallucinations through prompt design
Trigger phrases: "How do I know if AI is making things up?", "AI gave me a fact I'm not sure about", "How to fact-check AI output", "Do AI models lie?", "Why does AI hallucinate?"
Workflow
Step 1 — Greet and Set Context
Acknowledge the user's concern. Briefly explain what hallucination means in the AI context: confident-sounding outputs that are factually incorrect, fabricated, or internally inconsistent. Set expectations: this skill teaches detection and prevention methodology.
Step 2 — Assess the Situation
Ask:
- What kind of AI output are they concerned about? (factual claim, citation, date, statistic, person)
- How confident did the AI sound?
- Have they already tried to verify any part of it?
Step 3 — Explain Why Hallucinations Happen
Provide a clear, non-technical explanation:
- AI models are pattern predictors, not knowledge databases
- They optimize for plausible-sounding output, not truth
- Training data contains errors, contradictions, and gaps
- Models have no mechanism to "know what they don't know"
- Some topics (obscure facts, recent events, specific numbers) have higher hallucination risk
Step 4 — Teach Detection Techniques
Walk through the verification toolkit:
- Cross-reference check: Does the claim appear in reliable external sources?
- Specificity test: Overly specific details (exact dates, quotes, statistics) are higher risk
- Consistency check: Does the AI contradict itself within the same response?
- Source request: Ask the AI "Can you cite a source for that?" and verify the source exists
- Plausibility filter: Does the claim pass basic common-sense checks?
- Freshness awareness: Information beyond the model's training cutoff is at higher risk
Step 5 — Reduce Hallucinations Through Prompting
Teach prompt design strategies:
- Ask for confidence indicators ("Rate your confidence from 1-5")
- Request explicit "I don't know" responses when uncertain
- Ask for sources or reasoning chains
- Use "according to [specific domain]" framing
- Break complex factual queries into smaller, verifiable pieces
Step 6 — Summarize and Exit
Recap key detection techniques and prevention strategies. Emphasize that healthy skepticism is a skill, not paranoia. Suggest related skills.
Safety & Compliance
- Does not fact-check claims itself — teaches users methodology, does not make determinations about truth
- Does not encourage distrust of all AI; promotes balanced critical thinking
- Not a replacement for professional fact-checkers or subject-matter experts
- Does not target specific AI models or companies with accusations
- This is a descriptive prompt-flow skill with zero code execution, zero network calls, and zero credential requirements
Acceptance Criteria
- User's concern about AI output is assessed and contextualized
- Why hallucinations occur is explained in accessible terms
- At least 3 detection techniques are taught
- At least 2 prevention prompting strategies are provided
- Does not fact-check specific claims — teaches method, not determination
Examples
Example 1: Suspicious AI Output
User says: "ChatGPT told me a very specific historical fact with dates and names, but something feels off. How do I check if it's real?"
Skill guides: Explain hallucination causes. Walk through the verification toolkit: cross-reference the dates and names, check if sources exist, test for internal consistency. Show how to ask the AI for sources and then independently verify them.
Example 2: Building Long-Term Habits
User says: "I use AI for research a lot. How do I build a habit of not just trusting everything it says?"
Skill guides: Focus on the prevention side. Teach confidence-assessment prompting, source-request habits, and the "verify-then-use" workflow. Provide a simple daily checklist for AI-assisted research.
- 确保已安装 OpenClaw(本地或 Docker 部署)
- 在对话框中输入安装命令:
/install hallucination-detective - 安装完成后,直接呼叫该 Skill 的名称或使用
/hallucination-detective触发 - 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
Hallucination Detective 是什么?
Learn to spot, verify, and handle AI-generated factual claims and confabulations. 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 30 次。
如何安装 Hallucination Detective?
在 OpenClaw 或 Claude Code 对话框中运行命令「/install hallucination-detective」即可一键安装,无需额外配置。
Hallucination Detective 是免费的吗?
是的,Hallucination Detective 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。
Hallucination Detective 支持哪些平台?
Hallucination Detective 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。
谁开发了 Hallucination Detective?
由 haidong(@harrylabsj)开发并维护,当前版本 v1.0.0。