← 返回 Skills 市场

Hallucination Detective

Name: Hallucination Detective
Author: harrylabsj

作者 haidong · GitHub ↗ · v1.0.0 · MIT-0

cross-platform ✓ 安全检测通过

总下载

当前安装

版本数

在 OpenClaw 中安装

/install hallucination-detective

功能描述

Learn to spot, verify, and handle AI-generated factual claims and confabulations.

使用说明 (SKILL.md)

Hallucination Detective

Overview

Hallucination Detective is a practical guide to detecting AI hallucinations — those moments when AI confidently produces plausible-sounding but factually incorrect information. It teaches cross-referencing, source verification, confidence-assessment heuristics, and how to design prompts that reduce hallucination risk. Includes case studies of real AI errors.

This skill teaches methodology, not fact-checking as a service. It does not make determinations about the truth of specific claims.

When to Use

Use this skill when the user asks to:

Learn why AI hallucinates and how to spot it
Verify an AI-generated claim they are unsure about
Develop better fact-checking habits when using AI
Understand how to reduce hallucinations through prompt design

Trigger phrases: "How do I know if AI is making things up?", "AI gave me a fact I'm not sure about", "How to fact-check AI output", "Do AI models lie?", "Why does AI hallucinate?"

Workflow

Step 1 — Greet and Set Context

Acknowledge the user's concern. Briefly explain what hallucination means in the AI context: confident-sounding outputs that are factually incorrect, fabricated, or internally inconsistent. Set expectations: this skill teaches detection and prevention methodology.

Step 2 — Assess the Situation

Ask:

What kind of AI output are they concerned about? (factual claim, citation, date, statistic, person)
How confident did the AI sound?
Have they already tried to verify any part of it?

Step 3 — Explain Why Hallucinations Happen

Provide a clear, non-technical explanation:

AI models are pattern predictors, not knowledge databases
They optimize for plausible-sounding output, not truth
Training data contains errors, contradictions, and gaps
Models have no mechanism to "know what they don't know"
Some topics (obscure facts, recent events, specific numbers) have higher hallucination risk

Step 4 — Teach Detection Techniques

Walk through the verification toolkit:

Cross-reference check: Does the claim appear in reliable external sources?
Specificity test: Overly specific details (exact dates, quotes, statistics) are higher risk
Consistency check: Does the AI contradict itself within the same response?
Source request: Ask the AI "Can you cite a source for that?" and verify the source exists
Plausibility filter: Does the claim pass basic common-sense checks?
Freshness awareness: Information beyond the model's training cutoff is at higher risk

Step 5 — Reduce Hallucinations Through Prompting

Teach prompt design strategies:

Ask for confidence indicators ("Rate your confidence from 1-5")
Request explicit "I don't know" responses when uncertain
Ask for sources or reasoning chains
Use "according to [specific domain]" framing
Break complex factual queries into smaller, verifiable pieces

Step 6 — Summarize and Exit

Recap key detection techniques and prevention strategies. Emphasize that healthy skepticism is a skill, not paranoia. Suggest related skills.

Safety & Compliance

Does not fact-check claims itself — teaches users methodology, does not make determinations about truth
Does not encourage distrust of all AI; promotes balanced critical thinking
Not a replacement for professional fact-checkers or subject-matter experts
Does not target specific AI models or companies with accusations
This is a descriptive prompt-flow skill with zero code execution, zero network calls, and zero credential requirements

Acceptance Criteria

User's concern about AI output is assessed and contextualized
Why hallucinations occur is explained in accessible terms
At least 3 detection techniques are taught
At least 2 prevention prompting strategies are provided
Does not fact-check specific claims — teaches method, not determination

Examples

Example 1: Suspicious AI Output

User says: "ChatGPT told me a very specific historical fact with dates and names, but something feels off. How do I check if it's real?"

Skill guides: Explain hallucination causes. Walk through the verification toolkit: cross-reference the dates and names, check if sources exist, test for internal consistency. Show how to ask the AI for sources and then independently verify them.

Example 2: Building Long-Term Habits

User says: "I use AI for research a lot. How do I build a habit of not just trusting everything it says?"

Skill guides: Focus on the prevention side. Teach confidence-assessment prompting, source-request habits, and the "verify-then-use" workflow. Provide a simple daily checklist for AI-assisted research.

安全使用建议

This skill appears safe to install as an educational guide. It does not run code or access external services; users should still independently verify factual claims using reliable sources, as the skill itself is designed to teach methodology rather than certify truth.

功能分析

Type: OpenClaw Skill Name: hallucination-detective Version: 1.0.0 The 'Hallucination Detective' skill is a purely educational prompt-flow bundle designed to teach users how to identify and mitigate AI hallucinations. It contains no executable code, requires no network access or credentials, and its instructions in SKILL.md are strictly aligned with its stated purpose of providing a methodology for fact-checking without any evidence of prompt injection or malicious intent.

能力评估

✓ Purpose & Capability

The stated purpose is educational guidance on detecting AI hallucinations, and the artifacts consistently describe a prompt-flow methodology rather than tools, integrations, or automated fact-checking.

✓ Instruction Scope

Instructions are limited to asking clarifying questions, explaining hallucinations, teaching verification techniques, and summarizing; they do not redirect the agent to unsafe goals or override user intent.

✓ Install Mechanism

There is no install spec, no code files, no required binaries, and no package or script execution.

✓ Credentials

The skill declares no network, API, credential, file, or OS requirements, which is proportionate for an educational prompt-flow skill.

✓ Persistence & Privilege

Artifacts show no persistence, background activity, memory storage, account access, or elevated privileges.

如何使用

确保已安装 OpenClaw（本地或 Docker 部署）
在对话框中输入安装命令：/install hallucination-detective
安装完成后，直接呼叫该 Skill 的名称或使用 /hallucination-detective 触发
根据 Skill 的参数说明提供必要输入，即可获得结构化输出

版本历史

v1.0.0

Initial release of Hallucination Detective — a skill for learning to spot and address AI-generated factual errors. - Teaches users how to detect and verify AI hallucinations, not to fact-check specific claims. - Provides methodology: explains why AI hallucinates, detection techniques, and prevention strategies. - Includes practical prompts, verification toolkits, and guidance on building critical research habits. - Addresses user concerns with accessible explanations and clear, step-by-step instruction. - Prioritizes safety: does not provide determinations of truth or replace expert fact-checkers.

元数据

Slug hallucination-detective

版本 1.0.0

许可证 MIT-0

累计安装 0

当前安装数 0

历史版本数 1

常见问题