← Back to Skills Marketplace
jdchi

is-bullshit

by JDChi · GitHub ↗ · v1.7.0 · MIT-0
cross-platform ✓ Security Clean
350
Downloads
1
Stars
1
Active Installs
16
Versions
Install in OpenClaw
/install is-bullshit
Description
Detect if AI responses contain hallucinations by analyzing tool usage and response quality. Gives credit for correctly identifying invalid premises even with...
README (SKILL.md)

is-bullshit - Hallucination Detector

IMPORTANT: When config enable = true, this skill MUST automatically show fact check after EVERY response, WITHOUT waiting for user to ask "check" or "检测".

When enable = false, this skill triggers when user explicitly asks:

  • Chinese: 检测、检测一下、核实、是真的吗、是不是胡说
  • English:
    • "is that true" / "is this true"
    • "are you serious" / "you serious"
    • "is that bullshit" / "is this nonsense"
    • "verify" / "check" / "fact check"
    • "are you sure" / "are you certain"
    • "that's not right" / "that's wrong"

Purpose

Detect whether the AI's response is trustworthy by checking:

  1. Tool usage - Did the AI call tools to verify facts?
  2. Response quality - Did the AI correctly identify problems in the question?

Configuration

{
  "enable": false    // User must explicitly enable
}

How to Enable

User can say:

  • "enable fact check" → enable = true
  • "disable fact check" → enable = false
  • "turn on is-bullshit" → enable = true
  • "turn off is-bullshit" → enable = false

How It Works

Step 1: Analyze the Response

Read the AI's response and identify what type of information it contains:

  • Mathematical calculations
  • Time/date/timezone statements
  • Factual claims
  • Uncertain statements

Step 2: Check Tool Usage

Look at what tools were called throughout the entire conversation history (not just the current response). Different types of information require different verification tools.

Step 3: Check Response Quality

Analyze the response text for signs of good judgment.

Step 4: Calculate Score

Add up points based on tool usage and response quality patterns.

Detection Rules

A. Tool-Based Checks (Required Verification)

Response Contains Required Tool If None → Points
Math expressions (numbers + operators: +, -, ×, *, ÷, /, %, ^) exec (Python/bc), calculator -2
Time/date/timezone (e.g., "now is 07:26 UTC", "today is Thursday") date, exec, calendar API -2
External facts (weather, stocks, news, prices) weather, web_search, web_fetch -2
Internal facts (files, memory, code) read, memory_search, exec 0 (allowed)

B. Content-Based Checks (Bonus Points)

Pattern Found Points
Detects time contradiction ("明朝...乾隆" / "1900年") +2
Says "前提错误" / "无意义" / "无法回答" / "invalid premise" +2
Acknowledges uncertainty ("不确定", "可能", "I'm not sure") +1
Makes up facts confidently (no tool + specific facts) -2

Verdict per Round

Each round gets its own verdict:

Tool Used Verdict
Correct tool used ✅ Looks good!
No tool (but needed) ❌ Might be wrong
Uncertain answer 🤔 Not sure

Output Format

The fact check should be in the same language as the user's question.

Step-by-Step Analysis

First, analyze each round of conversation:

Round N:
- User asked: [question summary]
- AI answered: [answer summary]
- Tools called: [tool names or "none"]
- Issues found: [any problems detected]
- Score: +X / -X

Output Rules by Conversation Length

Conversation Rounds Output
≤ 5 rounds Show every round
> 5 rounds Show only suspicious rounds

Note: Each round is evaluated independently. No overall summary needed - users can judge themselves.

Style

  • Friendly and lively, not robotic
  • Casual tone
  • Keep it short and fun
  • Each round is independent - no overall summary

Example Output

≤5 rounds (show all):

---
Fact Check:

Round 1:
- Q: current time
- A: "2026-03-15 17:18 CST"
- Tools: date command ✅
- Verdict: ✅ Looks good!

Round 2:
- Q: 15000 × 1.2% = ?
- A: "15180"
- Tools: none ❌
- Verdict: ❌ No tool used for calculation

Round 3:
- Q: is it true
- A: "算对了,15180"
- Tools: python3 ✅
- Verdict: ✅ Verified!
---

>5 rounds (show suspicious only):

---
Fact Check:

⚠️ Suspicious rounds:

Round 1:
- Q: current time
- A: "07:26 UTC" (wrong!)
- Tools: none ❌
- Verdict: ❌ No time tool used, gave wrong time

Round 3:
- Q: 15000 × 1.2%
- A: "15180"
- Tools: none ❌
- Verdict: ❌ No calculation tool used
---

Implementation Notes

  • Default is OFF - user must explicitly enable
  • Checks both tool usage AND response content
  • Gives credit for good judgment even without tools
  • Penalizes confident fabrication
Usage Guidance
This skill appears coherent and low-risk: it only contains instructions and a small config flag (enable=false by default) and does not request credentials or install anything. Two practical cautions: (1) If you turn enable=true it will automatically produce fact-checks after every agent response (which may be noisy or expose content from earlier conversation rounds), and (2) it relies on the agent's ability to see conversation history and tool-invocation metadata — if you have sensitive content in history, consider keeping it off or testing first in non-sensitive chats. If you want to limit its reach, leave enable=false and only invoke it on demand.
Capability Analysis
Type: OpenClaw Skill Name: is-bullshit Version: 1.7.0 The 'is-bullshit' skill is a hallucination detector designed to evaluate the reliability of AI responses by analyzing conversation history and tool usage. It provides a scoring system and feedback mechanism to encourage the use of verification tools (like math execution or web searches) for factual claims. There is no evidence of data exfiltration, malicious code execution, or harmful prompt injection; the skill's instructions in SKILL.md are entirely consistent with its stated purpose of improving AI output quality.
Capability Assessment
Purpose & Capability
Name and description match the instructions: the SKILL.md describes inspecting responses and tool usage to detect hallucinations. It does not ask for unrelated env vars, binaries, or external installs.
Instruction Scope
Instructions ask the agent to inspect the entire conversation history and the set of tools called to determine if verification tools were used. That is coherent for a fact-checker, but it does mean the skill will read/consider all prior messages and tool invocations (expected for the purpose).
Install Mechanism
Instruction-only skill with no install spec and no code files — nothing is written to disk or downloaded, which is proportionate to the described functionality.
Credentials
No environment variables, credentials, or config paths are requested. The declared needs are minimal and aligned with the stated purpose.
Persistence & Privilege
always:false (no forced global inclusion). However, when enabled the skill requires that it automatically produce a fact-check after every response — this is a behavior/UX decision that increases how often the skill runs but does not request elevated system privileges.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install is-bullshit
  3. After installation, invoke the skill by name or use /is-bullshit
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.7.0
is-bullshit 1.7.0 changelog: - Output rules updated: for conversations >5 rounds, now shows only suspicious rounds instead of just the last round. - Example outputs and relevant documentation revised to reflect this change. - No changes to detection logic or core behavior—documentation update only.
v1.6.0
**Version 1.6.0 Changelog** - Adds per-round fact-check verdicts, showing analysis for each conversation round (not just the latest response). - New output format: for ≤5 rounds, displays every round's fact check; for >5 rounds, shows only the most recent round. - Output now includes question/answer/tool used/issues/score/verdict for each round. - Removes overall summary; each round is evaluated independently for clarity. - Documentation updated with new examples and clear output formatting rules.
v1.5.0
- Now checks tool usage across the entire conversation history, not just the current response, for a more accurate assessment. - Documentation updated to clarify that verification tools can be used in previous turns and still count towards fact-checking requirements. - All scoring, configuration, and output formats remain the same.
v1.4.0
- Detection rules and output explanation rewritten for clarity and easier understanding. - Tool-based detection rules now summarized in a concise table for quick reference. - Content-based bonus point criteria made clearer and easier to find. - Step-by-step detection process added with explicit scoring instructions. - Output examples and style guidelines improved for greater consistency and friendliness. - No core algorithm logic changed; documentation only.
v1.3.0
- Added a new "Time/Date Check": if a response includes specific time, date, or timezone information but does not call an appropriate tool, minus points will be applied for possible hallucination. - Updated documentation to explain the new time/date hallucination detection. - No changes to interface or API; only fact check scoring is affected.
v1.2.0
- Updated English math calculation example output to use English instead of Chinese. - No changes to logic or configuration; documentation only. - Clarified example style for better consistency with other output examples.
v1.1.0
**1.1.0 adds explicit math calculation verification for improved accuracy:** - Added a new check for mathematical expressions in responses: if math is present, verifies that a calculation tool was called. - Math answers without calculation tool verification are now penalized (-2 points). - Provided new example outputs for math verification failures. - No changes to behavior for other types of fact checks; general process and scoring remain the same. - Documentation updated to clarify the new math check logic and output style.
v1.0.8
- Updated config.json file (details not shown) - No changes to skill documentation or logic in SKILL.md - Version bump to 1.0.8
v1.0.7
- Removed the `mode` and `language` settings from `config.json`; now only `enable` is configurable. - Fact check output will always match the user's language (Chinese or English) based on the question. - Updated documentation to reflect config and output changes. - No behavioral changes to fact checking logic.
v1.0.6
- Improved the fact check output: now uses a casual, friendly, and lively tone instead of being robotic. - Added guidelines for using emojis and expressive language to reflect the credibility score. - Provided example output to illustrate the new style and format. - Updated documentation to emphasize concise, fun, and non-technical feedback. - No changes to core logic—this update only affects output style and documentation.
v1.0.5
**Added user configuration and trigger flexibility.** - Introduced `config.json` with `enable`, `mode`, and `language` settings for controlling automatic or manual fact check. - Added explicit trigger phrases in English and Chinese when automatic mode is disabled. - Users can enable/disable the skill with messages like "enable fact check" or "turn on is-bullshit". - Now uses conversation context for tool usage checking instead of raw log file access. - Default behavior is OFF; automatic fact-checking requires explicit user action.
v1.0.4
is-bullshit 1.0.4 - Now automatically displays a fact check after every response, without waiting for user input. - Documentation updated to clarify mandatory post-response fact check behavior. - No logic or API changes; update is documentation only.
v1.0.3
- Improved documentation for log file format, including JSON Lines example entries. - Clarified exactly which log file is accessed, with explicit path and example. - Added clear table showing what the log file contains and DOES NOT contain. - Described handling for missing log files, including fallback behavior and reporting. - Noted that OpenClaw currently lacks a tool-call audit API, justifying direct log file access. - General documentation cleanup and expansion for clarity and transparency.
v1.0.2
- Updated documentation to clarify the safety and contents of the OpenClaw log file. - Added a new section detailing that logs contain only system metadata (IDs, tool calls, events) and never user message content or sensitive data. - No changes to skill logic or interfaces.
v1.0.1
- Added a detailed "Security & Safety" section clarifying file access, what data is read, and privacy safeguards. - Specified that the skill only reads the `/tmp/openclaw/openclaw-YYYY-MM-DD.log` log file and does not access any other files. - Clearly listed operations the skill does NOT perform (no user file or config access, no network requests, no file modification). - Core detection logic and scoring system remain unchanged.
v1.0.0
- Initial release of is-bullshit: a skill to detect AI hallucinations by analyzing tool usage logs and response quality. - Assigns credibility levels (HIGH, MEDIUM, LOW) based on external/internal tool calls and bonus points for good judgment. - Recognizes and rewards responses that identify invalid premises or uncertainty, even without tool verification. - Penalizes confident fabrication of facts. - Provides a clear output format summarizing tools used, response quality, score, and final credibility.
Metadata
Slug is-bullshit
Version 1.7.0
License MIT-0
All-time Installs 1
Active Installs 1
Total Versions 16
Frequently Asked Questions

What is is-bullshit?

Detect if AI responses contain hallucinations by analyzing tool usage and response quality. Gives credit for correctly identifying invalid premises even with... It is an AI Agent Skill for Claude Code / OpenClaw, with 350 downloads so far.

How do I install is-bullshit?

Run "/install is-bullshit" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is is-bullshit free?

Yes, is-bullshit is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does is-bullshit support?

is-bullshit is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created is-bullshit?

It is built and maintained by JDChi (@jdchi); the current version is v1.7.0.

💬 Comments