← 返回 Skills 市场

Bayesian Reasoning

Name: Bayesian Reasoning
Author: deciqai

作者 deciqAI · GitHub ↗ · v1.0.0 · MIT-0

cross-platform ✓ 安全检测通过

总下载

当前安装

版本数

在 OpenClaw 中安装

/install deciqai-bayesian-reasoning

功能描述

Activate when: user says 'Bayesian', 'prior', 'posterior', 'base rate', 'likelihood ratio', or 'update my belief'; someone treats 'the evidence is consistent...

使用说明 (SKILL.md)

Bayesian Reasoning

Overview

Bayes' theorem: Posterior odds = Prior odds × Likelihood ratio. The strength of belief after evidence equals the strength before, multiplied by how diagnostic the evidence is.

This skill applies Bayesian discipline where people reason about probabilities informally — and failures follow predictable patterns: ignoring the base rate (prior), confusing P(E|H) with P(H|E) (prosecutor's fallacy), over-updating on vivid confirming evidence, treating correlated evidence as independent.

Composes with probabilistic-thinking (Bayes is the operational engine), critical-thinking (formalizes considering alternatives), logical-fallacies (prosecutor's fallacy and base-rate neglect), and first-principles (the prior is bedrock).

When to Use

High-stakes decision rests on interpreting evidence (medical test, security alert, fraud flag, hiring signal, A/B result)
"Evidence is consistent with X" is being treated as proof of X
Base rates ignored — a rare event treated as probable because evidence "looks like" it
Correlated evidence pieces treated as independent updates
Someone says "Bayesian," "prior," "posterior," "base rate," "likelihood ratio," "update"

Not when: genuinely deterministic; no data to anchor a prior; cost of formal update exceeds the value of being more right.

Coaching Novices (Adaptive Front Door)

In Coach mode, respond one step at a time. Each [WAIT] is a hard stop — output that step's question and nothing more.

One-line: before believing what the evidence says, ask how common this situation is to start with (prior) and how often evidence would look this way even if the hypothesis is wrong.
Check fit. Deterministic problems / no prior anchor → not this lens.
Elicit the real claim and the evidence. What exactly are you deciding, and what evidence do you have?

[WAIT — do not advance until user responds]

One question at a time: what's the base rate? what would evidence look like if the hypothesis were false? how much should that change your belief?

[WAIT — do not advance until user responds]

Close: specific posterior probability, plus what next evidence would change it most.

[WAIT — do not advance until user responds]

The Process

Step 1 — Name hypothesis and alternatives. H vs. not-H must be exhaustive.

Step 2 — Anchor the prior before evidence. P(H) = base rate in the relevant population. Most failures happen here. Examples: disease prevalence (\x3C0.1% for rare conditions); historical fraction of great hires (20-40%); alerts that proved real (\x3C5%); Series A → $1B outcomes (~5%).

Step 3 — Estimate the likelihood ratio. LR = P(E|H) / P(E|not-H). LR > 1 supports H; LR \x3C 1 supports not-H; LR ≈ 1 is non-diagnostic. If you cannot articulate P(E|not-H), you have half the story.

Step 4 — Compute the posterior. Prior odds × LR = Posterior odds → convert back to probability. Example: 0.1% prevalence, LR = 99 → posterior ≈ 9%. A "highly accurate" test on a rare disease still gives 91% chance of no disease on a positive result.

Step 5 — Check evidence dependence. Correlated evidence (three witnesses from the same source) should be treated as ~one piece, not compounded.

Step 6 — Commit and act. State the posterior number, the action threshold, and what next evidence would most move it.

Output: Bayesian Update

# Bayesian Update: \x3Cdecision>
H / not-H:
Prior P(H):          Source:
Evidence E:
P(E|H):              P(E|not-H):          LR:
Posterior P(H|E):    Interpretation:
Independence check:
Decision threshold:  Action:  Next evidence:

→ Method in Action: Sally Clark Case (1999)

Pack: Common Bayesian Settings

A "pack" bundles the most common prior and LR patterns for a domain.

Setting	Common prior	Common LR failure
Medical screening	Population prevalence	Treating sensitivity as posterior
Security alert triage	Fraction of alerts that were real	"Matches signature" = "is the threat"
Hiring	Historical fraction of great hires in this role	One strong interview = "great candidate"
A/B test	Prior probability the change has a real effect	"p \x3C 0.05" without prior

Contribute a pack for your domain — see the template at the repo root.

→ Sources: references/sources.md

Common Rationalizations

Note — [D] = designed upfront | [O] = observed in real use. [O] entries are more valuable.

Fake move	Reality
[D] "This looks just like X"	Not enough. How often does it look like X when it isn't? Without LR, you have rhetoric.
[D] Ignoring base rate because "this case feels different"	The base rate captures everything you don't know. "Feels different" is already included.
[D] Confusing P(E\|H) with P(H\|E)	The prosecutor's fallacy. They can differ by orders of magnitude.
[D] Treating correlated evidence as independent	Multiplying correlated likelihoods overstates the update. Identify common causes.
[D] Updating intuitively without numbers	Intuitive updates are miscalibrated — too strong on confirming, too weak on disconfirming.
[D] Picking the prior after seeing the evidence	Hindsight contamination. Commit to the prior before evidence.
[D] "I'm doing a Bayesian update" without naming P(H), P(E\|H), P(E\|not-H)	Then you are not. You are using the word.
[D] Absence of evidence = evidence of absence	Depends on P(E\|H) and P(E\|not-H). If evidence is rarely observable, absence is weak.
[D] Posterior keeps drifting toward H every round without calibration check	Either H is increasingly likely or there is a confirmation-bias leak.
To add [O] entries: paste a real failure instance here after each production use	Description of what happened

Red Flags

Prior never named · "Consistent with X" treated as proof of X · Sensitivity without false-positive rate · Correlated evidence summed as independent · Posterior matches vivid story not math · "Highly likely" without a number · Prior chosen after evidence · "Bayesian" invoked without an actual update

Verification

Hypothesis and alternatives explicitly named and exhaustive
Prior named with a number and source, before evidence is examined
P(E|H) AND P(E|not-H) both estimated · Likelihood ratio computed
Posterior computed numerically · Independence of evidence pieces checked
Decision threshold and action stated · Next evidence identified

Part of deciqAI Knowledge Skills — open-source thinking skills that make rigor executable for AI agents. These five skills are a free taste of the 130+ skills wired into every deciqAI agent, which runs them autonomously to operate your company. Try it free → https://www.deciqai.com/skills?utm_source=skill&utm_medium=oss&utm_campaign=knowledge-skills&utm_content=bayesian-reasoning · Built by deciqAI · github.com/deciqAI · Contributions welcome.

安全使用建议

Install if you want a Bayesian reasoning coaching aid. Be aware it may activate too easily on the word "update," and treat outputs as reasoning support rather than professional medical, legal, security, or hiring advice.

能力评估

✓ Purpose & Capability

The files coherently provide a Bayesian reasoning framework, examples, and sources for interpreting evidence, priors, likelihood ratios, and base-rate errors.

ℹ Instruction Scope

One runtime trigger uses the standalone word "update," which is broader than the Bayesian context and could activate the skill in unrelated conversations, but it does not grant sensitive authority or cause hidden actions.

✓ Install Mechanism

The artifact contains only markdown files and no install scripts, executable code, package dependencies, or setup steps.

✓ Credentials

No environment variables, network calls, local file reads beyond the skill content, credentials, or external services are requested.

✓ Persistence & Privilege

No persistence, background execution, privilege escalation, mutation authority, credential-store access, or broad local indexing is present.

如何使用

确保已安装 OpenClaw（本地或 Docker 部署）
在对话框中输入安装命令：/install deciqai-bayesian-reasoning
安装完成后，直接呼叫该 Skill 的名称或使用 /deciqai-bayesian-reasoning 触发
根据 Skill 的参数说明提供必要输入，即可获得结构化输出

版本历史

v1.0.0

**Initial release of the Bayesian Reasoning skill.** - Guides users to apply Bayes' theorem to real-world decision-making, highlighting common errors (base rate neglect, prosecutor's fallacy, evidence dependence). - Details when and how to use Bayesian reasoning, with clear exclusions for deterministic/no-data scenarios. - Provides an adaptive, stepwise coaching process for novices, including WAIT prompts for interactive learning. - Supplies a structured update template and domain-specific "packs" for frequent Bayesian settings (medicine, security, hiring, A/B tests). - Lists common rationalization failures and verification steps to ensure Bayesian rigor. - Integrates with related skills: probabilistic thinking, critical thinking, logical fallacies, and first-principles reasoning.

元数据

Slug deciqai-bayesian-reasoning

版本 1.0.0

许可证 MIT-0

累计安装 0

当前安装数 0

历史版本数 1

常见问题