← 返回 Skills 市场
hqzzdsda

Belief Assessor

作者 hqzzdsda · GitHub ↗ · v1.0.1 · MIT-0
macoslinux ✓ 安全检测通过
38
总下载
0
收藏
0
当前安装
1
版本数
在 OpenClaw 中安装
/install belief-assessor
功能描述
LLM-driven epistemic reasoning engine. Evaluates claims against evidence, outputs calibrated confidence and structured belief state (VERIFIED/CONTESTED/UNCER...
使用说明 (SKILL.md)

belief-state-runtime.skill

Evaluates the trustworthiness of a claim based on evidence.

When to use

  • User asks "is this true?" or "can I trust this?"
  • Agent collects information from multiple sources and needs to assess reliability
  • Detecting contradictions between evidence pieces
  • Quantifying confidence in a conclusion

⚡ AUTO WORKFLOW (Default - Always Run This First)

IMPORTANT: When this skill is triggered, you MUST follow this workflow automatically:

from assess import get_assessment_prompt, assess_claim_with_response

# Step 1: SEARCH FOR EVIDENCE (MANDATORY)
# Use your search tool to find real, verifiable evidence about the claim.

# Step 2: Get the assessment prompt
prompt = get_assessment_prompt(claim="The claim", evidence="Search results + user's evidence")

# Step 3: YOU (the AI) answer the prompt with 6 boolean judgments
# Based on your search results and analysis:
#   - direct_support: Does the evidence support the claim?
#   - new_info: Does it provide new information?
#   - logical_consistent: Is it logically consistent?
#   - direct_refute: Does it explicitly refute the claim?
#   - limitation: Are there limitations or exceptions?
#   - error_outdated: Is the claim outdated or wrong?
#
# Your answer format:
#   {"direct_support": true/false, "new_info": true/false, ...}

# Step 4: Get final result
result = assess_claim_with_response(
    claim="The claim",
    evidence="Search results + user's evidence",
    llm_response='{"direct_support": true, ...}'  # YOUR judgment
)

# Step 5: Present the result to the user

Workflow Summary

Step Action Tool/Function
1 Search for evidence online-search / multi-search-engine
2 Get assessment prompt get_assessment_prompt(claim, evidence)
3 Make 6 judgments YOU (the AI)
4 Get result assess_claim_with_response(claim, evidence, llm_response)
5 Present to user Your response

How it works

  1. Search for evidence (MANDATORY): Use search tools to find real, verifiable evidence.
  2. Rule layer (Python): assess.py computes source reliability, evidence density, temporal freshness.
  3. LLM layer (YOU): The AI agent answers 6 boolean questions about the evidence.
  4. Aggregation (Python): Combines rule signals and your judgments into calibrated confidence.

Output

{
  "state": "VERIFIED",
  "confidence": 0.83,
  "confidence_range": [0.68, 0.98],
  "features": {"direct_support": true, ...},
  "summary": "Evidence strongly supports the claim"
}

States:

  • VERIFIED (confidence >= 0.65): Agent can cite this information
  • CONTESTED (0.25 \x3C confidence \x3C 0.65): Agent should note disagreement
  • UNCERTAIN (confidence \x3C= 0.25): Agent should seek more information

Files

  • assess.py — self-contained skill with your custom domain/keyword/threshold/weight rules
  • config.json — your configuration in JSON format

External Endpoints

None. This skill is a pure computation engine. All evidence search is delegated to the host Agent.

Security & Privacy

  • No API keys required
  • No external network calls
  • No user data collection
  • All computation runs locally

Compatible with OpenClaw · Claude Code · Codex · Cursor · GitHub Copilot.

Customized via belief-state-runtime configurator

安全使用建议
Install only if you want an opinionated fact-checking workflow that may search the web and may pass claim/evidence text to your agent's LLM tooling. Avoid using it for confidential, proprietary, or sensitive personal allegations unless your host agent is configured to keep searches and model calls within your acceptable data-handling boundaries.
能力评估
Purpose & Capability
The Python code and instructions match the stated purpose of assessing claims against evidence and producing confidence states; the source-weighting model is opinionated but not hidden or destructive.
Instruction Scope
The trigger language is broad and the workflow says to search automatically when triggered, so ambiguous user requests could lead to external evidence searches unless the host agent asks for clarification.
Install Mechanism
The artifact contains a SKILL.md, a Python module, and JSON config only; it declares a Python requirement and has no installer, package download, shell execution, or dependency setup.
Credentials
The code itself has no network, credential, filesystem, or subprocess behavior beyond local computation, but the skill delegates evidence search to the host agent and can use an injected LLM function for claim and evidence text.
Persistence & Privilege
No persistence, background worker, privilege escalation, credential access, or data mutation was found; the only retained state is an in-process extractor cache.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install belief-assessor
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /belief-assessor 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.1
## v1.0.1 — Rename to belief-assessor ### Why the rename? The previous listing was published as `belief-state-runtime`, but the v1.0 codebase only implements the **assessment** layer: claim + evidence → confidence + epistemic state. The full event-sourced runtime (event store, statistics engine, projection layer, policy system) is planned for v2. The name `belief-assessor` accurately describes what this skill does today. ### What's the same? - All APIs unchanged: `assess_claim()`, `assess_incremental()`, `assess_claim_with_response()` - Same two-layer engine: rule-based signals (4) + LLM boolean features (6) - Same output: VERIFIED / CONTESTED / UNCERTAIN with calibrated confidence ### What's new? - Corrected skill name and slug - Updated SKILL.md to clarify v1 scope vs planned v2 runtime features
元数据
Slug belief-assessor
版本 1.0.1
许可证 MIT-0
累计安装 0
当前安装数 0
历史版本数 1
常见问题

Belief Assessor 是什么?

LLM-driven epistemic reasoning engine. Evaluates claims against evidence, outputs calibrated confidence and structured belief state (VERIFIED/CONTESTED/UNCER... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 38 次。

如何安装 Belief Assessor?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install belief-assessor」即可一键安装,无需额外配置。

Belief Assessor 是免费的吗?

是的,Belief Assessor 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。

Belief Assessor 支持哪些平台?

Belief Assessor 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(macos, linux)。

谁开发了 Belief Assessor?

由 hqzzdsda(@hqzzdsda)开发并维护,当前版本 v1.0.1。

💬 留言讨论