← 返回 Skills 市场
cutthemustard

Data Ground Truth

作者 CutTheMustard · GitHub ↗ · v1.0.1 · MIT-0
cross-platform ⚠ suspicious
313
总下载
0
收藏
0
当前安装
2
版本数
在 OpenClaw 中安装
/install data-ground-truth
功能描述
Before presenting numbers in reports or recommendations, verify facts and check values against industry baselines.
使用说明 (SKILL.md)

data-ground-truth

When presenting numbers, metrics, or statistics in reports, recommendations, or analysis — verify the facts and contextualize the figures against industry baselines. Combines verify (live fact-checking) with norm (statistical benchmarking).

When to Activate

Use this skill when:

  • Writing a report that cites specific metrics (revenue, churn, conversion rates)
  • A user shares their business numbers and asks "is this good?"
  • Comparing a metric to industry standards ("how does our 5% churn compare?")
  • Building a recommendation that depends on current market data
  • Presenting financial figures that may have changed since training
  • Analyzing a dataset and wanting to flag outliers against known baselines

Do NOT use for: opinions, qualitative assessments, or metrics with no established baseline.

Workflow

Step 1: Classify the data point

Determine whether each number is:

  • A factual claim (exchange rate, stock price, population) → route to verify
  • A business/performance metric (churn rate, NPS, response time) → route to norm
  • Both (e.g., "our conversion rate of 3.2% is above average") → check both

Step 2: Verify factual claims

For current facts (prices, rates, dates), use verify-claim.

MCP (preferred): verify_claim({ claim: "The USD to EUR exchange rate is 0.92" })

HTTP:

curl -X POST https://verify.agentutil.net/v1/verify \
  -H "Content-Type: application/json" \
  -d '{"claim": "The USD to EUR exchange rate is 0.92"}'

Handle verdicts per the verify-claim decision tree (confirmed → use, stale → update, disputed → present both sides, false → correct).

Step 3: Benchmark metrics against baselines

For business metrics, check where the value falls on the distribution.

MCP (preferred): norm_check({ category: "saas:churn_rate_monthly", value: 5.2, unit: "%" })

HTTP:

curl -X POST https://norm.agentutil.net/v1/check \
  -H "Content-Type: application/json" \
  -d '{"category": "saas:churn_rate_monthly", "value": 5.2, "unit": "%"}'

For multiple metrics at once:

curl -X POST https://norm.agentutil.net/v1/batch \
  -H "Content-Type: application/json" \
  -d '{"items": [{"category": "saas:churn_rate_monthly", "value": 5.2}, {"category": "saas:nps_score", "value": 45}]}'

Optional: add company_size (startup/smb/mid_market/enterprise) and region for more specific baselines.

Step 4: Present with context

When reporting findings, combine verification and benchmarking:

Data type How to present
Verified fact "The current [metric] is [current_truth] (verified live, [freshness])."
Benchmarked metric "[Value] is at the [percentile]th percentile — [assessment] for [category]."
Both "At [current_truth] (verified), this is [percentile]th percentile vs. industry ([baseline source])."
Anomalous metric Flag clearly: "[Value] is [assessment] — [percentile]th percentile. The typical range is [p25]-[p75]."

Assessment values from norm: very_low, low, normal, high, very_high, anomalous.

Available baseline categories

121 baselines across 14 domains. Browse with:

curl https://norm.agentutil.net/v1/categories

Common categories: saas:churn_rate_monthly, saas:nps_score, saas:ltv_cac_ratio, ecommerce:cart_abandonment_rate, infrastructure:api_latency_p99, infrastructure:uptime_percentage.

Data Handling

This skill sends claims (natural language text) and metric values (category identifiers + numbers) to two external APIs. No documents, user data, or file contents are transmitted.

Pricing

  • Verify: 25 free/day, then $0.004/query
  • Norm: free category listing, $0.002/check or $0.001/batch item
  • Full ground-truth check (verify + norm): ~$0.006 per data point

All via x402 protocol (USDC on Base). No authentication required for free tiers.

Privacy

No personal data collected. Claims cached up to 1 hour (verify), metric checks are stateless (norm). Rate limiting uses IP hashing only.

安全使用建议
This skill is coherent with its purpose but sends user-supplied claims and metric values to third-party APIs. Before installing or enabling it: 1) Confirm the vendor (agentutil.net) and review their privacy/data-retention policy. 2) Require explicit user consent before sending any numbers or claims derived from documents or datasets. 3) Sanitize/strip PII and only send minimal, necessary fields (e.g., metric name + value + category). 4) Test with non-sensitive dummy data and monitor outgoing network requests. 5) Be aware of pay-as-you-go costs and caching behavior (claims cached up to 1 hour). If you cannot guarantee that transmitted claims will never include sensitive information, treat this skill as higher risk or avoid using it.
功能分析
Type: OpenClaw Skill Name: data-ground-truth Version: 1.0.1 The data-ground-truth skill is a utility designed for fact-checking and benchmarking metrics against industry standards. It functions by sending specific claims or numeric values to external APIs hosted at agentutil.net (specifically verify.agentutil.net and norm.agentutil.net). The SKILL.md instructions are transparent about data handling, pricing, and the scope of information transmitted, with no indicators of malicious intent, unauthorized data exfiltration, or suspicious command execution.
能力评估
Purpose & Capability
Name/description match the runtime instructions: the skill verifies factual claims and benchmarks metrics by calling external verify/norm APIs. It does not request unrelated binaries, credentials, or config paths.
Instruction Scope
Instructions explicitly send natural-language claims and numeric metrics to external endpoints (verify.agentutil.net, norm.agentutil.net). The SKILL.md asserts 'No documents, user data, or file contents are transmitted', but there is no guidance to sanitize or obtain consent before sending claims extracted from user documents or datasets. That gap could lead the agent to exfiltrate sensitive or personally identifiable information inadvertently.
Install Mechanism
Instruction-only skill with no install spec or code files; nothing is written to disk and no external packages are pulled by the skill itself.
Credentials
The skill requests no environment variables or credentials (free tiers reportedly require none), which is proportionate. However it documents paid usage via 'x402' (USDC on Base) and uses remote APIs — network access allows data to leave the agent even without credentials, so lack of required env vars does not eliminate data‑leak risk.
Persistence & Privilege
always is false and the skill has no install hooks or claims to modify other skills or agent config. It is user-invocable and can be called autonomously by the agent (platform default), which is expected.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install data-ground-truth
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /data-ground-truth 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.1
Fix display name
v1.0.0
Composite fact-checking plus statistical benchmarking workflow
元数据
Slug data-ground-truth
版本 1.0.1
许可证 MIT-0
累计安装 0
当前安装数 0
历史版本数 2
常见问题

Data Ground Truth 是什么?

Before presenting numbers in reports or recommendations, verify facts and check values against industry baselines. 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 313 次。

如何安装 Data Ground Truth?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install data-ground-truth」即可一键安装,无需额外配置。

Data Ground Truth 是免费的吗?

是的,Data Ground Truth 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。

Data Ground Truth 支持哪些平台?

Data Ground Truth 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 Data Ground Truth?

由 CutTheMustard(@cutthemustard)开发并维护,当前版本 v1.0.1。

💬 留言讨论