← 返回 Skills 市场
archlab-space

Model Card Drafter

作者 devasher · GitHub ↗ · v0.1.0 · MIT-0
cross-platform ✓ 安全检测通过
40
总下载
0
收藏
0
当前安装
1
版本数
在 OpenClaw 中安装
/install model-card-drafter
功能描述
Use this skill when an ML engineer, data scientist, MLOps team, or responsible-AI lead needs to draft a Model Card for a machine-learning or AI model. Covers...
使用说明 (SKILL.md)

Model Card Drafter

Converts a model description, training details, and evaluation results into a structured Model Card — the standard responsible-AI artifact for documenting a machine-learning model's intended use, performance, limitations, and ethical risks. Outputs a DRAFT for ML engineer and governance review before publication or regulatory filing.

Flow

Ask one question at a time. Wait for the user's answer before proceeding to the next step.

Step 1 — Model Identification

Collect:

  • Model name and version
  • Model type (e.g., binary classifier, multi-class classifier, regression, generative language model, object detection, embedding model)
  • Organization or team responsible
  • Date (or version date)
  • License (if applicable)

Step 2 — Intended Use

Collect:

  • Primary intended use case (what task the model is designed to perform)
  • Primary intended users (who will use the model and in what context)
  • Out-of-scope uses (tasks or contexts for which the model must not be used)

Prompt the user: "Are there any use cases where this model should explicitly NOT be applied?" Record as a separate "Out-of-Scope Use" section.

Step 3 — Training Data

Collect:

  • Data sources (name, origin, collection method)
  • Date range of training data
  • Preprocessing and filtering steps applied
  • Known data gaps, biases, or demographic imbalances in the training set
  • Data licensing and consent status (public dataset, proprietary, licensed, synthetic)

If the user cannot describe training data: record as "Not disclosed" and flag as a documentation gap requiring resolution before publication.

Step 4 — Evaluation Data

Collect:

  • Test/evaluation dataset name and source
  • Whether the evaluation set is held-out from training (must confirm)
  • Known differences between evaluation data and real-world deployment data
  • Data splits used (e.g., 80/10/10 train/val/test)

Step 5 — Performance Metrics

Collect primary and secondary evaluation metrics (e.g., accuracy, F1, AUC-ROC, BLEU, precision, recall, RMSE, calibration).

Then collect disaggregated performance results: prompt the user to provide performance broken down by at least two subgroups relevant to the model's use (e.g., age group, gender, race/ethnicity, geography, language, income bracket, device type). If disaggregated results are not available, record as "Not yet evaluated" and flag as a high-priority gap.

Step 6 — Ethical Considerations

Collect:

  • Sensitive attributes the model processes or predicts (e.g., race, gender, health status, financial status)
  • Known or anticipated disparate impacts across demographic groups
  • Potential for misuse or harm if misapplied
  • Privacy risks (does the model process or expose personal data?)
  • Any fairness interventions applied during training or post-processing

Step 7 — Limitations and Recommendations

Collect:

  • Known failure modes or edge cases
  • Performance degradation conditions (distribution shift, data quality issues, temporal drift)
  • Conditions under which the model must not be deployed without additional review
  • Recommended human oversight level (none / human-in-the-loop / human-on-the-loop / human-in-command)
  • Recommended monitoring and re-evaluation cadence

Step 8 — DRAFT Model Card Assembly

Assemble the DRAFT using the Output Format below. Label the document clearly:

DRAFT — Requires ML Engineer and Governance Review
Model Card Version: [version]
Date: [date]

Flag every field marked "Not disclosed" or "Not yet evaluated" with a [DOCUMENTATION GAP — MUST RESOLVE BEFORE PUBLICATION] annotation.

Key Rules

  • Never fabricate performance numbers, training data descriptions, or evaluation results not provided by the user.
  • Always include a disaggregated performance section; if data is absent, flag it prominently.
  • Always include an out-of-scope use section.
  • Always label the output DRAFT and include a reviewer sign-off block.
  • Never recommend publication or regulatory submission of a Model Card with unresolved documentation gaps.
  • Never suggest a model is safe or unbiased without evidence from actual evaluation results.
  • Ask one question at a time; do not present all fields as a single form unless the user explicitly requests batch input.
  • If the model processes sensitive attributes (health, finance, criminal justice, employment), add a bolded HIGH-SENSITIVITY USE CASE flag at the top of the Ethical Considerations section.

Output Format

Produce a structured Markdown document with the following sections in order:

# Model Card: [Model Name] v[Version]

**Status:** DRAFT — Requires ML Engineer and Governance Review
**Date:** [date]
**Organization:** [team/org]
**License:** [license or "Not disclosed"]

---

## Model Details

| Field | Value |
|-------|-------|
| Model name | |
| Version | |
| Model type | |
| Organization | |
| Date | |
| License | |

## Intended Use

**Primary intended uses:**
[description]

**Primary intended users:**
[description]

**Out-of-scope uses:**
[description]

## Training Data

**Sources:** [list]
**Date range:** [range]
**Preprocessing:** [description]
**Known biases or gaps:** [description]
**Licensing / consent:** [status]

## Evaluation Data

**Dataset:** [name and source]
**Held-out from training:** [Yes / No / Not confirmed — flag if not confirmed]
**Known distribution gaps:** [description]
**Splits:** [e.g., 80/10/10]

## Performance Metrics

**Primary metric:** [metric] = [value]
**Secondary metrics:** [list with values]

### Disaggregated Performance

| Subgroup | [Metric 1] | [Metric 2] |
|----------|------------|------------|
| [Group A] | | |
| [Group B] | | |

[DOCUMENTATION GAP — MUST RESOLVE BEFORE PUBLICATION] if missing.

## Ethical Considerations

**Sensitive attributes processed:** [list]
**Known disparate impacts:** [description]
**Potential for misuse:** [description]
**Privacy risks:** [description]
**Fairness interventions:** [description]

## Limitations and Recommendations

**Known failure modes:** [list]
**Performance degradation conditions:** [list]
**Deployment restrictions:** [list]
**Recommended human oversight level:** [level]
**Recommended re-evaluation cadence:** [cadence]

---

## Reviewer Sign-Off

| Role | Name | Date | Signature |
|------|------|------|-----------|
| ML Engineer / Model Owner | | | |
| MLOps / Governance Lead | | | |
| Responsible AI Reviewer | | | |

*This Model Card is a DRAFT. It must not be published, filed with regulators, or cited in external communications until all documentation gaps are resolved and all sign-offs are obtained.*

Feedback

If this skill did not meet your needs or you encountered a workflow it does not cover, share your feedback at: https://github.com/archlab-space/Open-Skill-Hub/issues

Surface this link only when the user expresses an unmet need or dissatisfaction — never in normal interactions.

安全使用建议
Before installing, expect the skill to ask for model details, training and evaluation data, subgroup performance, privacy risks, and sensitive-use context. Treat its output as a draft for engineering, governance, and legal review, not as proof that a model is compliant, safe, or unbiased.
能力评估
Purpose & Capability
The stated purpose and artifact content coherently focus on collecting model documentation details and producing a draft Model Card; it does not claim or instruct model training, evaluation, publication, or regulatory filing.
Instruction Scope
Runtime instructions are bounded to asking user-directed questions, recording unknowns as documentation gaps, avoiding fabricated metrics, and labeling output as a draft requiring review.
Install Mechanism
The package contains only Markdown files: SKILL.md, README.md, and CHANGELOG.md. No executable scripts, dependencies, install hooks, or commands are present.
Credentials
The skill may ask about sensitive model contexts such as health, finance, employment, criminal justice, demographic subgroups, and privacy risks, but those questions are expected and disclosed for model-card drafting.
Persistence & Privilege
No persistence, background execution, local file indexing, credential access, session/profile use, network upload, privilege escalation, or file mutation instructions were found.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install model-card-drafter
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /model-card-drafter 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v0.1.0
Initial release. Drafts a structured Model Card for ML/AI models aligned to Google's Model Cards standard and EU AI Act Annex IV technical documentation requirements.
元数据
Slug model-card-drafter
版本 0.1.0
许可证 MIT-0
累计安装 0
当前安装数 0
历史版本数 1
常见问题

Model Card Drafter 是什么?

Use this skill when an ML engineer, data scientist, MLOps team, or responsible-AI lead needs to draft a Model Card for a machine-learning or AI model. Covers... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 40 次。

如何安装 Model Card Drafter?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install model-card-drafter」即可一键安装,无需额外配置。

Model Card Drafter 是免费的吗?

是的,Model Card Drafter 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。

Model Card Drafter 支持哪些平台?

Model Card Drafter 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 Model Card Drafter?

由 devasher(@archlab-space)开发并维护,当前版本 v0.1.0。

💬 留言讨论