← 返回 Skills 市场

Model Card Drafter

Name: Model Card Drafter
Author: archlab-space

作者 devasher · GitHub ↗ · v0.1.0 · MIT-0

cross-platform ✓ 安全检测通过

总下载

当前安装

版本数

在 OpenClaw 中安装

/install model-card-drafter

功能描述

Use this skill when an ML engineer, data scientist, MLOps team, or responsible-AI lead needs to draft a Model Card for a machine-learning or AI model. Covers...

使用说明 (SKILL.md)

Model Card Drafter

Converts a model description, training details, and evaluation results into a structured Model Card — the standard responsible-AI artifact for documenting a machine-learning model's intended use, performance, limitations, and ethical risks. Outputs a DRAFT for ML engineer and governance review before publication or regulatory filing.

Flow

Ask one question at a time. Wait for the user's answer before proceeding to the next step.

Step 1 — Model Identification

Collect:

Model name and version
Model type (e.g., binary classifier, multi-class classifier, regression, generative language model, object detection, embedding model)
Organization or team responsible
Date (or version date)
License (if applicable)

Step 2 — Intended Use

Collect:

Primary intended use case (what task the model is designed to perform)
Primary intended users (who will use the model and in what context)
Out-of-scope uses (tasks or contexts for which the model must not be used)

Prompt the user: "Are there any use cases where this model should explicitly NOT be applied?" Record as a separate "Out-of-Scope Use" section.

Step 3 — Training Data

Collect:

Data sources (name, origin, collection method)
Date range of training data
Preprocessing and filtering steps applied
Known data gaps, biases, or demographic imbalances in the training set
Data licensing and consent status (public dataset, proprietary, licensed, synthetic)

If the user cannot describe training data: record as "Not disclosed" and flag as a documentation gap requiring resolution before publication.

Step 4 — Evaluation Data

Collect:

Test/evaluation dataset name and source
Whether the evaluation set is held-out from training (must confirm)
Known differences between evaluation data and real-world deployment data
Data splits used (e.g., 80/10/10 train/val/test)

Step 5 — Performance Metrics

Collect primary and secondary evaluation metrics (e.g., accuracy, F1, AUC-ROC, BLEU, precision, recall, RMSE, calibration).

Then collect disaggregated performance results: prompt the user to provide performance broken down by at least two subgroups relevant to the model's use (e.g., age group, gender, race/ethnicity, geography, language, income bracket, device type). If disaggregated results are not available, record as "Not yet evaluated" and flag as a high-priority gap.

Step 6 — Ethical Considerations

Collect:

Sensitive attributes the model processes or predicts (e.g., race, gender, health status, financial status)
Known or anticipated disparate impacts across demographic groups
Potential for misuse or harm if misapplied
Privacy risks (does the model process or expose personal data?)
Any fairness interventions applied during training or post-processing

Step 7 — Limitations and Recommendations

Collect:

Known failure modes or edge cases
Performance degradation conditions (distribution shift, data quality issues, temporal drift)
Conditions under which the model must not be deployed without additional review
Recommended human oversight level (none / human-in-the-loop / human-on-the-loop / human-in-command)
Recommended monitoring and re-evaluation cadence

Step 8 — DRAFT Model Card Assembly

Assemble the DRAFT using the Output Format below. Label the document clearly:

DRAFT — Requires ML Engineer and Governance Review
Model Card Version: [version]
Date: [date]

Flag every field marked "Not disclosed" or "Not yet evaluated" with a [DOCUMENTATION GAP — MUST RESOLVE BEFORE PUBLICATION] annotation.

Key Rules

Never fabricate performance numbers, training data descriptions, or evaluation results not provided by the user.
Always include a disaggregated performance section; if data is absent, flag it prominently.
Always include an out-of-scope use section.
Always label the output DRAFT and include a reviewer sign-off block.
Never recommend publication or regulatory submission of a Model Card with unresolved documentation gaps.
Never suggest a model is safe or unbiased without evidence from actual evaluation results.
Ask one question at a time; do not present all fields as a single form unless the user explicitly requests batch input.
If the model processes sensitive attributes (health, finance, criminal justice, employment), add a bolded HIGH-SENSITIVITY USE CASE flag at the top of the Ethical Considerations section.

Output Format

Produce a structured Markdown document with the following sections in order:

# Model Card: [Model Name] v[Version]

**Status:** DRAFT — Requires ML Engineer and Governance Review
**Date:** [date]
**Organization:** [team/org]
**License:** [license or "Not disclosed"]

---

## Model Details

| Field | Value |
|-------|-------|
| Model name | |
| Version | |
| Model type | |
| Organization | |
| Date | |
| License | |

## Intended Use

**Primary intended uses:**
[description]

**Primary intended users:**
[description]

**Out-of-scope uses:**
[description]

## Training Data

**Sources:** [list]
**Date range:** [range]
**Preprocessing:** [description]
**Known biases or gaps:** [description]
**Licensing / consent:** [status]

## Evaluation Data

**Dataset:** [name and source]
**Held-out from training:** [Yes / No / Not confirmed — flag if not confirmed]
**Known distribution gaps:** [description]
**Splits:** [e.g., 80/10/10]

## Performance Metrics

**Primary metric:** [metric] = [value]
**Secondary metrics:** [list with values]

### Disaggregated Performance

| Subgroup | [Metric 1] | [Metric 2] |
|----------|------------|------------|
| [Group A] | | |
| [Group B] | | |

[DOCUMENTATION GAP — MUST RESOLVE BEFORE PUBLICATION] if missing.

## Ethical Considerations

**Sensitive attributes processed:** [list]
**Known disparate impacts:** [description]
**Potential for misuse:** [description]
**Privacy risks:** [description]
**Fairness interventions:** [description]

## Limitations and Recommendations

**Known failure modes:** [list]
**Performance degradation conditions:** [list]
**Deployment restrictions:** [list]
**Recommended human oversight level:** [level]
**Recommended re-evaluation cadence:** [cadence]

---

## Reviewer Sign-Off

| Role | Name | Date | Signature |
|------|------|------|-----------|
| ML Engineer / Model Owner | | | |
| MLOps / Governance Lead | | | |
| Responsible AI Reviewer | | | |

*This Model Card is a DRAFT. It must not be published, filed with regulators, or cited in external communications until all documentation gaps are resolved and all sign-offs are obtained.*

Feedback

If this skill did not meet your needs or you encountered a workflow it does not cover, share your feedback at: https://github.com/archlab-space/Open-Skill-Hub/issues

Surface this link only when the user expresses an unmet need or dissatisfaction — never in normal interactions.

安全使用建议

Before installing, expect the skill to ask for model details, training and evaluation data, subgroup performance, privacy risks, and sensitive-use context. Treat its output as a draft for engineering, governance, and legal review, not as proof that a model is compliant, safe, or unbiased.

能力评估

✓ Purpose & Capability

The stated purpose and artifact content coherently focus on collecting model documentation details and producing a draft Model Card; it does not claim or instruct model training, evaluation, publication, or regulatory filing.

✓ Instruction Scope

Runtime instructions are bounded to asking user-directed questions, recording unknowns as documentation gaps, avoiding fabricated metrics, and labeling output as a draft requiring review.

✓ Install Mechanism

The package contains only Markdown files: SKILL.md, README.md, and CHANGELOG.md. No executable scripts, dependencies, install hooks, or commands are present.

ℹ Credentials

The skill may ask about sensitive model contexts such as health, finance, employment, criminal justice, demographic subgroups, and privacy risks, but those questions are expected and disclosed for model-card drafting.

✓ Persistence & Privilege

No persistence, background execution, local file indexing, credential access, session/profile use, network upload, privilege escalation, or file mutation instructions were found.

如何使用

确保已安装 OpenClaw（本地或 Docker 部署）
在对话框中输入安装命令：/install model-card-drafter
安装完成后，直接呼叫该 Skill 的名称或使用 /model-card-drafter 触发
根据 Skill 的参数说明提供必要输入，即可获得结构化输出

版本历史

v0.1.0

Initial release. Drafts a structured Model Card for ML/AI models aligned to Google's Model Cards standard and EU AI Act Annex IV technical documentation requirements.

元数据

Slug model-card-drafter

版本 0.1.0

许可证 MIT-0

累计安装 0

当前安装数 0

历史版本数 1

常见问题