← Back to Skills Marketplace
archlab-space

Model Card Drafter

by devasher · GitHub ↗ · v0.1.0 · MIT-0
cross-platform ✓ Security Clean
40
Downloads
0
Stars
0
Active Installs
1
Versions
Install in OpenClaw
/install model-card-drafter
Description
Use this skill when an ML engineer, data scientist, MLOps team, or responsible-AI lead needs to draft a Model Card for a machine-learning or AI model. Covers...
README (SKILL.md)

Model Card Drafter

Converts a model description, training details, and evaluation results into a structured Model Card — the standard responsible-AI artifact for documenting a machine-learning model's intended use, performance, limitations, and ethical risks. Outputs a DRAFT for ML engineer and governance review before publication or regulatory filing.

Flow

Ask one question at a time. Wait for the user's answer before proceeding to the next step.

Step 1 — Model Identification

Collect:

  • Model name and version
  • Model type (e.g., binary classifier, multi-class classifier, regression, generative language model, object detection, embedding model)
  • Organization or team responsible
  • Date (or version date)
  • License (if applicable)

Step 2 — Intended Use

Collect:

  • Primary intended use case (what task the model is designed to perform)
  • Primary intended users (who will use the model and in what context)
  • Out-of-scope uses (tasks or contexts for which the model must not be used)

Prompt the user: "Are there any use cases where this model should explicitly NOT be applied?" Record as a separate "Out-of-Scope Use" section.

Step 3 — Training Data

Collect:

  • Data sources (name, origin, collection method)
  • Date range of training data
  • Preprocessing and filtering steps applied
  • Known data gaps, biases, or demographic imbalances in the training set
  • Data licensing and consent status (public dataset, proprietary, licensed, synthetic)

If the user cannot describe training data: record as "Not disclosed" and flag as a documentation gap requiring resolution before publication.

Step 4 — Evaluation Data

Collect:

  • Test/evaluation dataset name and source
  • Whether the evaluation set is held-out from training (must confirm)
  • Known differences between evaluation data and real-world deployment data
  • Data splits used (e.g., 80/10/10 train/val/test)

Step 5 — Performance Metrics

Collect primary and secondary evaluation metrics (e.g., accuracy, F1, AUC-ROC, BLEU, precision, recall, RMSE, calibration).

Then collect disaggregated performance results: prompt the user to provide performance broken down by at least two subgroups relevant to the model's use (e.g., age group, gender, race/ethnicity, geography, language, income bracket, device type). If disaggregated results are not available, record as "Not yet evaluated" and flag as a high-priority gap.

Step 6 — Ethical Considerations

Collect:

  • Sensitive attributes the model processes or predicts (e.g., race, gender, health status, financial status)
  • Known or anticipated disparate impacts across demographic groups
  • Potential for misuse or harm if misapplied
  • Privacy risks (does the model process or expose personal data?)
  • Any fairness interventions applied during training or post-processing

Step 7 — Limitations and Recommendations

Collect:

  • Known failure modes or edge cases
  • Performance degradation conditions (distribution shift, data quality issues, temporal drift)
  • Conditions under which the model must not be deployed without additional review
  • Recommended human oversight level (none / human-in-the-loop / human-on-the-loop / human-in-command)
  • Recommended monitoring and re-evaluation cadence

Step 8 — DRAFT Model Card Assembly

Assemble the DRAFT using the Output Format below. Label the document clearly:

DRAFT — Requires ML Engineer and Governance Review
Model Card Version: [version]
Date: [date]

Flag every field marked "Not disclosed" or "Not yet evaluated" with a [DOCUMENTATION GAP — MUST RESOLVE BEFORE PUBLICATION] annotation.

Key Rules

  • Never fabricate performance numbers, training data descriptions, or evaluation results not provided by the user.
  • Always include a disaggregated performance section; if data is absent, flag it prominently.
  • Always include an out-of-scope use section.
  • Always label the output DRAFT and include a reviewer sign-off block.
  • Never recommend publication or regulatory submission of a Model Card with unresolved documentation gaps.
  • Never suggest a model is safe or unbiased without evidence from actual evaluation results.
  • Ask one question at a time; do not present all fields as a single form unless the user explicitly requests batch input.
  • If the model processes sensitive attributes (health, finance, criminal justice, employment), add a bolded HIGH-SENSITIVITY USE CASE flag at the top of the Ethical Considerations section.

Output Format

Produce a structured Markdown document with the following sections in order:

# Model Card: [Model Name] v[Version]

**Status:** DRAFT — Requires ML Engineer and Governance Review
**Date:** [date]
**Organization:** [team/org]
**License:** [license or "Not disclosed"]

---

## Model Details

| Field | Value |
|-------|-------|
| Model name | |
| Version | |
| Model type | |
| Organization | |
| Date | |
| License | |

## Intended Use

**Primary intended uses:**
[description]

**Primary intended users:**
[description]

**Out-of-scope uses:**
[description]

## Training Data

**Sources:** [list]
**Date range:** [range]
**Preprocessing:** [description]
**Known biases or gaps:** [description]
**Licensing / consent:** [status]

## Evaluation Data

**Dataset:** [name and source]
**Held-out from training:** [Yes / No / Not confirmed — flag if not confirmed]
**Known distribution gaps:** [description]
**Splits:** [e.g., 80/10/10]

## Performance Metrics

**Primary metric:** [metric] = [value]
**Secondary metrics:** [list with values]

### Disaggregated Performance

| Subgroup | [Metric 1] | [Metric 2] |
|----------|------------|------------|
| [Group A] | | |
| [Group B] | | |

[DOCUMENTATION GAP — MUST RESOLVE BEFORE PUBLICATION] if missing.

## Ethical Considerations

**Sensitive attributes processed:** [list]
**Known disparate impacts:** [description]
**Potential for misuse:** [description]
**Privacy risks:** [description]
**Fairness interventions:** [description]

## Limitations and Recommendations

**Known failure modes:** [list]
**Performance degradation conditions:** [list]
**Deployment restrictions:** [list]
**Recommended human oversight level:** [level]
**Recommended re-evaluation cadence:** [cadence]

---

## Reviewer Sign-Off

| Role | Name | Date | Signature |
|------|------|------|-----------|
| ML Engineer / Model Owner | | | |
| MLOps / Governance Lead | | | |
| Responsible AI Reviewer | | | |

*This Model Card is a DRAFT. It must not be published, filed with regulators, or cited in external communications until all documentation gaps are resolved and all sign-offs are obtained.*

Feedback

If this skill did not meet your needs or you encountered a workflow it does not cover, share your feedback at: https://github.com/archlab-space/Open-Skill-Hub/issues

Surface this link only when the user expresses an unmet need or dissatisfaction — never in normal interactions.

Usage Guidance
Before installing, expect the skill to ask for model details, training and evaluation data, subgroup performance, privacy risks, and sensitive-use context. Treat its output as a draft for engineering, governance, and legal review, not as proof that a model is compliant, safe, or unbiased.
Capability Assessment
Purpose & Capability
The stated purpose and artifact content coherently focus on collecting model documentation details and producing a draft Model Card; it does not claim or instruct model training, evaluation, publication, or regulatory filing.
Instruction Scope
Runtime instructions are bounded to asking user-directed questions, recording unknowns as documentation gaps, avoiding fabricated metrics, and labeling output as a draft requiring review.
Install Mechanism
The package contains only Markdown files: SKILL.md, README.md, and CHANGELOG.md. No executable scripts, dependencies, install hooks, or commands are present.
Credentials
The skill may ask about sensitive model contexts such as health, finance, employment, criminal justice, demographic subgroups, and privacy risks, but those questions are expected and disclosed for model-card drafting.
Persistence & Privilege
No persistence, background execution, local file indexing, credential access, session/profile use, network upload, privilege escalation, or file mutation instructions were found.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install model-card-drafter
  3. After installation, invoke the skill by name or use /model-card-drafter
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v0.1.0
Initial release. Drafts a structured Model Card for ML/AI models aligned to Google's Model Cards standard and EU AI Act Annex IV technical documentation requirements.
Metadata
Slug model-card-drafter
Version 0.1.0
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 1
Frequently Asked Questions

What is Model Card Drafter?

Use this skill when an ML engineer, data scientist, MLOps team, or responsible-AI lead needs to draft a Model Card for a machine-learning or AI model. Covers... It is an AI Agent Skill for Claude Code / OpenClaw, with 40 downloads so far.

How do I install Model Card Drafter?

Run "/install model-card-drafter" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Model Card Drafter free?

Yes, Model Card Drafter is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Model Card Drafter support?

Model Card Drafter is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Model Card Drafter?

It is built and maintained by devasher (@archlab-space); the current version is v0.1.0.

💬 Comments