← Back to Skills Marketplace

nanogpt-training

Name: nanogpt-training
Author: lnj22

by lnj22 · GitHub ↗ · v0.1.0 · MIT-0

cross-platform ✓ Security Clean

Downloads

Stars

Active Installs

Versions

Install in OpenClaw

/install mhc-layer-impl-nanogpt-training

Description

Train GPT-2 scale models (~124M parameters) efficiently on a single GPU. Covers GPT-124M architecture, tokenized dataset loading (e.g., HuggingFace Hub shard...

Usage Guidance

This skill is a coherent, textual training guide that appears safe to inspect and use. Before running: (1) review dataset licenses (downloading large token shards can have legal/ethical implications); (2) run initial experiments on a tiny subset to validate code and resource usage; (3) be aware of resource/cost implications when using GPU clouds (Modal examples request A100); (4) only provide HF/GitHub credentials if you intentionally access private repos; and (5) if you plan to execute code from untrusted sources, do so in isolated environments (containers) and inspect code snippets carefully for any modifications before running.

Capability Analysis

Type: OpenClaw Skill Name: mhc-layer-impl-nanogpt-training Version: 0.1.0 The skill bundle provides a comprehensive and legitimate implementation for training GPT-2 scale models using PyTorch and Modal. The code follows standard practices from well-known community projects (e.g., Karpathy's nanoGPT), including implementations for the Muon optimizer, Rotary Positional Embeddings, and memory-mapped data loading from HuggingFace Hub. No evidence of data exfiltration, malicious execution, or prompt injection was found; all network activity is limited to downloading public datasets from HuggingFace.

Capability Assessment

✓ Purpose & Capability

The name/description (nanogpt training) match the contents: model architecture, tokenized dataset loading, optimizers, and a training loop. Required tooling referenced in SKILL.md (torch, huggingface_hub, einops, numpy) is exactly what you'd expect for this task.

✓ Instruction Scope

Runtime instructions stay on-topic: they show how to download public HF token shards, build datasets via memmap, construct the model, and run mixed-precision training. There are no instructions to read unrelated system files, harvest environment variables, or call endpoints outside the expected external services (HuggingFace/GitHub).

✓ Install Mechanism

This is instruction-only (no install spec). The SKILL.md suggests pip installing common ML packages; that's appropriate and proportional. No archives or remote executables are fetched beyond public Python packages and dataset files from HuggingFace.

✓ Credentials

No environment variables, credentials, or config paths are required. The dataset downloads reference public repos (no auth). If you later point it at a private HF repo, HF credentials would be needed — but the skill itself does not request them.

✓ Persistence & Privilege

always is false and the skill does not request any special persistent privileges or modifications to other skills. Autonomous invocation is allowed (platform default) but not combined with problematic privileges.

How to Use

Make sure OpenClaw is installed (local or Docker)
Run the install command in chat: /install mhc-layer-impl-nanogpt-training
After installation, invoke the skill by name or use /mhc-layer-impl-nanogpt-training
Provide required inputs per the skill's parameter spec and get structured output

Version History

v0.1.0

Bulk publish from all-task-skills-dedup

Metadata

Slug mhc-layer-impl-nanogpt-training

Version 0.1.0

License MIT-0

All-time Installs 0

Active Installs 0

Total Versions 1

Frequently Asked Questions

What is nanogpt-training?

Train GPT-2 scale models (~124M parameters) efficiently on a single GPU. Covers GPT-124M architecture, tokenized dataset loading (e.g., HuggingFace Hub shard... It is an AI Agent Skill for Claude Code / OpenClaw, with 84 downloads so far.

How do I install nanogpt-training?

Run "/install mhc-layer-impl-nanogpt-training" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is nanogpt-training free?

Yes, nanogpt-training is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does nanogpt-training support?

nanogpt-training is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created nanogpt-training?

It is built and maintained by lnj22 (@lnj22); the current version is v0.1.0.

More Skills