← Back to Skills Marketplace
84
Downloads
0
Stars
0
Active Installs
1
Versions
Install in OpenClaw
/install mhc-layer-impl-nanogpt-training
Description
Train GPT-2 scale models (~124M parameters) efficiently on a single GPU. Covers GPT-124M architecture, tokenized dataset loading (e.g., HuggingFace Hub shard...
Usage Guidance
This skill is a coherent, textual training guide that appears safe to inspect and use. Before running: (1) review dataset licenses (downloading large token shards can have legal/ethical implications); (2) run initial experiments on a tiny subset to validate code and resource usage; (3) be aware of resource/cost implications when using GPU clouds (Modal examples request A100); (4) only provide HF/GitHub credentials if you intentionally access private repos; and (5) if you plan to execute code from untrusted sources, do so in isolated environments (containers) and inspect code snippets carefully for any modifications before running.
Capability Analysis
Type: OpenClaw Skill
Name: mhc-layer-impl-nanogpt-training
Version: 0.1.0
The skill bundle provides a comprehensive and legitimate implementation for training GPT-2 scale models using PyTorch and Modal. The code follows standard practices from well-known community projects (e.g., Karpathy's nanoGPT), including implementations for the Muon optimizer, Rotary Positional Embeddings, and memory-mapped data loading from HuggingFace Hub. No evidence of data exfiltration, malicious execution, or prompt injection was found; all network activity is limited to downloading public datasets from HuggingFace.
Capability Assessment
Purpose & Capability
The name/description (nanogpt training) match the contents: model architecture, tokenized dataset loading, optimizers, and a training loop. Required tooling referenced in SKILL.md (torch, huggingface_hub, einops, numpy) is exactly what you'd expect for this task.
Instruction Scope
Runtime instructions stay on-topic: they show how to download public HF token shards, build datasets via memmap, construct the model, and run mixed-precision training. There are no instructions to read unrelated system files, harvest environment variables, or call endpoints outside the expected external services (HuggingFace/GitHub).
Install Mechanism
This is instruction-only (no install spec). The SKILL.md suggests pip installing common ML packages; that's appropriate and proportional. No archives or remote executables are fetched beyond public Python packages and dataset files from HuggingFace.
Credentials
No environment variables, credentials, or config paths are required. The dataset downloads reference public repos (no auth). If you later point it at a private HF repo, HF credentials would be needed — but the skill itself does not request them.
Persistence & Privilege
always is false and the skill does not request any special persistent privileges or modifications to other skills. Autonomous invocation is allowed (platform default) but not combined with problematic privileges.
How to Use
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat:
/install mhc-layer-impl-nanogpt-training - After installation, invoke the skill by name or use
/mhc-layer-impl-nanogpt-training - Provide required inputs per the skill's parameter spec and get structured output
Version History
v0.1.0
Bulk publish from all-task-skills-dedup
Metadata
Frequently Asked Questions
What is nanogpt-training?
Train GPT-2 scale models (~124M parameters) efficiently on a single GPU. Covers GPT-124M architecture, tokenized dataset loading (e.g., HuggingFace Hub shard... It is an AI Agent Skill for Claude Code / OpenClaw, with 84 downloads so far.
How do I install nanogpt-training?
Run "/install mhc-layer-impl-nanogpt-training" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.
Is nanogpt-training free?
Yes, nanogpt-training is completely free, licensed under MIT-0. You can download, install and use it at no cost.
Which platforms does nanogpt-training support?
nanogpt-training is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).
Who created nanogpt-training?
It is built and maintained by lnj22 (@lnj22); the current version is v0.1.0.
More Skills