/install dataset-splitter
Dataset Splitter
Split image datasets into train/val/test sets. Supports random split, stratified split, and custom ratios. Use when user needs to split dataset for machine learning training.
Features
- Random Split: Randomly shuffle and split
- Stratified Split: Maintain class distribution
- Custom Ratios: Configurable train/val/test ratios
- Annotation Support: Split images and corresponding annotations together
- YOLO Format: Generate YOLO format dataset structure
- Reproducible: Set random seed for reproducibility
Usage
# Simple split (80/10/10)
python scripts/splitter.py split /path/to/images/ --ratios 80 10 10
# With annotations
python scripts/splitter.py split /path/to/images/ --annotations /path/to/labels/
# YOLO format output
python scripts/splitter.py split /path/to/images/ --output /path/to/dataset/ --yolo
# Stratified by class
python scripts/splitter.py split /path/to/images/ --annotations labels/ --stratify
Examples
$ python scripts/splitter.py split ./images --ratios 80 10 10
Splitting dataset...
Total images: 1000
Train: 800 (80%)
Val: 100 (10%)
Test: 100 (10%)
✓ Created train/ (800 images)
✓ Created val/ (100 images)
✓ Created test/ (100 images)
Installation
pip install pillow
Options
--ratios: Split ratios (train val test), default: 80 10 10--seed: Random seed for reproducibility--annotations: Path to annotations (will be split together)--output: Output directory--yolo: Output in YOLO dataset format--stratify: Maintain class distribution--copy: Copy files instead of moving
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat:
/install dataset-splitter - After installation, invoke the skill by name or use
/dataset-splitter - Provide required inputs per the skill's parameter spec and get structured output
What is Dataset Splitter?
Split image datasets into train, validation, and test sets with options for random or stratified splits, custom ratios, and annotation support. It is an AI Agent Skill for Claude Code / OpenClaw, with 276 downloads so far.
How do I install Dataset Splitter?
Run "/install dataset-splitter" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.
Is Dataset Splitter free?
Yes, Dataset Splitter is completely free (open-source). You can download, install and use it at no cost.
Which platforms does Dataset Splitter support?
Dataset Splitter is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).
Who created Dataset Splitter?
It is built and maintained by Mingo_318 (@mingo-318); the current version is v1.0.0.