Note QLoRA-Llama-3.2-1B-chatml is fundamentally the same, but trained on a much larger dataset (21k steps).
Note 52 steps, 2 minutes to train