adamkarvonen commited on
Commit
0ea158b
1 Parent(s): c1b888b

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -1
README.md CHANGED
@@ -13,4 +13,6 @@ Dataset descriptions:
13
 
14
  All models are trained with their inputs beginning with ";", which is also the delimiter token between games. Performance will go down if this is not used.
15
  Models with optimizers use more storage, but you can easily resume training with them. Models without optimizers use less storage and are fine for training linear probes or inference.
16
- At some point, I started including dataset as metadata in the checkpoint. Some models may not include it.
 
 
 
13
 
14
  All models are trained with their inputs beginning with ";", which is also the delimiter token between games. Performance will go down if this is not used.
15
  Models with optimizers use more storage, but you can easily resume training with them. Models without optimizers use less storage and are fine for training linear probes or inference.
16
+ At some point, I started including dataset as metadata in the checkpoint. Some models may not include it.
17
+
18
+ I also have 31 checkpoints from a training run if you are interested in investigating how skills emerge during a training run. They are located here: https://huggingface.co/adamkarvonen/chess_llm_30_checkpoints