adamkarvonen/chess_llms

adamkarvonen committed on Jan 22

Commit 0ea158b • 1 Parent(s): c1b888b

Update README.md

Files changed (1)

README.md +3 -1

@@ -13,4 +13,6 @@ Dataset descriptions:
 All models are trained with their inputs beginning with ";", which is also the delimiter token between games. Performance will go down if this is not used.
 Models with optimizers use more storage, but you can easily resume training with them. Models without optimizers use less storage and are fine for training linear probes or inference.
-At some point, I started including dataset as metadata in the checkpoint. Some models may not include it.

+At some point, I started including dataset as metadata in the checkpoint. Some models may not include it.
+I also have 31 checkpoints from a training run if you are interested in investigating how skills emerge during a training run. They are located here: https://huggingface.co/adamkarvonen/chess_llm_30_checkpoints
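
The README's note about the ";" delimiter can be illustrated with a small sketch. This is not code from the repository: the helper names `with_game_delimiter` and `join_games`, and the move-text format used in the example, are assumptions made purely for illustration. The only fact taken from the README is that inputs should begin with ";" and that ";" also separates games.

```python
# Minimal sketch (hypothetical helpers, not part of the chess_llms repo).
# Per the README, ";" starts every model input and also delimits games.
DELIMITER = ";"


def with_game_delimiter(prompt: str) -> str:
    """Return the prompt unchanged if it already starts with ';', else prepend it."""
    return prompt if prompt.startswith(DELIMITER) else DELIMITER + prompt


def join_games(games: list[str]) -> str:
    """Concatenate several game transcripts so that each one is preceded by ';'."""
    return "".join(with_game_delimiter(game) for game in games)


# The move-text format below is an assumption for illustration only.
example_prompt = with_game_delimiter("1.e4 e5 2.Nf3")
```

Applying a helper like this just before tokenization would keep inference inputs consistent with how the README says the models were trained.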