adamkarvonen
commited on
Commit
•
0ea158b
1
Parent(s):
c1b888b
Update README.md
Browse files
README.md
CHANGED
@@ -13,4 +13,6 @@ Dataset descriptions:
|
|
13 |
|
14 |
All models are trained with their inputs beginning with ";", which is also the delimiter token between games. Performance will go down if this is not used.
|
15 |
Models with optimizers use more storage, but you can easily resume training with them. Models without optimizers use less storage and are fine for training linear probes or inference.
|
16 |
-
At some point, I started including dataset as metadata in the checkpoint. Some models may not include it.
|
|
|
|
|
|
13 |
|
14 |
All models are trained with their inputs beginning with ";", which is also the delimiter token between games. Performance will go down if this is not used.
|
15 |
Models with optimizers use more storage, but you can easily resume training with them. Models without optimizers use less storage and are fine for training linear probes or inference.
|
16 |
+
At some point, I started including dataset as metadata in the checkpoint. Some models may not include it.
|
17 |
+
|
18 |
+
I also have 31 checkpoints from a training run if you are interested in investigating how skills emerge during a training run. They are located here: https://huggingface.co/adamkarvonen/chess_llm_30_checkpoints
|