Saving weights and log at step 1000000
Browse files- README.md +1 -2
- flax_model.msgpack +1 -1
- opt_state.msgpack +1 -1
- pytorch_model.bin +1 -1
- runs/events.out.tfevents.1641055391.t1v-n-f9cfcc28-w-0.112189.0.v2 +2 -2
- training_state.json +1 -1
README.md
CHANGED
@@ -27,13 +27,12 @@ Tokenizer:
|
|
27 |
Training details:
|
28 |
|
29 |
* Training started on step 360K (bs 16) ppl 21 of earlier model trained with Adam optimizer.
|
30 |
-
* Training at step
|
31 |
* Block size: 512
|
32 |
* Optimizer: adafactor
|
33 |
* Learning rate: 3.3e-5
|
34 |
* Batch size: 32
|
35 |
* Warmup steps: 5000
|
36 |
-
* Weight decay: 0.01
|
37 |
|
38 |
Work in progress. Dec 2021-Jan2022
|
39 |
|
|
|
27 |
Training details:
|
28 |
|
29 |
* Training started on step 360K (bs 16) ppl 21 of earlier model trained with Adam optimizer.
|
30 |
+
* Training at step 1000K of 2082K (48%) ppl 15,2
|
31 |
* Block size: 512
|
32 |
* Optimizer: adafactor
|
33 |
* Learning rate: 3.3e-5
|
34 |
* Batch size: 32
|
35 |
* Warmup steps: 5000
|
|
|
36 |
|
37 |
Work in progress. Dec 2021-Jan2022
|
38 |
|
flax_model.msgpack
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 3096134690
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:2d77af2bdb5c6b7fbeae905633a8986e9c97140208268cd3a34909ebf30b1536
|
3 |
size 3096134690
|
opt_state.msgpack
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 5611008
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:751daf3e7e3c5efa79ffa8a1cece06bd7cdb4fc23d141b13c64e75b61c0d4f48
|
3 |
size 5611008
|
pytorch_model.bin
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 3134045897
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:8461ae6051ce46f98cdf810433154389d1871441325eda9f8d4380e0e688be54
|
3 |
size 3134045897
|
runs/events.out.tfevents.1641055391.t1v-n-f9cfcc28-w-0.112189.0.v2
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
-
size
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:391e0abb9a67782191b2beb53ef7eef37caca761eec84e5f63d2000ee7a76404
|
3 |
+
size 149388427
|
training_state.json
CHANGED
@@ -1 +1 @@
|
|
1 |
-
{"step":
|
|
|
1 |
+
{"step": 1000001}
|