Update README.md
Browse files
README.md
CHANGED
@@ -79,7 +79,21 @@ The following hyperparameters were used during training:
|
|
79 |
- training precision: Mixed Precision
|
80 |
|
81 |
### Training results
|
82 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
83 |
|
84 |
|
85 |
### Framework versions
|
|
|
79 |
- training precision: Mixed Precision
|
80 |
|
81 |
### Training results
|
82 |
+
```
|
83 |
+
***** train metrics *****
|
84 |
+
"epoch": 10.0,
|
85 |
+
"train_loss": 3.1787637246621623,
|
86 |
+
"train_runtime": 4372.4031,
|
87 |
+
"train_samples": 114248,
|
88 |
+
"train_samples_per_second": 261.293,
|
89 |
+
"train_steps_per_second": 0.254
|
90 |
+
|
91 |
+
***** eval metrics *****
|
92 |
+
"epoch": 10.0,
|
93 |
+
"eval_loss": 2.990234375,
|
94 |
+
"eval_samples": 240,
|
95 |
+
"perplexity": 19.89034374461794
|
96 |
+
```
|
97 |
|
98 |
|
99 |
### Framework versions
|