Gholamreza
commited on
Commit
•
865c948
1
Parent(s):
63f6fa8
Update README.md
Browse files
README.md
CHANGED
@@ -14,7 +14,9 @@ should probably proofread and complete it, then remove this comment. -->
|
|
14 |
|
15 |
# distilbert-fa-zwnj-base-MLM-pquad
|
16 |
|
17 |
-
This model is
|
|
|
|
|
18 |
|
19 |
## Model description
|
20 |
|
@@ -46,6 +48,7 @@ The following hyperparameters were used during training:
|
|
46 |
- mixed_precision_training: Native AMP
|
47 |
|
48 |
### Training results
|
|
|
49 |
|
50 |
|
51 |
|
|
|
14 |
|
15 |
# distilbert-fa-zwnj-base-MLM-pquad
|
16 |
|
17 |
+
This model is pretained only on the [PQuAD](https://huggingface.co/datasets/Gholamreza/pquad) dataset. for educational purposes only.
|
18 |
+
|
19 |
+
Tokenizer and base model configs are from [HooshvareLab/distilbert-fa-zwnj-base](https://huggingface.co/HooshvareLab/distilbert-fa-zwnj-base) on the generator dataset.
|
20 |
|
21 |
## Model description
|
22 |
|
|
|
48 |
- mixed_precision_training: Native AMP
|
49 |
|
50 |
### Training results
|
51 |
+
TrainOutput(global_step=31, training_loss=10.31849128969254, metrics={'train_runtime': 42.7618, 'train_samples_per_second': 188.369, 'train_steps_per_second': 0.725, 'total_flos': 263071290359808.0, 'train_loss': 10.31849128969254, 'epoch': 0.98})
|
52 |
|
53 |
|
54 |
|