ZhiguangHan commited on
Commit
7add387
1 Parent(s): e126e58

End of training

Browse files
Files changed (1) hide show
  1. README.md +10 -9
README.md CHANGED
@@ -17,7 +17,7 @@ should probably proofread and complete it, then remove this comment. -->
17
 
18
  This model is a fine-tuned version of [google/mt5-small](https://huggingface.co/google/mt5-small) on the None dataset.
19
  It achieves the following results on the evaluation set:
20
- - Loss: 0.0516
21
  - Rouge1: 0.0
22
  - Rouge2: 0.0
23
  - Rougel: 0.0
@@ -46,19 +46,20 @@ The following hyperparameters were used during training:
46
  - seed: 42
47
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
48
  - lr_scheduler_type: linear
49
- - num_epochs: 7
50
 
51
  ### Training results
52
 
53
  | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum |
54
  |:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|
55
- | 2.0315 | 1.0 | 500 | 0.0727 | 0.0 | 0.0 | 0.0 | 0.0 |
56
- | 0.1434 | 2.0 | 1000 | 0.0660 | 0.0 | 0.0 | 0.0 | 0.0 |
57
- | 0.0975 | 3.0 | 1500 | 0.0613 | 0.0 | 0.0 | 0.0 | 0.0 |
58
- | 0.0822 | 4.0 | 2000 | 0.0554 | 0.0 | 0.0 | 0.0 | 0.0 |
59
- | 0.0806 | 5.0 | 2500 | 0.0543 | 0.0 | 0.0 | 0.0 | 0.0 |
60
- | 0.0745 | 6.0 | 3000 | 0.0535 | 0.0 | 0.0 | 0.0 | 0.0 |
61
- | 0.0713 | 7.0 | 3500 | 0.0516 | 0.0 | 0.0 | 0.0 | 0.0 |
 
62
 
63
 
64
  ### Framework versions
 
17
 
18
  This model is a fine-tuned version of [google/mt5-small](https://huggingface.co/google/mt5-small) on the None dataset.
19
  It achieves the following results on the evaluation set:
20
+ - Loss: 0.0802
21
  - Rouge1: 0.0
22
  - Rouge2: 0.0
23
  - Rougel: 0.0
 
46
  - seed: 42
47
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
48
  - lr_scheduler_type: linear
49
+ - num_epochs: 8
50
 
51
  ### Training results
52
 
53
  | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum |
54
  |:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|
55
+ | 4.8701 | 1.0 | 500 | 0.1153 | 0.0 | 0.0 | 0.0 | 0.0 |
56
+ | 0.1204 | 2.0 | 1000 | 0.1054 | 0.0 | 0.0 | 0.0 | 0.0 |
57
+ | 0.1014 | 3.0 | 1500 | 0.0837 | 0.0 | 0.0 | 0.0 | 0.0 |
58
+ | 0.0898 | 4.0 | 2000 | 0.0861 | 0.0 | 0.0 | 0.0 | 0.0 |
59
+ | 0.083 | 5.0 | 2500 | 0.0931 | 0.0 | 0.0 | 0.0 | 0.0 |
60
+ | 0.0784 | 6.0 | 3000 | 0.0810 | 0.0 | 0.0 | 0.0 | 0.0 |
61
+ | 0.0763 | 7.0 | 3500 | 0.0845 | 0.0 | 0.0 | 0.0 | 0.0 |
62
+ | 0.0743 | 8.0 | 4000 | 0.0802 | 0.0 | 0.0 | 0.0 | 0.0 |
63
 
64
 
65
  ### Framework versions