meoo225 committed on
Commit
36da2bb
1 Parent(s): c893147

End of training

README.md CHANGED
@@ -1,7 +1,7 @@
  ---
  library_name: transformers
  license: apache-2.0
- base_model: google/mt5-small
+ base_model: google/mt5-base
  tags:
  - generated_from_trainer
  metrics:
@@ -17,14 +17,14 @@ should probably proofread and complete it, then remove this comment. -->
 
  # mT5_base
 
- This model is a fine-tuned version of [google/mt5-small](https://huggingface.co/google/mt5-small) on the None dataset.
+ This model is a fine-tuned version of [google/mt5-base](https://huggingface.co/google/mt5-base) on the None dataset.
  It achieves the following results on the evaluation set:
- - Loss: 0.3417
- - Bleu Score: 47.0526
- - Precision: 17.2043
- - Recall: 17.2043
- - Gen Len: 16.8315
- - Err: 17.2043
+ - Loss: 0.1703
+ - Bleu Score: 51.176
+ - Precision: 27.4791
+ - Recall: 27.4791
+ - Gen Len: 16.8805
+ - Err: 27.4791
 
  ## Model description
 
@@ -49,16 +49,15 @@ The following hyperparameters were used during training:
  - seed: 42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
- - num_epochs: 4
+ - num_epochs: 3
 
  ### Training results
 
  | Training Loss | Epoch | Step | Validation Loss | Bleu Score | Precision | Recall | Gen Len | Err |
  |:-------------:|:-----:|:----:|:---------------:|:----------:|:---------:|:-------:|:-------:|:-------:|
- | 2.798 | 1.0 | 838 | 0.5495 | 41.8683 | 7.7658 | 7.7658 | 16.7766 | 7.7658 |
- | 0.7216 | 2.0 | 1676 | 0.4311 | 44.9002 | 13.0227 | 13.0227 | 16.8148 | 13.0227 |
- | 0.5551 | 3.0 | 2514 | 0.3565 | 46.5247 | 16.0096 | 16.0096 | 16.816 | 16.0096 |
- | 0.4951 | 4.0 | 3352 | 0.3417 | 47.0526 | 17.2043 | 17.2043 | 16.8315 | 17.2043 |
+ | 1.3269 | 1.0 | 838 | 0.2396 | 48.4521 | 20.7885 | 20.7885 | 16.8339 | 20.7885 |
+ | 0.2831 | 2.0 | 1676 | 0.1861 | 50.5118 | 26.1649 | 26.1649 | 16.8781 | 26.1649 |
+ | 0.2167 | 3.0 | 2514 | 0.1703 | 51.176 | 27.4791 | 27.4791 | 16.8805 | 27.4791 |
 
 
  ### Framework versions
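
Reviewer note: the README change swaps the reported numbers from the google/mt5-small run (4 epochs) to the google/mt5-base run (3 epochs). The following is a minimal sketch, not part of the commit, that tabulates the final-epoch differences using only values copied from the diff above. The names `SMALL_RUN`, `BASE_RUN`, and `metric_deltas` are hypothetical and chosen for illustration.

```python
# Final-epoch evaluation metrics, copied from the removed (-) and added (+)
# lines of the README diff. The dict names and helper are illustrative only.
SMALL_RUN = {"Loss": 0.3417, "Bleu Score": 47.0526, "Precision": 17.2043,
             "Recall": 17.2043, "Gen Len": 16.8315, "Err": 17.2043}
BASE_RUN = {"Loss": 0.1703, "Bleu Score": 51.176, "Precision": 27.4791,
            "Recall": 27.4791, "Gen Len": 16.8805, "Err": 27.4791}

# Last logged step and epoch count for each run, from the results tables.
SMALL_STEPS, SMALL_EPOCHS = 3352, 4
BASE_STEPS, BASE_EPOCHS = 2514, 3


def metric_deltas(before, after):
    """Return {metric: after - before}, rounded to 4 decimal places."""
    return {name: round(after[name] - before[name], 4) for name in before}


deltas = metric_deltas(SMALL_RUN, BASE_RUN)
for name, delta in deltas.items():
    print(f"{name:<10} {SMALL_RUN[name]:>8} -> {BASE_RUN[name]:>8}  ({delta:+.4f})")

# Both runs log the same number of steps per epoch.
print("steps/epoch:", SMALL_STEPS // SMALL_EPOCHS, BASE_STEPS // BASE_EPOCHS)
```

On these numbers the mt5-base run ends with lower validation loss and higher BLEU after one fewer epoch. The Precision, Recall, and Err columns carry identical values in every row of both tables; the diff does not explain why, so the sketch reports them as given.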
logs/events.out.tfevents.1728145765.b035966b8ee0.4495.0 CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:82f722805b7b2d262a80022df42133833f7caf40edd1c593fe8846ad3bac085b
- size 6684
+ oid sha256:0628992bff50a5f319a9c8b6bf2c256091856babf7dd19abe6d40bf59171e75e
+ size 7775
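
Reviewer note: the TensorBoard event file is stored through Git LFS, so this diff shows only the pointer file (spec version, object hash, byte size) rather than the log contents. As a rough illustration, assuming the pointer text is available as a string, the two versions can be parsed and compared as below. `parse_lfs_pointer` is a hypothetical helper written for this note, not a function from any Git LFS library.

```python
def parse_lfs_pointer(text):
    """Parse the 'key value' lines of a Git LFS pointer file into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid field has the form '<hash algorithm>:<hex digest>'.
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "oid_algo": algo,
        "oid": digest,
        "size": int(fields["size"]),
    }


# Pointer contents copied from the removed (-) and added (+) sides of the diff.
old_pointer = parse_lfs_pointer("""\
version https://git-lfs.github.com/spec/v1
oid sha256:82f722805b7b2d262a80022df42133833f7caf40edd1c593fe8846ad3bac085b
size 6684
""")
new_pointer = parse_lfs_pointer("""\
version https://git-lfs.github.com/spec/v1
oid sha256:0628992bff50a5f319a9c8b6bf2c256091856babf7dd19abe6d40bf59171e75e
size 7775
""")

print("hash changed:", old_pointer["oid"] != new_pointer["oid"])
print("size change in bytes:", new_pointer["size"] - old_pointer["size"])
```

The changed hash and larger size show that the event file was rewritten and grew between the two commits. The pointer alone does not say what was added.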