meoo225 commited on
Commit
4aeb4e7
1 Parent(s): 1769fe8

End of training

Browse files
README.md CHANGED
@@ -16,9 +16,9 @@ should probably proofread and complete it, then remove this comment. -->
16
 
17
  This model is a fine-tuned version of [google/mt5-small](https://huggingface.co/google/mt5-small) on the None dataset.
18
  It achieves the following results on the evaluation set:
19
- - Loss: 0.3211
20
- - Bleu Score: 48.8286
21
- - Gen Len: 16.8423
22
 
23
  ## Model description
24
 
@@ -38,8 +38,8 @@ More information needed
38
 
39
  The following hyperparameters were used during training:
40
  - learning_rate: 0.0001
41
- - train_batch_size: 2
42
- - eval_batch_size: 2
43
  - seed: 42
44
  - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
45
  - lr_scheduler_type: linear
@@ -47,11 +47,11 @@ The following hyperparameters were used during training:
47
 
48
  ### Training results
49
 
50
- | Training Loss | Epoch | Step | Validation Loss | Bleu Score | Gen Len |
51
- |:-------------:|:-----:|:-----:|:---------------:|:----------:|:-------:|
52
- | 2.4292 | 1.0 | 3349 | 0.4501 | 45.5394 | 16.8363 |
53
- | 0.5249 | 2.0 | 6698 | 0.3453 | 48.2109 | 16.8315 |
54
- | 0.4109 | 3.0 | 10047 | 0.3211 | 48.8286 | 16.8423 |
55
 
56
 
57
  ### Framework versions
 
16
 
17
  This model is a fine-tuned version of [google/mt5-small](https://huggingface.co/google/mt5-small) on the None dataset.
18
  It achieves the following results on the evaluation set:
19
+ - Loss: 0.4030
20
+ - Bleu Score: 45.1405
21
+ - Gen Len: 16.8196
22
 
23
  ## Model description
24
 
 
38
 
39
  The following hyperparameters were used during training:
40
  - learning_rate: 0.0001
41
+ - train_batch_size: 8
42
+ - eval_batch_size: 8
43
  - seed: 42
44
  - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
45
  - lr_scheduler_type: linear
 
47
 
48
  ### Training results
49
 
50
+ | Training Loss | Epoch | Step | Validation Loss | Bleu Score | Gen Len |
51
+ |:-------------:|:-----:|:----:|:---------------:|:----------:|:-------:|
52
+ | 2.8374 | 1.0 | 838 | 0.5595 | 41.3763 | 16.7802 |
53
+ | 0.7564 | 2.0 | 1676 | 0.4503 | 44.4878 | 16.8124 |
54
+ | 0.6103 | 3.0 | 2514 | 0.4030 | 45.1405 | 16.8196 |
55
 
56
 
57
  ### Framework versions
logs/events.out.tfevents.1731686437.e7a466c1d5eb.457.0 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:1531aeef6e05caf4927fddcef78285d92ec6169e9988988fa18ebba23f68e14c
3
- size 6697
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:26f0b3e542e3514c835a97d69da9c6e8428dde53843493dfff2c9ee24dce5d5a
3
+ size 7427