meoo225 commited on
Commit
96373f7
1 Parent(s): d600cba

End of training

Browse files
README.md CHANGED
@@ -16,9 +16,9 @@ should probably proofread and complete it, then remove this comment. -->
16
 
17
  This model is a fine-tuned version of [VietAI/vit5-base](https://huggingface.co/VietAI/vit5-base) on the None dataset.
18
  It achieves the following results on the evaluation set:
19
- - Loss: 0.2178
20
- - Bleu Score: 78.7502
21
- - Gen Len: 12.7826
22
 
23
  ## Model description
24
 
@@ -38,8 +38,8 @@ More information needed
38
 
39
  The following hyperparameters were used during training:
40
  - learning_rate: 0.0001
41
- - train_batch_size: 16
42
- - eval_batch_size: 16
43
  - seed: 42
44
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
45
  - lr_scheduler_type: linear
@@ -49,9 +49,9 @@ The following hyperparameters were used during training:
49
 
50
  | Training Loss | Epoch | Step | Validation Loss | Bleu Score | Gen Len |
51
  |:-------------:|:-----:|:----:|:---------------:|:----------:|:-------:|
52
- | 0.4635 | 1.0 | 419 | 0.2255 | 77.2861 | 12.7969 |
53
- | 0.166 | 2.0 | 838 | 0.2026 | 78.3051 | 12.8041 |
54
- | 0.0752 | 3.0 | 1257 | 0.2178 | 78.7502 | 12.7826 |
55
 
56
 
57
  ### Framework versions
 
16
 
17
  This model is a fine-tuned version of [VietAI/vit5-base](https://huggingface.co/VietAI/vit5-base) on the None dataset.
18
  It achieves the following results on the evaluation set:
19
+ - Loss: 0.2349
20
+ - Bleu Score: 79.2125
21
+ - Gen Len: 12.7933
22
 
23
  ## Model description
24
 
 
38
 
39
  The following hyperparameters were used during training:
40
  - learning_rate: 0.0001
41
+ - train_batch_size: 8
42
+ - eval_batch_size: 8
43
  - seed: 42
44
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
45
  - lr_scheduler_type: linear
 
49
 
50
  | Training Loss | Epoch | Step | Validation Loss | Bleu Score | Gen Len |
51
  |:-------------:|:-----:|:----:|:---------------:|:----------:|:-------:|
52
+ | 0.4686 | 1.0 | 838 | 0.2500 | 77.4621 | 12.8244 |
53
+ | 0.1722 | 2.0 | 1676 | 0.2120 | 78.5608 | 12.7933 |
54
+ | 0.0703 | 3.0 | 2514 | 0.2349 | 79.2125 | 12.7933 |
55
 
56
 
57
  ### Framework versions
logs/events.out.tfevents.1727321381.c600b8c03487.1830.0 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:558d80f10b849cc46fa84601c69e0d7b17480a575de306ad43a056b39d441d32
3
- size 6356
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:8e0fa55b8b52d148da1ec5a227b2de8bacef2883bd45e3e85cf2a995c08214d2
3
+ size 7297
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:ecfdc3c7fca7e6166726877e3910c67b7f44393b3d5a3e3d4fc755ec4c7918f1
3
  size 903834408
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:cebdc591a3e32a392cf8efd815278c785985d8e74ea693fae84b9937758cb2be
3
  size 903834408