rishivijayvargiya committed
Commit f5bab52 · verified · 1 Parent(s): 0233a8e

training to 8 epochs

Files changed (3)
  1. README.md +16 -14
  2. model.safetensors +1 -1
  3. training_args.bin +1 -1
README.md CHANGED
@@ -23,7 +23,7 @@ model-index:
  metrics:
  - name: Rouge1
  type: rouge
- value: 30.8728
+ value: 31.2683
  ---

  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -33,12 +33,12 @@ should probably proofread and complete it, then remove this comment. -->

  This model is a fine-tuned version of [t5-small](https://huggingface.co/t5-small) on the xsum dataset.
  It achieves the following results on the evaluation set:
- - Loss: 2.3024
- - Rouge1: 30.8728
- - Rouge2: 9.5192
- - Rougel: 24.4482
- - Rougelsum: 24.4482
- - Gen Len: 19.7209
+ - Loss: 2.2682
+ - Rouge1: 31.2683
+ - Rouge2: 9.8466
+ - Rougel: 24.8322
+ - Rougelsum: 24.8376
+ - Gen Len: 19.7033

  ## Model description

@@ -63,17 +63,19 @@ The following hyperparameters were used during training:
  - seed: 42
  - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
  - lr_scheduler_type: linear
- - num_epochs: 5
+ - num_epochs: 8
  - mixed_precision_training: Native AMP

  ### Training results

- | Training Loss | Epoch | Step | Gen Len | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum |
- |:-------------:|:-----:|:-----:|:-------:|:---------------:|:-------:|:------:|:-------:|:---------:|
- | 2.6596 | 1.0 | 12753 | 19.6937 | 2.4320 | 29.2361 | 8.3218 | 22.9893 | 22.9993 |
- | 2.5914 | 2.0 | 25506 | 19.7058 | 2.3844 | 29.8093 | 8.7641 | 23.5074 | 23.5169 |
- | 2.5171 | 4.0 | 51012 | 19.6821 | 2.3208 | 30.6455 | 9.3612 | 24.2744 | 24.2798 |
- | 2.4945 | 5.0 | 63765 | 19.7209 | 2.3024 | 30.8728 | 9.5192 | 24.4482 | 24.4482 |
+ | Training Loss | Epoch | Step | Gen Len | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum |
+ |:-------------:|:-----:|:------:|:-------:|:---------------:|:-------:|:------:|:-------:|:---------:|
+ | 2.6596 | 1.0 | 12753 | 19.6937 | 2.4320 | 29.2361 | 8.3218 | 22.9893 | 22.9993 |
+ | 2.5914 | 2.0 | 25506 | 19.7058 | 2.3844 | 29.8093 | 8.7641 | 23.5074 | 23.5169 |
+ | 2.5171 | 4.0 | 51012 | 19.6821 | 2.3208 | 30.6455 | 9.3612 | 24.2744 | 24.2798 |
+ | 2.4813 | 6.0 | 76518 | 19.6813 | 2.2865 | 31.1312 | 9.7512 | 24.6686 | 24.6688 |
+ | 2.4517 | 7.0 | 89271 | 19.7118 | 2.2757 | 31.1544 | 9.7509 | 24.6982 | 24.7016 |
+ | 2.451 | 8.0 | 102024 | 19.7033 | 2.2682 | 31.2683 | 9.8466 | 24.8322 | 24.8376 |


  ### Framework versions
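The README change above can be sanity-checked with a little arithmetic. The sketch below is illustrative only: it copies the step counts and final metrics from the diff, checks that every logged step is a whole multiple of the 12753 steps logged at epoch 1.0, and prints how each evaluation metric moved between the 5-epoch and 8-epoch runs. The names `steps_at` and `metric_deltas` are hypothetical helpers, not part of the training code.

```python
# Values copied from the README diff; all names here are illustrative.
STEPS_PER_EPOCH = 12753  # step count logged at epoch 1.0

# Epoch -> step, as logged in the updated "Training results" table.
logged_steps = {1: 12753, 2: 25506, 4: 51012, 6: 76518, 7: 89271, 8: 102024}

# Final evaluation metrics before (5 epochs) and after (8 epochs) this commit.
before = {"loss": 2.3024, "rouge1": 30.8728, "rouge2": 9.5192,
          "rougeL": 24.4482, "rougeLsum": 24.4482, "gen_len": 19.7209}
after = {"loss": 2.2682, "rouge1": 31.2683, "rouge2": 9.8466,
         "rougeL": 24.8322, "rougeLsum": 24.8376, "gen_len": 19.7033}


def steps_at(epoch: int, steps_per_epoch: int = STEPS_PER_EPOCH) -> int:
    """Expected optimizer step count after `epoch` full epochs."""
    return epoch * steps_per_epoch


def metric_deltas(old: dict, new: dict) -> dict:
    """Per-metric change (new minus old), rounded to 4 decimal places."""
    return {name: round(new[name] - old[name], 4) for name in old}


if __name__ == "__main__":
    # Every logged step should be epoch * steps-per-epoch.
    for epoch, step in logged_steps.items():
        assert steps_at(epoch) == step, (epoch, step)
    # Show the signed change for each metric.
    for name, delta in metric_deltas(before, after).items():
        print(f"{name}: {delta:+.4f}")
```

Under these numbers, three additional epochs lower the validation loss by about 0.03 and raise each ROUGE score by roughly 0.3 to 0.4 points.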
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:53f4842c50f09ea454bc083c09fddc1824206a01ccf29f2b2c8ecbb39d9d42b2
+ oid sha256:80db64a8b4e04fa154d19abd8005587c42e1a2b05a92cab4b447ec77d807f7d0
  size 242041896
training_args.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:1cd3b9c0d4bc3d9ecfa1ecef87cfc2c28e6da056fa2a1d6f424690a2a6f0e889
+ oid sha256:0b1f8c1eb2957c85221b7597bf21a7e0cb2089840d38c88f6ab88339bf0cd17d
  size 5496
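Both binary files in this commit are stored as Git LFS pointers, so the diff shows only the pointer text, not the binary contents. As a minimal sketch, the snippet below parses the two model.safetensors pointers from the diff and reports which fields changed. `parse_lfs_pointer` and `describe_change` are hypothetical helper names, and the parser assumes the simple three-line `key value` layout shown in this diff.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse a Git LFS pointer file into a dict of its key/value fields."""
    fields = {}
    for line in text.strip().splitlines():
        # Each pointer line is "<key> <value>", separated by the first space.
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    fields["size"] = int(fields["size"])  # size is a byte count
    return fields


def describe_change(old: dict, new: dict) -> dict:
    """Report whether the content hash and the byte size differ."""
    return {
        "oid_changed": old["oid"] != new["oid"],
        "size_changed": old["size"] != new["size"],
    }


# Pointer contents copied from the model.safetensors diff.
OLD_POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:53f4842c50f09ea454bc083c09fddc1824206a01ccf29f2b2c8ecbb39d9d42b2
size 242041896
"""
NEW_POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:80db64a8b4e04fa154d19abd8005587c42e1a2b05a92cab4b447ec77d807f7d0
size 242041896
"""

if __name__ == "__main__":
    change = describe_change(parse_lfs_pointer(OLD_POINTER),
                             parse_lfs_pointer(NEW_POINTER))
    print(change)
```

A changed `oid` with an unchanged `size` is consistent with the same architecture being saved with different weight values, though the pointers alone do not prove that.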