GlycerinLOL commited on
Commit
53c1607
1 Parent(s): 34ff7e8

End of training

Browse files
README.md CHANGED
@@ -17,12 +17,12 @@ should probably proofread and complete it, then remove this comment. -->
17
 
18
  This model is a fine-tuned version of [facebook/bart-large-xsum](https://huggingface.co/facebook/bart-large-xsum) on an unknown dataset.
19
  It achieves the following results on the evaluation set:
20
- - Loss: 2.3237
21
- - Rouge1: 0.4756
22
- - Rouge2: 0.203
23
- - Rougel: 0.3677
24
- - Rougelsum: 0.3678
25
- - Gen Len: 41.4318
26
 
27
  ## Model description
28
 
@@ -47,21 +47,15 @@ The following hyperparameters were used during training:
47
  - seed: 42
48
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
49
  - lr_scheduler_type: linear
50
- - num_epochs: 8
51
  - mixed_precision_training: Native AMP
52
 
53
  ### Training results
54
 
55
- | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
56
- |:-------------:|:-----:|:-----:|:---------------:|:------:|:------:|:------:|:---------:|:-------:|
57
- | 1.6644 | 1.0 | 1250 | 1.6972 | 0.4687 | 0.2036 | 0.3619 | 0.362 | 43.4245 |
58
- | 1.3035 | 2.0 | 2500 | 1.6463 | 0.4762 | 0.2104 | 0.3746 | 0.3747 | 39.5091 |
59
- | 1.0206 | 3.0 | 3750 | 1.7278 | 0.476 | 0.2117 | 0.3743 | 0.3746 | 38.9555 |
60
- | 0.8224 | 4.0 | 5000 | 1.8642 | 0.477 | 0.2094 | 0.3724 | 0.3723 | 40.5182 |
61
- | 0.654 | 5.0 | 6250 | 1.9480 | 0.4757 | 0.2083 | 0.3717 | 0.3716 | 39.8736 |
62
- | 0.5302 | 6.0 | 7500 | 2.1332 | 0.4773 | 0.2062 | 0.37 | 0.3699 | 40.8309 |
63
- | 0.4364 | 7.0 | 8750 | 2.2474 | 0.4749 | 0.2008 | 0.3648 | 0.3648 | 42.0391 |
64
- | 0.3782 | 8.0 | 10000 | 2.3237 | 0.4756 | 0.203 | 0.3677 | 0.3678 | 41.4318 |
65
 
66
 
67
  ### Framework versions
 
17
 
18
  This model is a fine-tuned version of [facebook/bart-large-xsum](https://huggingface.co/facebook/bart-large-xsum) on an unknown dataset.
19
  It achieves the following results on the evaluation set:
20
+ - Loss: 1.6256
21
+ - Rouge1: 0.4761
22
+ - Rouge2: 0.2113
23
+ - Rougel: 0.3732
24
+ - Rougelsum: 0.3731
25
+ - Gen Len: 41.0064
26
 
27
  ## Model description
28
 
 
47
  - seed: 42
48
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
49
  - lr_scheduler_type: linear
50
+ - num_epochs: 2
51
  - mixed_precision_training: Native AMP
52
 
53
  ### Training results
54
 
55
+ | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
56
+ |:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|:-------:|
57
+ | 1.6549 | 1.0 | 1250 | 1.6621 | 0.4696 | 0.2069 | 0.3636 | 0.3636 | 42.6855 |
58
+ | 1.2925 | 2.0 | 2500 | 1.6256 | 0.4761 | 0.2113 | 0.3732 | 0.3731 | 41.0064 |
 
 
 
 
 
 
59
 
60
 
61
  ### Framework versions
runs/Dec15_13-40-37_mgrqu6ctr1702517804785-49zpx/events.out.tfevents.1702618845.mgrqu6ctr1702517804785-49zpx.235326.0 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:769c81b1c9a0734cdf4f5472a5b155e168fc7ae83ee6405285d153716e9056af
3
- size 6706
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:998bfc4031aaa5cc2ce9e00e95660be099be50cd4ac720df5d59d2eb0a37b9ad
3
+ size 7585