itsanastasiaminina committed commit cd8087a (verified) · 1 parent: d808351

End of training

README.md CHANGED
@@ -18,8 +18,8 @@ should probably proofread and complete it, then remove this comment. -->

  This model is a fine-tuned version of [roberta-base](https://huggingface.co/roberta-base) on the None dataset.
  It achieves the following results on the evaluation set:
- - Loss: 0.8747
- - F1: 0.6229

  ## Model description

@@ -38,22 +38,29 @@ More information needed
  ### Training hyperparameters

  The following hyperparameters were used during training:
- - learning_rate: 2e-05
  - train_batch_size: 32
  - eval_batch_size: 32
  - seed: 42
  - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
  - lr_scheduler_type: linear
- - num_epochs: 3
  - mixed_precision_training: Native AMP

  ### Training results

  | Training Loss | Epoch | Step | Validation Loss | F1 |
  |:-------------:|:-----:|:----:|:---------------:|:------:|
- | No log | 1.0 | 282 | 1.1173 | 0.4836 |
- | 0.9565 | 2.0 | 564 | 0.8655 | 0.6161 |
- | 0.9565 | 3.0 | 846 | 0.8747 | 0.6229 |


  ### Framework versions
 
  This model is a fine-tuned version of [roberta-base](https://huggingface.co/roberta-base) on the None dataset.
  It achieves the following results on the evaluation set:
+ - Loss: 0.9662
+ - F1: 0.6266

  ## Model description
 
  ### Training hyperparameters

  The following hyperparameters were used during training:
+ - learning_rate: 1e-05
  - train_batch_size: 32
  - eval_batch_size: 32
  - seed: 42
  - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
  - lr_scheduler_type: linear
+ - num_epochs: 10
  - mixed_precision_training: Native AMP
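
To make the schedule implied by the new hyperparameters concrete, here is a minimal sketch of a linear decay from the listed learning_rate of 1e-05 down to zero over the 2820 steps shown in the training results (282 steps per epoch for 10 epochs). The function name `linear_lr` is hypothetical, and zero warmup steps is an assumption, since the card does not list a warmup setting.

```python
def linear_lr(step: int, base_lr: float = 1e-5, total_steps: int = 2820,
              warmup_steps: int = 0) -> float:
    """Learning rate at `step`: linear warmup (if any), then linear decay to zero.

    warmup_steps=0 is an assumption; the model card does not state a warmup value.
    """
    if step < warmup_steps:
        return base_lr * step / max(1, warmup_steps)
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


# Learning rate at the start, the midpoint, and the end of the 10-epoch run.
steps_per_epoch = 282
for epoch in (0, 5, 10):
    step = epoch * steps_per_epoch
    print(f"epoch {epoch:>2} (step {step:>4}): lr = {linear_lr(step):.2e}")
```

Under these assumptions the rate is halved by the end of epoch 5 and reaches zero at step 2820.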

  ### Training results

  | Training Loss | Epoch | Step | Validation Loss | F1 |
  |:-------------:|:-----:|:----:|:---------------:|:------:|
+ | No log | 1.0 | 282 | 1.0224 | 0.5288 |
+ | 1.0295 | 2.0 | 564 | 1.0153 | 0.5571 |
+ | 1.0295 | 3.0 | 846 | 0.8929 | 0.6222 |
+ | 0.8387 | 4.0 | 1128 | 0.8547 | 0.6331 |
+ | 0.8387 | 5.0 | 1410 | 0.8895 | 0.6200 |
+ | 0.7514 | 6.0 | 1692 | 0.8920 | 0.6184 |
+ | 0.7514 | 7.0 | 1974 | 0.9353 | 0.6229 |
+ | 0.6815 | 8.0 | 2256 | 0.9303 | 0.6287 |
+ | 0.6083 | 9.0 | 2538 | 0.9879 | 0.6186 |
+ | 0.6083 | 10.0 | 2820 | 0.9662 | 0.6266 |
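
The summary at the top of the new README (Loss 0.9662, F1 0.6266) matches the final row of this table, not necessarily the best one. As a small sketch, the rows can be scanned for the epoch with the lowest validation loss and the epoch with the highest F1. The variable names are hypothetical; the numbers are copied from the table above.

```python
# (epoch, step, validation_loss, f1) copied from the new training results table.
results = [
    (1.0, 282, 1.0224, 0.5288),
    (2.0, 564, 1.0153, 0.5571),
    (3.0, 846, 0.8929, 0.6222),
    (4.0, 1128, 0.8547, 0.6331),
    (5.0, 1410, 0.8895, 0.6200),
    (6.0, 1692, 0.8920, 0.6184),
    (7.0, 1974, 0.9353, 0.6229),
    (8.0, 2256, 0.9303, 0.6287),
    (9.0, 2538, 0.9879, 0.6186),
    (10.0, 2820, 0.9662, 0.6266),
]

# Pick the row with the lowest validation loss and the row with the highest F1.
best_by_loss = min(results, key=lambda row: row[2])
best_by_f1 = max(results, key=lambda row: row[3])
final = results[-1]

print(f"lowest validation loss: epoch {best_by_loss[0]:.0f} ({best_by_loss[2]:.4f})")
print(f"highest F1:             epoch {best_by_f1[0]:.0f} ({best_by_f1[3]:.4f})")
print(f"final epoch:            epoch {final[0]:.0f} (loss {final[2]:.4f}, F1 {final[3]:.4f})")
```

In this table both criteria select epoch 4 (validation loss 0.8547, F1 0.6331), while training loss keeps falling through epoch 10. The commit does not say whether the best checkpoint was kept; with the Hugging Face Trainer, `load_best_model_at_end` together with `metric_for_best_model` is one way to do that.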


  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:72c743158e93bc0013a472f1bece05a849ae1af1acc22e300f40fc1bbe40b767
  size 498625128

  version https://git-lfs.github.com/spec/v1
+ oid sha256:f2c6673efba47ce4df3b9ff759b36d859f45b965492d952d3bf28a2a88e1b52e
  size 498625128
runs/Dec09_21-16-34_a90410dcfc7e/events.out.tfevents.1733778996.a90410dcfc7e.23.0 CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:f04af8c59dfee4ac1850fc26b6b84c1de54c7b594f13f2624eccbb4bb2fdb73b
- size 9564

  version https://git-lfs.github.com/spec/v1
+ oid sha256:a8de8c974743d27d95e0b6eb90c386ddaad4fa388263704f2495dc16612c5daa
+ size 9918