Safick-HF committed
Commit a049532 (verified)
1 Parent(s): 57505f9

End of training

README.md CHANGED
@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 
  This model is a fine-tuned version of [deepset/gelectra-large-germanquad](https://huggingface.co/deepset/gelectra-large-germanquad) on the None dataset.
  It achieves the following results on the evaluation set:
- - Loss: 0.5728
+ - Loss: 0.0130
 
  ## Model description
 
@@ -36,7 +36,7 @@ More information needed
  The following hyperparameters were used during training:
  - learning_rate: 2e-05
  - train_batch_size: 5
- - eval_batch_size: 10
+ - eval_batch_size: 5
  - seed: 42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
@@ -46,16 +46,16 @@ The following hyperparameters were used during training:
 
  | Training Loss | Epoch | Step | Validation Loss |
  |:-------------:|:-----:|:----:|:---------------:|
- | 0.0022 | 1.0 | 3 | 1.3113 |
- | 0.0013 | 2.0 | 6 | 1.2872 |
- | 0.0006 | 3.0 | 9 | 1.2459 |
- | 0.0009 | 4.0 | 12 | 0.9726 |
- | 0.0004 | 5.0 | 15 | 0.7666 |
- | 0.001 | 6.0 | 18 | 0.6629 |
- | 0.0007 | 7.0 | 21 | 0.6085 |
- | 0.0008 | 8.0 | 24 | 0.5842 |
- | 0.0007 | 9.0 | 27 | 0.5732 |
- | 0.0007 | 10.0 | 30 | 0.5728 |
+ | 3.792 | 1.0 | 3 | 0.0385 |
+ | 0.4079 | 2.0 | 6 | 0.0418 |
+ | 0.1837 | 3.0 | 9 | 0.0511 |
+ | 0.0175 | 4.0 | 12 | 0.0573 |
+ | 0.0426 | 5.0 | 15 | 0.0547 |
+ | 0.0108 | 6.0 | 18 | 0.0438 |
+ | 0.0117 | 7.0 | 21 | 0.0276 |
+ | 0.0255 | 8.0 | 24 | 0.0184 |
+ | 0.0087 | 9.0 | 27 | 0.0143 |
+ | 0.0041 | 10.0 | 30 | 0.0130 |
 
 
  ### Framework versions
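
Review note: the updated results table can be checked against the headline "Loss: 0.0130" with a few lines of Python. The sketch below is not part of the commit; the helper names (`parse_results`, `example_count_bounds`) are hypothetical, and the dataset-size estimate assumes a single device, no gradient accumulation, and that the last partial batch is kept, none of which the diff confirms.

```python
# Rows copied from the "+" side of the README.md results table in this commit.
TABLE = """\
| Training Loss | Epoch | Step | Validation Loss |
|:-------------:|:-----:|:----:|:---------------:|
| 3.792 | 1.0 | 3 | 0.0385 |
| 0.4079 | 2.0 | 6 | 0.0418 |
| 0.1837 | 3.0 | 9 | 0.0511 |
| 0.0175 | 4.0 | 12 | 0.0573 |
| 0.0426 | 5.0 | 15 | 0.0547 |
| 0.0108 | 6.0 | 18 | 0.0438 |
| 0.0117 | 7.0 | 21 | 0.0276 |
| 0.0255 | 8.0 | 24 | 0.0184 |
| 0.0087 | 9.0 | 27 | 0.0143 |
| 0.0041 | 10.0 | 30 | 0.0130 |
"""


def parse_results(table):
    """Turn the markdown table into a list of {column name: float} dicts."""
    lines = [ln.strip() for ln in table.splitlines() if ln.strip()]
    header = [cell.strip() for cell in lines[0].strip("|").split("|")]
    rows = []
    for ln in lines[2:]:  # skip the header row and the alignment row
        cells = [float(cell) for cell in ln.strip("|").split("|")]
        rows.append(dict(zip(header, cells)))
    return rows


def example_count_bounds(steps_per_epoch, batch_size):
    """Range of training-set sizes that yields this many steps per epoch.

    Assumption (not stated in the diff): one device, no gradient
    accumulation, and the final partial batch is not dropped.
    """
    return (steps_per_epoch - 1) * batch_size + 1, steps_per_epoch * batch_size


rows = parse_results(TABLE)
best = min(rows, key=lambda r: r["Validation Loss"])
worst = max(rows, key=lambda r: r["Validation Loss"])
steps_per_epoch = int(rows[0]["Step"] / rows[0]["Epoch"])
low, high = example_count_bounds(steps_per_epoch, batch_size=5)

print("lowest validation loss:", best["Validation Loss"], "at epoch", best["Epoch"])
print("highest validation loss:", worst["Validation Loss"], "at epoch", worst["Epoch"])
print("steps per epoch:", steps_per_epoch, "-> roughly", low, "to", high, "training examples")
```

In this table the lowest validation loss falls on the final epoch, which matches the headline value, while the loss rises through epoch 4 before it starts to fall. At three optimizer steps per epoch the training set is very small under the stated assumptions, so these numbers are best read with caution.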
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:3db19d4c8558b70ba0ea78061c68feceaf733c9515810f3d037379955e79d686
+ oid sha256:2d24a605aba6b1621ac407aa43ae4a678506a03153f6ec5cd1314f0c018e2abf
  size 1338801016
runs/Mar22_10-49-11_b525f12b76d7/events.out.tfevents.1711104551.b525f12b76d7.158.0 ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:44e730149ef0adc9613f1e310c1ae5b108392eeaa2ed46bf4108f414b3953b98
+ size 14010
training_args.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:ec94d9538d106a1563c4113c8af08722d4bdfc368cb698510fdf70bee72b1e20
+ oid sha256:2cb4d4d184f08dbad05680201f58e58253c5f057681df15d559e86b0b6e69b83
  size 4920
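
Review note: the binary files in this commit are stored as Git LFS pointers, small text files with `version`, `oid`, and `size` lines, so the diff shows only a changed hash. A minimal sketch of reading such a pointer and comparing the two sides of the `model.safetensors` change follows; `parse_lfs_pointer` is a hypothetical helper written for this note, not a Git LFS or Hugging Face API.

```python
def parse_lfs_pointer(text):
    """Parse the key/value lines of a Git LFS pointer file into a dict."""
    fields = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        key, _, value = line.partition(" ")  # each line is "<key> <value>"
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")  # e.g. "sha256:<hex>"
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


# Pointer contents copied from the model.safetensors diff above.
OLD_MODEL = """\
version https://git-lfs.github.com/spec/v1
oid sha256:3db19d4c8558b70ba0ea78061c68feceaf733c9515810f3d037379955e79d686
size 1338801016
"""
NEW_MODEL = """\
version https://git-lfs.github.com/spec/v1
oid sha256:2d24a605aba6b1621ac407aa43ae4a678506a03153f6ec5cd1314f0c018e2abf
size 1338801016
"""

old = parse_lfs_pointer(OLD_MODEL)
new = parse_lfs_pointer(NEW_MODEL)
content_changed = old["digest"] != new["digest"]
size_changed = old["size"] != new["size"]

print("content changed:", content_changed)
print("size changed:", size_changed)
```

A new digest with an identical byte size is what one would expect when the weights are retrained but the architecture and tensor shapes stay the same. To confirm a downloaded file matches a pointer, its SHA-256 could be computed with `hashlib.sha256` and compared against `digest`.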