tringuyen-uit commited on
Commit
3afeb19
1 Parent(s): 816adc0

End of training

Browse files
README.md CHANGED
@@ -1,5 +1,5 @@
1
  ---
2
- base_model: uitnlp/visobert
3
  tags:
4
  - generated_from_trainer
5
  metrics:
@@ -14,10 +14,10 @@ should probably proofread and complete it, then remove this comment. -->
14
 
15
  # VP_ViSoBERT_syl_ViWikiFC
16
 
17
- This model is a fine-tuned version of [uitnlp/visobert](https://huggingface.co/uitnlp/visobert) on the None dataset.
18
  It achieves the following results on the evaluation set:
19
- - Loss: 0.9243
20
- - Accuracy: 0.6364
21
 
22
  ## Model description
23
 
@@ -37,48 +37,58 @@ More information needed
37
 
38
  The following hyperparameters were used during training:
39
  - learning_rate: 2e-05
40
- - train_batch_size: 16
41
- - eval_batch_size: 16
42
  - seed: 42
43
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
44
  - lr_scheduler_type: linear
45
- - num_epochs: 3
46
 
47
  ### Training results
48
 
49
  | Training Loss | Epoch | Step | Validation Loss | Accuracy |
50
  |:-------------:|:-----:|:----:|:---------------:|:--------:|
51
- | 1.1299 | 0.1 | 100 | 1.1182 | 0.3411 |
52
- | 1.0816 | 0.19 | 200 | 1.0678 | 0.3976 |
53
- | 1.0181 | 0.29 | 300 | 1.0163 | 0.4823 |
54
- | 1.0121 | 0.38 | 400 | 0.9956 | 0.5072 |
55
- | 0.9617 | 0.48 | 500 | 0.9718 | 0.5048 |
56
- | 0.9297 | 0.57 | 600 | 0.9665 | 0.5239 |
57
- | 0.9332 | 0.67 | 700 | 0.9252 | 0.5646 |
58
- | 0.9057 | 0.76 | 800 | 0.9667 | 0.5421 |
59
- | 0.8756 | 0.86 | 900 | 0.8884 | 0.5871 |
60
- | 0.879 | 0.96 | 1000 | 0.8907 | 0.5718 |
61
- | 0.8249 | 1.05 | 1100 | 0.8793 | 0.5981 |
62
- | 0.7177 | 1.15 | 1200 | 0.8951 | 0.5957 |
63
- | 0.7145 | 1.24 | 1300 | 0.9523 | 0.6062 |
64
- | 0.7469 | 1.34 | 1400 | 0.9001 | 0.5986 |
65
- | 0.7358 | 1.43 | 1500 | 0.8865 | 0.6081 |
66
- | 0.7112 | 1.53 | 1600 | 0.9099 | 0.6057 |
67
- | 0.7299 | 1.62 | 1700 | 0.8496 | 0.6144 |
68
- | 0.6949 | 1.72 | 1800 | 0.8580 | 0.6124 |
69
- | 0.6988 | 1.81 | 1900 | 0.8840 | 0.6215 |
70
- | 0.6524 | 1.91 | 2000 | 0.8753 | 0.6134 |
71
- | 0.6914 | 2.01 | 2100 | 0.8729 | 0.6330 |
72
- | 0.5427 | 2.1 | 2200 | 0.9494 | 0.6431 |
73
- | 0.5628 | 2.2 | 2300 | 0.9531 | 0.6120 |
74
- | 0.5607 | 2.29 | 2400 | 0.9050 | 0.6340 |
75
- | 0.5396 | 2.39 | 2500 | 0.9149 | 0.6335 |
76
- | 0.5178 | 2.48 | 2600 | 0.9848 | 0.6124 |
77
- | 0.5322 | 2.58 | 2700 | 0.9198 | 0.6330 |
78
- | 0.5406 | 2.67 | 2800 | 0.9206 | 0.6364 |
79
- | 0.5183 | 2.77 | 2900 | 0.9150 | 0.6392 |
80
- | 0.5369 | 2.87 | 3000 | 0.9200 | 0.6340 |
81
- | 0.5105 | 2.96 | 3100 | 0.9243 | 0.6364 |
 
 
 
 
 
 
 
 
 
 
82
 
83
 
84
  ### Framework versions
 
1
  ---
2
+ base_model: tringuyen-uit/VP_ViSoBERT_syl_ViWikiFC
3
  tags:
4
  - generated_from_trainer
5
  metrics:
 
14
 
15
  # VP_ViSoBERT_syl_ViWikiFC
16
 
17
+ This model is a fine-tuned version of [tringuyen-uit/VP_ViSoBERT_syl_ViWikiFC](https://huggingface.co/tringuyen-uit/VP_ViSoBERT_syl_ViWikiFC) on the None dataset.
18
  It achieves the following results on the evaluation set:
19
+ - Loss: 1.1555
20
+ - Accuracy: 0.6445
21
 
22
  ## Model description
23
 
 
37
 
38
  The following hyperparameters were used during training:
39
  - learning_rate: 2e-05
40
+ - train_batch_size: 8
41
+ - eval_batch_size: 8
42
  - seed: 42
43
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
44
  - lr_scheduler_type: linear
45
+ - num_epochs: 2
46
 
47
  ### Training results
48
 
49
  | Training Loss | Epoch | Step | Validation Loss | Accuracy |
50
  |:-------------:|:-----:|:----:|:---------------:|:--------:|
51
+ | 0.6994 | 0.05 | 100 | 0.9688 | 0.6158 |
52
+ | 0.6904 | 0.1 | 200 | 0.9753 | 0.6014 |
53
+ | 0.7969 | 0.14 | 300 | 0.9446 | 0.5871 |
54
+ | 0.6801 | 0.19 | 400 | 0.9912 | 0.6057 |
55
+ | 0.7089 | 0.24 | 500 | 0.9617 | 0.5861 |
56
+ | 0.6627 | 0.29 | 600 | 1.0585 | 0.5689 |
57
+ | 0.6792 | 0.33 | 700 | 1.0064 | 0.6230 |
58
+ | 0.6702 | 0.38 | 800 | 1.0593 | 0.5818 |
59
+ | 0.6252 | 0.43 | 900 | 0.9621 | 0.5967 |
60
+ | 0.6262 | 0.48 | 1000 | 1.0152 | 0.5957 |
61
+ | 0.6515 | 0.53 | 1100 | 0.9539 | 0.6225 |
62
+ | 0.6596 | 0.57 | 1200 | 0.9188 | 0.6067 |
63
+ | 0.6458 | 0.62 | 1300 | 0.9318 | 0.6201 |
64
+ | 0.6087 | 0.67 | 1400 | 0.9532 | 0.6172 |
65
+ | 0.6282 | 0.72 | 1500 | 1.0107 | 0.6244 |
66
+ | 0.6266 | 0.76 | 1600 | 1.0199 | 0.6096 |
67
+ | 0.6165 | 0.81 | 1700 | 1.0973 | 0.6096 |
68
+ | 0.5869 | 0.86 | 1800 | 0.9177 | 0.6325 |
69
+ | 0.596 | 0.91 | 1900 | 0.8821 | 0.6364 |
70
+ | 0.6073 | 0.96 | 2000 | 0.9350 | 0.6306 |
71
+ | 0.5921 | 1.0 | 2100 | 0.9606 | 0.6282 |
72
+ | 0.4551 | 1.05 | 2200 | 1.0386 | 0.6373 |
73
+ | 0.3922 | 1.1 | 2300 | 1.1936 | 0.6368 |
74
+ | 0.39 | 1.15 | 2400 | 1.1922 | 0.6316 |
75
+ | 0.442 | 1.19 | 2500 | 1.1599 | 0.6220 |
76
+ | 0.4092 | 1.24 | 2600 | 1.3106 | 0.6196 |
77
+ | 0.4582 | 1.29 | 2700 | 1.1817 | 0.6316 |
78
+ | 0.4356 | 1.34 | 2800 | 1.1257 | 0.6316 |
79
+ | 0.4145 | 1.39 | 2900 | 1.1899 | 0.6354 |
80
+ | 0.4379 | 1.43 | 3000 | 1.1385 | 0.6388 |
81
+ | 0.4222 | 1.48 | 3100 | 1.1844 | 0.6249 |
82
+ | 0.3758 | 1.53 | 3200 | 1.2444 | 0.6311 |
83
+ | 0.4114 | 1.58 | 3300 | 1.1908 | 0.6349 |
84
+ | 0.4449 | 1.62 | 3400 | 1.1483 | 0.6273 |
85
+ | 0.4046 | 1.67 | 3500 | 1.1977 | 0.6306 |
86
+ | 0.4274 | 1.72 | 3600 | 1.1520 | 0.6450 |
87
+ | 0.3785 | 1.77 | 3700 | 1.1665 | 0.6330 |
88
+ | 0.3854 | 1.82 | 3800 | 1.1680 | 0.6474 |
89
+ | 0.3562 | 1.86 | 3900 | 1.1616 | 0.6459 |
90
+ | 0.3938 | 1.91 | 4000 | 1.1823 | 0.6397 |
91
+ | 0.5083 | 1.96 | 4100 | 1.1555 | 0.6445 |
92
 
93
 
94
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:984c82763e79c226b39f13f7c20c8f68f7441612b8e6647165ce3a121ac9896f
3
  size 390297116
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:0a135b09a6b83e30a8233b3596306a71f75d132620de5d3bfa2016fdddf9c211
3
  size 390297116
runs/Jun06_19-04-41_e3ad251fbc84/events.out.tfevents.1717700682.e3ad251fbc84.34.0 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:1ec2ace6760855a4890eb9d9396df4ce2ae043c3395b8ea46340decae4e9dde2
3
- size 26235
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:351c0597d8d9a7b672d25bef818cdbdb0991d625665adc8dc21beccb64fc7c38
3
+ size 27123