MiuN2k3 commited on
Commit
b5c808f
1 Parent(s): 3ce8fae

End of training

Browse files
Files changed (2) hide show
  1. README.md +18 -18
  2. model.safetensors +1 -1
README.md CHANGED
@@ -1,6 +1,6 @@
1
  ---
2
  library_name: transformers
3
- base_model: microsoft/infoxlm-base
4
  tags:
5
  - generated_from_trainer
6
  metrics:
@@ -9,22 +9,22 @@ metrics:
9
  - precision
10
  - recall
11
  model-index:
12
- - name: vp-infoxlm-base-dsc
13
  results: []
14
  ---
15
 
16
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
17
  should probably proofread and complete it, then remove this comment. -->
18
 
19
- # vp-infoxlm-base-dsc
20
 
21
- This model is a fine-tuned version of [microsoft/infoxlm-base](https://huggingface.co/microsoft/infoxlm-base) on the None dataset.
22
  It achieves the following results on the evaluation set:
23
- - Loss: 0.4642
24
- - Accuracy: 0.8251
25
- - F1: 0.8249
26
- - Precision: 0.8259
27
- - Recall: 0.8251
28
 
29
  ## Model description
30
 
@@ -44,8 +44,8 @@ More information needed
44
 
45
  The following hyperparameters were used during training:
46
  - learning_rate: 1e-05
47
- - train_batch_size: 16
48
- - eval_batch_size: 16
49
  - seed: 42
50
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
51
  - lr_scheduler_type: linear
@@ -54,13 +54,13 @@ The following hyperparameters were used during training:
54
 
55
  ### Training results
56
 
57
- | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1 | Precision | Recall |
58
- |:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|:---------:|:------:|
59
- | 0.9971 | 1.0 | 1590 | 0.8708 | 0.5664 | 0.5565 | 0.6042 | 0.5664 |
60
- | 0.7175 | 2.0 | 3180 | 0.5943 | 0.7631 | 0.7626 | 0.7713 | 0.7631 |
61
- | 0.5942 | 3.0 | 4770 | 0.5007 | 0.8069 | 0.8069 | 0.8075 | 0.8069 |
62
- | 0.4981 | 4.0 | 6360 | 0.4676 | 0.8188 | 0.8182 | 0.8218 | 0.8188 |
63
- | 0.4669 | 5.0 | 7950 | 0.4642 | 0.8251 | 0.8249 | 0.8259 | 0.8251 |
64
 
65
 
66
  ### Framework versions
 
1
  ---
2
  library_name: transformers
3
+ base_model: microsoft/infoxlm-large
4
  tags:
5
  - generated_from_trainer
6
  metrics:
 
9
  - precision
10
  - recall
11
  model-index:
12
+ - name: vp-infoxlm-large-dsc
13
  results: []
14
  ---
15
 
16
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
17
  should probably proofread and complete it, then remove this comment. -->
18
 
19
+ # vp-infoxlm-large-dsc
20
 
21
+ This model is a fine-tuned version of [microsoft/infoxlm-large](https://huggingface.co/microsoft/infoxlm-large) on the None dataset.
22
  It achieves the following results on the evaluation set:
23
+ - Loss: 0.6113
24
+ - Accuracy: 0.8706
25
+ - F1: 0.8705
26
+ - Precision: 0.8713
27
+ - Recall: 0.8706
28
 
29
  ## Model description
30
 
 
44
 
45
  The following hyperparameters were used during training:
46
  - learning_rate: 1e-05
47
+ - train_batch_size: 8
48
+ - eval_batch_size: 4
49
  - seed: 42
50
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
51
  - lr_scheduler_type: linear
 
54
 
55
  ### Training results
56
 
57
+ | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1 | Precision | Recall |
58
+ |:-------------:|:-----:|:-----:|:---------------:|:--------:|:------:|:---------:|:------:|
59
+ | 0.8771 | 1.0 | 3180 | 0.8099 | 0.6890 | 0.6914 | 0.7003 | 0.6890 |
60
+ | 0.5911 | 2.0 | 6360 | 0.5717 | 0.8014 | 0.8007 | 0.8107 | 0.8014 |
61
+ | 0.4608 | 3.0 | 9540 | 0.5323 | 0.8442 | 0.8442 | 0.8449 | 0.8442 |
62
+ | 0.407 | 4.0 | 12720 | 0.5047 | 0.8680 | 0.8679 | 0.8683 | 0.8680 |
63
+ | 0.3372 | 5.0 | 15900 | 0.6113 | 0.8706 | 0.8705 | 0.8713 | 0.8706 |
64
 
65
 
66
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:5762f19428c6ac10284c7800f392c10ae1ea86e48bba89b771f08021bc9400c3
3
  size 2239622772
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:0ee8ea16aa227918316176fb62f6471edda8613b0933373abda285efaaf3aec9
3
  size 2239622772