Jana1994 committed on
Commit faf76d2
1 Parent(s): 7d96b0e

End of training

Files changed (1)
  1. README.md +18 -9
README.md CHANGED
@@ -22,7 +22,7 @@ model-index:
  metrics:
  - name: Wer
  type: wer
- value: 0.6882143267793492
  ---

  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -32,8 +32,8 @@ should probably proofread and complete it, then remove this comment. -->

  This model is a fine-tuned version of [facebook/wav2vec2-xls-r-300m](https://huggingface.co/facebook/wav2vec2-xls-r-300m) on the common_voice dataset.
  It achieves the following results on the evaluation set:
- - Loss: 0.8092
- - Wer: 0.6882

  ## Model description

@@ -53,23 +53,32 @@ More information needed

  The following hyperparameters were used during training:
  - learning_rate: 0.0003
- - train_batch_size: 16
  - eval_batch_size: 8
  - seed: 42
  - gradient_accumulation_steps: 2
- - total_train_batch_size: 32
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
  - lr_scheduler_warmup_steps: 300
- - num_epochs: 10

  ### Training results

  | Training Loss | Epoch | Step | Validation Loss | Wer |
  |:-------------:|:-----:|:----:|:---------------:|:------:|
- | 5.5231 | 3.33 | 200 | 2.9353 | 1.0000 |
- | 2.1315 | 6.67 | 400 | 1.0531 | 0.8513 |
- | 0.5451 | 10.0 | 600 | 0.8092 | 0.6882 |


  ### Framework versions
 
  metrics:
  - name: Wer
  type: wer
+ value: 0.6113372427273772
  ---

  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 
  This model is a fine-tuned version of [facebook/wav2vec2-xls-r-300m](https://huggingface.co/facebook/wav2vec2-xls-r-300m) on the common_voice dataset.
  It achieves the following results on the evaluation set:
+ - Loss: 1.0444
+ - Wer: 0.6113
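
For readers unfamiliar with the metric: WER (word error rate) is conventionally the word-level edit distance between the model's transcript and the reference, divided by the number of reference words. The sketch below shows that definition in plain Python. It is not the evaluation code used for this run; the function name `word_error_rate` and the sample sentences are illustrative assumptions.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far (before the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (ref_word != hyp_word)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)


# Hypothetical example: one substitution ("sat" -> "sit") and one
# deletion (the second "the") against a six-word reference.
example_wer = word_error_rate("the cat sat on the mat", "the cat sit on mat")
print(round(example_wer, 4))
```

Under this definition, the reported 0.6113 corresponds to roughly 61 word errors per 100 reference words, assuming the score is aggregated over the whole evaluation set.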

  ## Model description

 
  The following hyperparameters were used during training:
  - learning_rate: 0.0003
+ - train_batch_size: 8
  - eval_batch_size: 8
  - seed: 42
  - gradient_accumulation_steps: 2
+ - total_train_batch_size: 16
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
  - lr_scheduler_warmup_steps: 300
+ - num_epochs: 20
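
Two of the values above follow from the others, which the sketch below makes explicit. It checks that `total_train_batch_size` equals `train_batch_size` times `gradient_accumulation_steps`, and it traces the shape of a linear schedule with 300 warmup steps. This is an illustration under stated assumptions, not the Trainer's own code: a single training device is assumed, the function names are hypothetical, and `total_steps=2400` is taken from the last row of the training results table for the new run.

```python
def effective_batch_size(per_device: int, grad_accum: int, num_devices: int = 1) -> int:
    """Examples consumed per optimizer step (num_devices=1 is an assumption)."""
    return per_device * grad_accum * num_devices


def linear_lr(step: int, base_lr: float = 3e-4,
              warmup_steps: int = 300, total_steps: int = 2400) -> float:
    """Linear warmup from 0 to base_lr, then linear decay to 0 at total_steps."""
    if step < warmup_steps:
        return base_lr * step / warmup_steps
    remaining = max(0, total_steps - step)
    return base_lr * remaining / (total_steps - warmup_steps)


# Old run: 16 per step x 2 accumulation steps; new run: 8 x 2.
print(effective_batch_size(16, 2), effective_batch_size(8, 2))

# Learning rate at a few points of the new 2400-step run.
for step in (0, 150, 300, 1350, 2400):
    print(step, linear_lr(step))
```

Halving `train_batch_size` while keeping `gradient_accumulation_steps` at 2 halves the number of examples per optimizer step, so each epoch takes twice as many steps as in the previous run.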

  ### Training results

  | Training Loss | Epoch | Step | Validation Loss | Wer |
  |:-------------:|:-----:|:----:|:---------------:|:------:|
+ | 5.5551 | 1.67 | 200 | 2.9315 | 1.0 |
+ | 2.7259 | 3.33 | 400 | 1.6133 | 0.9592 |
+ | 1.1027 | 5.0 | 600 | 0.9823 | 0.8117 |
+ | 0.5978 | 6.67 | 800 | 0.9360 | 0.7384 |
+ | 0.4142 | 8.33 | 1000 | 0.9242 | 0.6867 |
+ | 0.3098 | 10.0 | 1200 | 0.9829 | 0.6749 |
+ | 0.2511 | 11.67 | 1400 | 1.0105 | 0.6674 |
+ | 0.2181 | 13.33 | 1600 | 1.0412 | 0.6524 |
+ | 0.1765 | 15.0 | 1800 | 1.0473 | 0.6415 |
+ | 0.1602 | 16.67 | 2000 | 1.0681 | 0.6256 |
+ | 0.1415 | 18.33 | 2200 | 1.0362 | 0.6107 |
+ | 0.1311 | 20.0 | 2400 | 1.0444 | 0.6113 |
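
The added rows can be checked with a little arithmetic. The sketch below derives the steps per epoch from the final row and finds the checkpoints with the lowest validation loss and the lowest WER. The numbers are copied from the table; the variable names are illustrative. The training-set size is only an estimate, assuming a `total_train_batch_size` of 16 and no partial batches.

```python
# (epoch, step, validation_loss, wer) copied from the added table rows.
rows = [
    (1.67, 200, 2.9315, 1.0),
    (3.33, 400, 1.6133, 0.9592),
    (5.0, 600, 0.9823, 0.8117),
    (6.67, 800, 0.9360, 0.7384),
    (8.33, 1000, 0.9242, 0.6867),
    (10.0, 1200, 0.9829, 0.6749),
    (11.67, 1400, 1.0105, 0.6674),
    (13.33, 1600, 1.0412, 0.6524),
    (15.0, 1800, 1.0473, 0.6415),
    (16.67, 2000, 1.0681, 0.6256),
    (18.33, 2200, 1.0362, 0.6107),
    (20.0, 2400, 1.0444, 0.6113),
]

final_epoch, final_step, _, _ = rows[-1]
steps_per_epoch = final_step / final_epoch

# Rough estimate only: assumes 16 examples per optimizer step.
approx_train_examples = steps_per_epoch * 16

best_loss_row = min(rows, key=lambda r: r[2])  # lowest validation loss
best_wer_row = min(rows, key=lambda r: r[3])   # lowest WER

print("steps per epoch:", steps_per_epoch)
print("approx. training examples:", approx_train_examples)
print("lowest validation loss at step", best_loss_row[1], "->", best_loss_row[2])
print("lowest WER at step", best_wer_row[1], "->", best_wer_row[3])
```

In these rows the validation loss is lowest at step 1000 and rises afterward, while WER keeps falling until step 2200. The final checkpoint at step 2400, which the card reports, is marginally worse on WER (0.6113 versus 0.6107).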


  ### Framework versions