arturslogins/whisper-medium-lv

Automatic Speech Recognition

hf-asr-leaderboard

Generated from Trainer

arturslogins committed on Apr 30

Commit 332378e (1 parent: 6f73192)

End of training

Files changed (1)

README.md +13 -13

@@ -25,7 +25,7 @@ model-index:
     metrics:
     - name: Wer
       type: wer
-      value: 44.06162804804076
+      value: 43.705463182897866
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -35,8 +35,8 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [arturslogins/whisper-medium-lv](https://huggingface.co/arturslogins/whisper-medium-lv) on the Recorded Voice dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.1246
-- Wer: 44.0616
+- Loss: 1.0843
+- Wer: 43.7055
 ## Model description
@@ -56,10 +56,10 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 1e-05
-- train_batch_size: 2
-- eval_batch_size: 8
+- train_batch_size: 16
+- eval_batch_size: 12
 - seed: 42
-- gradient_accumulation_steps: 16
+- gradient_accumulation_steps: 2
 - total_train_batch_size: 32
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
@@ -69,13 +69,13 @@ The following hyperparameters were used during training:
 ### Training results
-| Training Loss | Epoch | Step | Validation Loss | Wer     |
-|:-------------:|:-----:|:----:|:---------------:|:-------:|
-| 0.0175        | 6.4   | 1000 | 0.9442          | 45.7358 |
-| 0.0043        | 12.8  | 2000 | 0.9789          | 45.6994 |
-| 0.0012        | 19.2  | 3000 | 1.0763          | 47.4221 |
-| 0.0001        | 25.6  | 4000 | 1.1194          | 44.1708 |
-| 0.0001        | 32.0  | 5000 | 1.1246          | 44.0616 |
+| Training Loss | Epoch   | Step | Validation Loss | Wer     |
+|:-------------:|:-------:|:----:|:---------------:|:-------:|
+| 0.0097        | 6.3898  | 1000 | 0.9780          | 45.8432 |
+| 0.0023        | 12.7796 | 2000 | 1.0158          | 44.9525 |
+| 0.0002        | 19.1693 | 3000 | 1.0549          | 46.1758 |
+| 0.0001        | 25.5591 | 4000 | 1.0765          | 44.9881 |
+| 0.0001        | 31.9489 | 5000 | 1.0843          | 43.7055 |
 ### Framework versions
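
The hyperparameter hunk changes `train_batch_size` (2 to 16) and `gradient_accumulation_steps` (16 to 2) while `total_train_batch_size` stays at 32. A minimal Python sketch of that arithmetic follows. It assumes a single training device, which the diff does not state, and `effective_batch_size` is a hypothetical helper name used only for illustration.

```python
def effective_batch_size(
    per_device_batch_size: int,
    gradient_accumulation_steps: int,
    num_devices: int = 1,  # assumption: the diff does not state the device count
) -> int:
    """Number of examples that contribute to one optimizer step."""
    return per_device_batch_size * gradient_accumulation_steps * num_devices


# Values taken from the removed (-) and added (+) lines of the diff.
old_total = effective_batch_size(per_device_batch_size=2, gradient_accumulation_steps=16)
new_total = effective_batch_size(per_device_batch_size=16, gradient_accumulation_steps=2)

print(old_total, new_total)
```

Under the single-device assumption, both configurations give the same product as the unchanged `total_train_batch_size: 32` line, so the commit trades accumulation steps for a larger per-step batch without changing the number of examples per optimizer update.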