geninhu
/

xls-asr-vi-40h-1B

Automatic Speech Recognition

hf-asr-leaderboard

robust-speech-event

Inference Endpoints

Model card Files Files and versions Metrics Training metrics Community

geninhu commited on Jan 29, 2022

Commit

48f71da

·

1 Parent(s): 5ffaf78

update model card README.md

Files changed (1) hide show

README.md +14 -36

README.md CHANGED Viewed

@@ -1,28 +1,12 @@
 ---
 license: apache-2.0
-language:
-- vi
 tags:
 - automatic-speech-recognition
-- robust-speech-event
-- common-voice
 model-index:
 - name: xls-asr-vi-40h-1B
-  results:
-  - task:
-      name: Speech Recognition
-      type: automatic-speech-recognition
-    dataset:
-      name: Common Voice 7.0 vi
-      type: common_voice
-      args: vi
-    metrics:
-       - name: Test WER
-         type: wer
-         value: 34.21
-       - name: Test CER
-         type: cer
-         value: 19.94
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -30,17 +14,18 @@ should probably proofread and complete it, then remove this comment. -->
 # xls-asr-vi-40h-1B
-This model is a fine-tuned version of [facebook/wav2vec2-xls-r-1b](https://huggingface.co/facebook/wav2vec2-xls-r-1b) on the common voice 7.0 vi & private dataset.
-### Benchmark WER result:
-| | [VIVOS](https://huggingface.co/datasets/vivos) | [COMMON VOICE 7.0 VI](https://huggingface.co/datasets/mozilla-foundation/common_voice_7_0) |
-|---|---|---|
-|without LM| 25.92 | 34.21 |
-### Benchmark CER result:
-| | [VIVOS](https://huggingface.co/datasets/vivos) | [COMMON VOICE 7.0 VI](https://huggingface.co/datasets/mozilla-foundation/common_voice_7_0) |
-|---|---|---|
-|without LM| 9.24 | 19.94 |
 ## Training and evaluation data
@@ -60,15 +45,8 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 1500
-- num_epochs: 50.0
 - mixed_precision_training: Native AMP
-- attention_dropout: 0.2
-- activation_dropout: 0.1
-- warmup_steps: 1500
-- mask_time_prob: .15
-- mask_time_length: 10
-- mask_feature_prob: 0.25
-- mask_feature_length: 64
 ### Training results

 ---
 license: apache-2.0
 tags:
 - automatic-speech-recognition
+- geninhu/fpt-vi
+- generated_from_trainer
 model-index:
 - name: xls-asr-vi-40h-1B
+  results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 # xls-asr-vi-40h-1B
+This model is a fine-tuned version of [facebook/wav2vec2-xls-r-1b](https://huggingface.co/facebook/wav2vec2-xls-r-1b) on the GENINHU/FPT-VI - NA dataset.
+It achieves the following results on the evaluation set:
+- Loss: 4.1691
+- Wer: 0.4133
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
 ## Training and evaluation data
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 1500
+- num_epochs: 10.0
 - mixed_precision_training: Native AMP
 ### Training results