**This is a CTC-based Automatic Speech Recognition system for French.**
This model is part of the SLU demo available here: [LINK TO THE DEMO GOES HERE]

It is based on the [mHuBERT-147](https://huggingface.co/utter-project/mHuBERT-147) speech foundation model.

* Training data: XX hours
# Table of Contents:

1. [Training Parameters](https://huggingface.co/naver/mHuBERT-147-ASR-fr#Training-Parameters)
2. [ASR Model class](https://huggingface.co/naver/mHuBERT-147-ASR-fr#ASR-Model-Class)
3. [Running inference](https://huggingface.co/naver/mHuBERT-147-ASR-fr#Running-Inference)
## Training Parameters

The training parameters are available in [config.yaml](https://huggingface.co/naver/mHuBERT-147-ASR-fr/blob/main/config.yaml).
We highlight the use of 0.3 for hubert.final_dropout, which we found to be very helpful for convergence. We also use fp32 training, as we found fp16 training to be unstable.
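As a quick illustration of those two settings, a configuration fragment might look like the following. The key names here are hypothetical; consult the repository's actual config.yaml for the real structure and values.

```yaml
# Hypothetical fragment -- key names are illustrative, see the repo's config.yaml
hubert:
  final_dropout: 0.3   # found very helpful for convergence
training:
  precision: fp32      # fp16 training was unstable
```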
## ASR Model Class

We use the mHubertForCTC class for our model, which is nearly identical to the existing HubertForCTC class.
The key difference is that we've added a few additional hidden layers at the end of the Transformer stack, just before the lm_head.
The code is available in [CTC_model.py](https://huggingface.co/naver/mHuBERT-147-ASR-fr/blob/main/CTC_model.py).
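The idea above ("a few additional hidden layers just before the lm_head") can be sketched as follows. The layer count, hidden size, and vocabulary size are hypothetical, and this is only the head, not the full model; the real implementation is in CTC_model.py.

```python
import torch
import torch.nn as nn

class ExtraLayersHead(nn.Module):
    """Sketch of the head described above: a few extra hidden layers
    between the HuBERT Transformer output and the CTC lm_head.
    Sizes and layer count are hypothetical; see CTC_model.py for the real code."""

    def __init__(self, hidden_size=768, vocab_size=43, n_extra=2):
        super().__init__()
        blocks = []
        for _ in range(n_extra):
            blocks += [nn.Linear(hidden_size, hidden_size), nn.GELU()]
        self.extra_layers = nn.Sequential(*blocks)
        self.lm_head = nn.Linear(hidden_size, vocab_size)  # per-frame CTC logits

    def forward(self, hidden_states):
        # hidden_states: (batch, frames, hidden) from the Transformer stack
        return self.lm_head(self.extra_layers(hidden_states))

head = ExtraLayersHead()
logits = head(torch.randn(2, 50, 768))
print(logits.shape)  # torch.Size([2, 50, 43])
```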
## Running Inference

The run_asr.py file illustrates how to load the model for inference (**load_asr_model**) and how to produce a transcription for a file (**run_asr_inference**).
Please follow the [requirements file](https://huggingface.co/naver/mHuBERT-147-ASR-fr/blob/main/requirements.txt) to avoid incorrect model loading.