Update README.md
README.md
CHANGED
@@ -33,7 +33,7 @@ The ASR system is composed of:
 - the **Acoustic model** (wav2vec2.0 + DNN + CTC greedy decode). The pretrained wav2vec 2.0 model [LeBenchmark/wav2vec2-FR-7K-large](https://huggingface.co/LeBenchmark/wav2vec2-FR-7K-large) is combined with two DNN layers and fine-tuned on Orféo.
 The final acoustic representation is given to the CTC greedy decode.
 
-We used recordings sampled at 16kHz (single channel).
+We used recordings sampled at 16kHz (single channel). For training, we did not use audio files longer than 10 seconds to prevent memory issues.
 
 ## 💻 How to transcribe a file with the model
 
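
The sentence added by this change describes a length filter on the training audio. As a rough illustration only, the sketch below shows one way such a filter could be applied to a folder of WAV files using Python's standard `wave` module. It is not the authors' training pipeline: the function names (`wav_info`, `keep_for_training`, `select_training_files`) are hypothetical, and the check assumes uncompressed PCM WAV input. Only the 10-second limit and the 16 kHz single-channel format come from the README text.

```python
import wave
from pathlib import Path

MAX_SECONDS = 10.0    # limit stated in the README change
SAMPLE_RATE = 16_000  # 16 kHz, as stated in the README
CHANNELS = 1          # single channel, as stated in the README


def wav_info(path):
    """Return (duration_seconds, sample_rate, channels) for a PCM WAV file."""
    with wave.open(str(path), "rb") as wf:
        rate = wf.getframerate()
        return wf.getnframes() / rate, rate, wf.getnchannels()


def keep_for_training(path, max_seconds=MAX_SECONDS):
    """Hypothetical filter: keep 16 kHz mono files no longer than max_seconds."""
    duration, rate, channels = wav_info(path)
    return duration <= max_seconds and rate == SAMPLE_RATE and channels == CHANNELS


def select_training_files(folder):
    """Return the sorted list of .wav files in folder that pass the filter."""
    return sorted(p for p in Path(folder).glob("*.wav") if keep_for_training(p))
```

Calling `select_training_files("data/train")` (the folder name is a placeholder) would return only the files that satisfy all three conditions. Dropping long utterances before batching is a common way to bound peak memory, since batches are typically padded to the longest item.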