projecte-aina/stt-ca-citrinet-512

Automatic Speech Recognition

angel-poc committed on Dec 9, 2022

Commit f14ba7e · 1 Parent(s): e44abf5

Update README.md

Files changed (1)

README.md +6 -0

@@ -89,6 +89,12 @@ After cleaning the dataset and normalizing the `ñ` character to `ny`, we have u
 ```python
 ['c', ' ', 'ó', 'g', 'a', 'o', 'ü', 'v', 'p', 't', "'", '—', 'f', 'k', 'à', 'ï', 'í', 'ú', 'd', 'l', 'z', 'é', 'w', 'm', 'r', 'n', 'y', '-', 'u', 'i', 'h', 'ç', 'e', '·', 'q', 'è', 'ò', 'j', 'x', 's', 'b']
 ```
+### Training procedure
+This model was fine-tuned from the pre-trained Spanish [stt-es-citrinet-512](https://catalog.ngc.nvidia.com/orgs/nvidia/teams/nemo/models/stt_es_citrinet_512) model. The initial learning rate was set to 0.005 and the minimum learning rate for weight decay was set to 1e-7.
+Training ran for 90 steps and was then continued for another 90 steps, starting from a learning rate of 0.0001.
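
The two training stages described above can be sketched as a learning-rate schedule. This is only an illustration under stated assumptions: the README does not name the scheduler, so a cosine decay toward the minimum learning rate is assumed here, and reusing the 1e-7 minimum for the second stage is also an assumption. The names `cosine_lr`, `STAGES`, and `two_stage_schedule` are hypothetical and not part of the model's training code.

```python
import math


def cosine_lr(step, total_steps, initial_lr, min_lr):
    """Cosine decay from initial_lr (at step 0) to min_lr (at step total_steps).

    Assumption: the README does not state which scheduler was used.
    """
    progress = min(max(step / total_steps, 0.0), 1.0)
    return min_lr + 0.5 * (initial_lr - min_lr) * (1.0 + math.cos(math.pi * progress))


# Learning rates and step counts are taken from the training procedure text.
# Assumption: the second stage reuses the same minimum learning rate.
STAGES = [
    {"initial_lr": 0.005, "min_lr": 1e-7, "steps": 90},
    {"initial_lr": 0.0001, "min_lr": 1e-7, "steps": 90},
]


def two_stage_schedule(stages=STAGES):
    """Return one learning rate per step, concatenated across all stages."""
    rates = []
    for stage in stages:
        for step in range(stage["steps"]):
            rates.append(
                cosine_lr(step, stage["steps"], stage["initial_lr"], stage["min_lr"])
            )
    return rates


schedule = two_stage_schedule()
```

Here `schedule[0]` is the learning rate at the first step of the first stage and `schedule[90]` is the rate at which the second stage restarts. A different scheduler would change the shape of the curve between those points, but not the starting values taken from the text.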
 ## Evaluation
 After evaluation on the test split of Common Voice 11.0 we have obtained a WER of 6.684.