Update README.md
Browse files
README.md
CHANGED
@@ -57,7 +57,7 @@ Clone the repository to download the model:
|
|
57 |
git clone https://huggingface.co/projecte-aina/stt-ca-citrinet-512
|
58 |
```
|
59 |
|
60 |
-
Given that `NEMO_PATH` is the path that points to the downloaded stt-ca-citrinet-512.nemo file, to do inference over a set of `.wav` files you should:
|
61 |
|
62 |
```python
|
63 |
# Load the model
|
@@ -77,6 +77,10 @@ print(transcriptions)
|
|
77 |
|
78 |
```
|
79 |
|
|
|
|
|
|
|
|
|
80 |
## Training
|
81 |
### Data preparation
|
82 |
We have processed [Common Voice 11.0](https://commonvoice.mozilla.org/en/datasets) using the NeMo toolkit. We used [get_commonvoice_data.py](https://github.com/NVIDIA/NeMo/blob/main/scripts/dataset_processing/get_commonvoice_data.py) to process the manifests and made a subsequent data cleaning step.
|
@@ -85,6 +89,8 @@ After cleaning the dataset and normalizing the `ñ` character to `ny`, we have u
|
|
85 |
```python
|
86 |
['c', ' ', 'ó', 'g', 'a', 'o', 'ü', 'v', 'p', 't', "'", '—', 'f', 'k', 'à', 'ï', 'í', 'ú', 'd', 'l', 'z', 'é', 'w', 'm', 'r', 'n', 'y', '-', 'u', 'i', 'h', 'ç', 'e', '·', 'q', 'è', 'ò', 'j', 'x', 's', 'b']
|
87 |
```
|
|
|
|
|
88 |
|
89 |
## Additional information
|
90 |
|
|
|
57 |
git clone https://huggingface.co/projecte-aina/stt-ca-citrinet-512
|
58 |
```
|
59 |
|
60 |
+
Given that `NEMO_PATH` is the path that points to the downloaded `stt-ca-citrinet-512.nemo` file, to do inference over a set of `.wav` files you should:
|
61 |
|
62 |
```python
|
63 |
# Load the model
|
|
|
77 |
|
78 |
```
|
79 |
|
80 |
+
## Training data
|
81 |
+
|
82 |
+
This model has been trained on the [Common Voice 11.0](https://commonvoice.mozilla.org/en/datasets) training split.
|
83 |
+
|
84 |
## Training
|
85 |
### Data preparation
|
86 |
We have processed [Common Voice 11.0](https://commonvoice.mozilla.org/en/datasets) using the NeMo toolkit. We used [get_commonvoice_data.py](https://github.com/NVIDIA/NeMo/blob/main/scripts/dataset_processing/get_commonvoice_data.py) to process the manifests and made a subsequent data cleaning step.
|
|
|
89 |
```python
|
90 |
['c', ' ', 'ó', 'g', 'a', 'o', 'ü', 'v', 'p', 't', "'", '—', 'f', 'k', 'à', 'ï', 'í', 'ú', 'd', 'l', 'z', 'é', 'w', 'm', 'r', 'n', 'y', '-', 'u', 'i', 'h', 'ç', 'e', '·', 'q', 'è', 'ò', 'j', 'x', 's', 'b']
|
91 |
```
|
92 |
+
## Evaluation
|
93 |
+
After evaluation on the test split of Common Voice 11.0 we have obtained a WER of 6.684.
|
94 |
|
95 |
## Additional information
|
96 |
|