Update README.md
Browse files
README.md
CHANGED
@@ -79,7 +79,7 @@ print(transcriptions)
|
|
79 |
|
80 |
## Training
|
81 |
### Data preparation
|
82 |
-
We have processed [Common Voice 11.0](https://commonvoice.mozilla.org/en/datasets) using the NeMo toolkit. We used [get_commonvoice_data.py](https://github.com/NVIDIA/NeMo/blob/main/scripts/dataset_processing/get_commonvoice_data.py) to process the manifests and made
|
83 |
|
84 |
After cleaning the dataset and normalizing the `ñ` character to `ny`, we have used the following charset to create the final NeMo manifests for training.
|
85 |
```python
|
|
|
79 |
|
80 |
## Training
|
81 |
### Data preparation
|
82 |
+
We have processed [Common Voice 11.0](https://commonvoice.mozilla.org/en/datasets) using the NeMo toolkit. We used [get_commonvoice_data.py](https://github.com/NVIDIA/NeMo/blob/main/scripts/dataset_processing/get_commonvoice_data.py) to process the manifests and made a subsequent data cleaning step.
|
83 |
|
84 |
After cleaning the dataset and normalizing the `ñ` character to `ny`, we have used the following charset to create the final NeMo manifests for training.
|
85 |
```python
|