sadrasabouri
commited on
Commit
•
9603413
1
Parent(s):
52867a6
Update README.md
Browse files
README.md
CHANGED
@@ -32,7 +32,11 @@ model-index:
|
|
32 |
|
33 |
# Sharif-wav2vec2
|
34 |
|
35 |
-
This is the fine-tuned version of Sharif Wav2vec2 for Farsi. The base model was fine-tuned on 108 hours of Commonvoice's Farsi samples with a sampling rate equal to 16kHz. Afterward, we trained a 5gram using [kenlm](https://github.com/kpu/kenlm) toolkit and used it in the processor which increased our accuracy on online ASR.
|
|
|
|
|
|
|
|
|
36 |
|
37 |
```shell
|
38 |
pip install pyctcdecode
|
@@ -74,11 +78,18 @@ print(prediction[0])
|
|
74 |
|
75 |
*Result (WER)*:
|
76 |
|
77 |
-
|
|
78 |
|---|---|
|
79 |
| 3.4 | 8.6 |
|
80 |
|
81 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
82 |
## Citation
|
83 |
If you want to cite this model you can use this:
|
84 |
|
|
|
32 |
|
33 |
# Sharif-wav2vec2
|
34 |
|
35 |
+
This is the fine-tuned version of Sharif Wav2vec2 for Farsi. The base model was fine-tuned on 108 hours of Commonvoice's Farsi samples with a sampling rate equal to 16kHz. Afterward, we trained a 5gram using [kenlm](https://github.com/kpu/kenlm) toolkit and used it in the processor which increased our accuracy on online ASR.
|
36 |
+
|
37 |
+
## Usage
|
38 |
+
|
39 |
+
When using the model make sure that your speech input is sampled at 16Khz. Prior to the usage, you may need to install the below dependencies:
|
40 |
|
41 |
```shell
|
42 |
pip install pyctcdecode
|
|
|
78 |
|
79 |
*Result (WER)*:
|
80 |
|
81 |
+
| clean | other |
|
82 |
|---|---|
|
83 |
| 3.4 | 8.6 |
|
84 |
|
85 |
|
86 |
+
## Evaluation
|
87 |
+
|
88 |
+
For the evaluation use the code below:
|
89 |
+
```python
|
90 |
+
?
|
91 |
+
```
|
92 |
+
|
93 |
## Citation
|
94 |
If you want to cite this model you can use this:
|
95 |
|