Tags: Automatic Speech Recognition · Transformers · PyTorch · distilwhisper · text2text-generation
mzboito committed · Commit ac05f38 · 1 Parent(s): 44a9df5

Update README.md

Files changed (1): README.md +30 -0
README.md CHANGED
@@ -1,3 +1,33 @@
 ---
 license: mit
+datasets:
+- mozilla-foundation/common_voice_13_0
+language:
+- ca
+- cs
+- gl
+- hu
+- pl
+- ta
+- th
+- uk
 ---
+
+## About
+
+Multilingual DistilWhisper improves ASR performance in target languages by adding lightweight CLSR modules on top of whisper-small.
+These modules are trained on a mix of cross-entropy (ASR) and knowledge-distillation losses, with whisper-large-v2 used as the teacher.
+
+## Inference
+
+A loader will be made available soon at https://github.com/naver
+
+## Citation (submitted to ICASSP 2024)
+```
+@article{ferraz2023distilwhisper,
+  title={DistilWhisper: Efficient Distillation of Multi-task Speech Models via Language-Specific Experts},
+  author={Ferraz, Thomas Palmeira and Boito, Marcely Zanon and Brun, Caroline and Nikoulina, Vassilina},
+  journal={arXiv preprint arXiv:2311.01070},
+  year={2023}
+}
+```
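The About section in the README above describes training the CLSR modules on a mix of a cross-entropy (ASR) loss and a knowledge-distillation loss against whisper-large-v2. A minimal PyTorch sketch of such a combined objective is shown below; the function name, the mixing weight `alpha`, and the temperature `T` are illustrative assumptions, not the repository's actual implementation:

```python
# Hypothetical sketch of a combined CE + knowledge-distillation objective,
# as described in the model card: cross-entropy against the reference
# transcript, plus a KL term pulling the student's token distribution
# toward the teacher's. `alpha` and `T` are assumed hyperparameters.
import torch
import torch.nn.functional as F

def distil_loss(student_logits, teacher_logits, targets, alpha=0.5, T=2.0):
    # Standard ASR cross-entropy against the reference transcript tokens.
    ce = F.cross_entropy(student_logits, targets)
    # KL divergence between temperature-softened student and teacher
    # distributions; the T*T factor keeps gradient magnitudes comparable.
    kd = F.kl_div(
        F.log_softmax(student_logits / T, dim=-1),
        F.softmax(teacher_logits / T, dim=-1),
        reduction="batchmean",
    ) * (T * T)
    return alpha * ce + (1.0 - alpha) * kd

# Toy example: a batch of 4 decoding steps over a vocabulary of 10 tokens.
student = torch.randn(4, 10)
teacher = torch.randn(4, 10)
targets = torch.randint(0, 10, (4,))
loss = distil_loss(student, teacher, targets)
```

In this formulation only the student (the lightweight modules) receives gradients; the teacher's logits are treated as fixed targets.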