Orange
/

Speaker-wavLM-pro

Model card Files Files and versions Community

ggmbr commited on 17 days ago

Commit

7d9c128

·

verified ·

1 Parent(s): 82152ec

Update README.md

Files changed (1) hide show

README.md +6 -1

README.md CHANGED Viewed

@@ -70,10 +70,15 @@ The fine tuning data used to produce this model (VoxCeleb, VCTK) are mostly in e
 # Publication
 Details about the method used to build this model have been published at Interspeech 2024 in the paper entitled
-[Disentangling prosody and timbre embeddings via voice conversion](https://www.isca-archive.org/interspeech_2024/gengembre24_interspeech.pdf).
 Please consider citing this paper if you use this model in your own research work.
 ### Citation
 Gengembre, N., Le Blouch, O., Gendrot, C. (2024) Disentangling prosody and timbre embeddings via voice conversion. Proc. Interspeech 2024, 2765-2769, doi: 10.21437/Interspeech.2024-207

 # Publication
 Details about the method used to build this model have been published at Interspeech 2024 in the paper entitled
+[Disentangling prosody and timbre embeddings via voice conversion](https://www.isca-archive.org/interspeech_2024/gengembre24_interspeech.pdf).
 Please consider citing this paper if you use this model in your own research work.
+In this paper the model is denoted as W-PRO. The other two models used in this study can also be found on HuggingFace :
+- [W-TBR](https://huggingface.co/Orange/Speaker-wavLM-tbr) for timber related embeddings
+- [W-SPK](https://huggingface.co/Orange/Speaker-wavLM-id) for speaker embeddings (ASV)
 ### Citation
 Gengembre, N., Le Blouch, O., Gendrot, C. (2024) Disentangling prosody and timbre embeddings via voice conversion. Proc. Interspeech 2024, 2765-2769, doi: 10.21437/Interspeech.2024-207