|
--- |
|
license: mit |
|
--- |
|
|
|
This is a DVAE filetune for xttsv2, based on the scripts presented here. |
|
https://github.com/daswer123/xtts-finetune-tests/tree/main/dvae-finetune |
|
|
|
Trained on 100h of Russian high quality speech, potentially should improve finetune quality of GPT-2 and Perceiver models. |
|
|
|
You can try to use it in xtts-finetune-webui as a custom DVAE |
|
|
|
``` |
|
wandb: Run summary: |
|
wandb: commit_loss 0.04019 |
|
wandb: cur_step 2571 |
|
wandb: epoch 19 |
|
wandb: loss 0.10499 |
|
wandb: recon_loss 0.06481 |
|
``` |
|
|
|
![image/png](https://cdn-uploads.huggingface.co/production/uploads/65a26decbfaec7e7cad01948/ViV5c14i5qLr-v9p__LTy.png) |
|
|
|
|