File size: 633 Bytes
647ab02
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
---
license: mit
---

This is a DVAE filetune for xttsv2, based on the scripts presented here. 
https://github.com/daswer123/xtts-finetune-tests/tree/main/dvae-finetune

Trained on 100h of Russian high quality speech, potentially should improve finetune quality of GPT-2 and Perceiver models.

You can try to use it in xtts-finetune-webui as a custom DVAE

```
wandb: Run summary:
wandb: commit_loss 0.04019
wandb:    cur_step 2571
wandb:       epoch 19
wandb:        loss 0.10499
wandb:  recon_loss 0.06481
```

![image/png](https://cdn-uploads.huggingface.co/production/uploads/65a26decbfaec7e7cad01948/ViV5c14i5qLr-v9p__LTy.png)