|
--- |
|
datasets: |
|
- mesolitica/Malaysian-Emilia |
|
language: |
|
- ms |
|
- en |
|
base_model: |
|
- charactr/vocos-mel-24khz |
|
--- |
|
# Malaysian Vocos |
|
|
|
Continue pretraining [charactr/vocos-mel-24khz](https://huggingface.co/charactr/vocos-mel-24khz) on [Malaysian Emilia](https://huggingface.co/datasets/mesolitica/Malaysian-Emilia) to make it more crispy for Malaysian context! |
|
|
|
## Installation |
|
|
|
To use Vocos only in inference mode, install it using: |
|
|
|
```bash |
|
pip install vocos |
|
``` |
|
|
|
## Usage |
|
|
|
### Reconstruct audio from mel-spectrogram |
|
|
|
```python |
|
import torch |
|
|
|
from vocos import Vocos |
|
|
|
vocos = Vocos.from_pretrained("mesolitica/malaysian-vocos-mel-24khz") |
|
|
|
mel = torch.randn(1, 100, 256) # B, C, T |
|
audio = vocos.decode(mel) |
|
``` |