kadirnar commited on
Commit
7557714
·
verified ·
1 Parent(s): 7f1a995

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +67 -0
README.md ADDED
@@ -0,0 +1,67 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # MeloTTS Model Checkpoint
2
+
3
+ This repository contains trained model checkpoints for MeloTTS, a high-quality multi-lingual text-to-speech system. These checkpoints are part of a trained model that can be used for text-to-speech synthesis.
4
+
5
+ ## Model Details
6
+
7
+ - **Model Type**: MeloTTS
8
+ - **Language Support**: English (Default)
9
+ - **Sampling Rate**: 44.1kHz
10
+ - **Mel Channels**: 128
11
+ - **Hidden Channels**: 192
12
+ - **Filter Channels**: 768
13
+
14
+ ### Architecture Details
15
+ - Inter channels: 192
16
+ - Number of heads: 2
17
+ - Number of layers: 6
18
+ - Flow layers: 3
19
+ - Kernel size: 3
20
+ - Dropout rate: 0.1
21
+
22
+ ## Training Dataset
23
+
24
+ This model was trained on the [Jenny TTS Dataset](https://huggingface.co/datasets/reach-vb/jenny_tts_dataset), which is available on Hugging Face. The dataset consists of high-quality English speech recordings suitable for text-to-speech training.
25
+
26
+ ## Model Files
27
+
28
+ The repository contains several checkpoint files:
29
+ - `DUR_*.pth`: Duration predictor checkpoints
30
+ - `G_*.pth`: Generator model checkpoints
31
+ - `D_*.pth`: Discriminator model checkpoints
32
+ - `config.json`: Model configuration file
33
+
34
+ ## Usage
35
+
36
+ To use this model with MeloTTS:
37
+
38
+ ```python
39
+ from melo.api import TTS
40
+
41
+ # Initialize TTS with the model path
42
+ tts = TTS(model_path="kadirnar/melotts-model")
43
+
44
+ # Generate speech
45
+ tts.tts_to_file(
46
+ text="Your text here",
47
+ speaker="EN-default",
48
+ language="EN",
49
+ output_path="output.wav"
50
+ )
51
+ ```
52
+
53
+ ## Training Details
54
+
55
+ The model was trained with the following specifications:
56
+ - Batch size: 6
57
+ - Learning rate: 0.0003
58
+ - Beta values: [0.8, 0.99]
59
+ - Segment size: 16384
60
+
61
+ ## Original Repository
62
+
63
+ This model is based on [MeloTTS](https://github.com/myshell-ai/MeloTTS) by MyShell.ai. Visit the original repository for more details about the architecture and implementation.
64
+
65
+ ## License
66
+
67
+ This model follows the same licensing as the original MeloTTS repository (MIT License).