MUSTAR committed on
Commit
16a9f96
1 Parent(s): ae07a18

Create README.md

  1. README.md +44 -0
README.md ADDED
@@ -0,0 +1,44 @@
+ ### The dataset contains roughly 2000 hours of speech and vocals
+ ### Supported languages (English or Spanish?), whoever moves first:
+
+ ~800 hrs of English (with a vast variety of speakers and emotions)
+
+ ~200 hrs of Spanish
+
+ ~42 hrs of French
+
+ ~188 hrs of Russian
+
+ ~70 hrs of Arabic
+
+ ~140 hrs of Japanese
+
+ ~70 hrs of Chinese (Mandarin)
+
+ ~80 hrs of Korean
+
+ ~30 hrs of Hindi
+
+ ~53 hrs of Indonesian
+
+ ~30 hrs of Tagalog
+
+ ~40 hrs of Portuguese
+
+ ~35 hrs of German
+
+ ~190 hrs of singing (all languages)
+
+ common language (I don't remember how much data there was)
+
+ ## Type: big base model for fine-tuning
+ Batch: 2-40-80
+ ## Sampling frequency: 32 kHz, 40 kHz
+ Total step count: 371406
+ ## Hardware used:
+ 1 x H100, 4 x L40s
+
+ Expected release date: 22 July
+
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/65041c19e88eb2d0d521d46c/NfsOJxAzRbllBDCDjFC5e.png)
+
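
The per-language figures in the README can be tallied to check them against the "roughly 2000 hours" headline. The snippet below is only a sketch of that arithmetic: the dictionary name `hours_by_category` is an illustrative assumption, the values are the approximate hours copied from the list, and the "common language" entry is left out because the README gives no figure for it.

```python
# Approximate hours per category, copied from the README list.
# `hours_by_category` is an illustrative name, not part of the project.
hours_by_category = {
    "English": 800,
    "Spanish": 200,
    "French": 42,
    "Russian": 188,
    "Arabic": 70,
    "Japanese": 140,
    "Chinese (Mandarin)": 70,
    "Korean": 80,
    "Hindi": 30,
    "Indonesian": 53,
    "Tagalog": 30,
    "Portuguese": 40,
    "German": 35,
    "Singing (all languages)": 190,
}

# Sum of the listed categories (the unquantified "common language" data is excluded).
total_hours = sum(hours_by_category.values())
print(f"Listed total: ~{total_hours} hours")

# Show each category's share of the listed total, largest first.
for name, hours in sorted(hours_by_category.items(), key=lambda item: item[1], reverse=True):
    print(f"{name:<24} {hours:>4} h  {hours / total_hours:6.1%}")
```

The listed categories add up to about 1968 hours, which is consistent with the stated total once the unquantified "common language" data is taken into account.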