eamag commited on
Commit
f0aa260
·
1 Parent(s): 446437f

remove models and add 3k duration de

Browse files
README.md CHANGED
@@ -10,13 +10,17 @@ tags:
10
  - tts
11
  - audio
12
  - german
 
13
  ---
14
 
 
15
 
16
- Inference with .safetensors option
17
- ```
18
- F5TTS_Base/
19
- f5_tts_german_1010000.safetensors
 
 
20
  ```
21
  Github: https://github.com/SWivid/F5-TTS
22
  Paper: [F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching](https://huggingface.co/papers/2410.06885)
 
10
  - tts
11
  - audio
12
  - german
13
+ - mlx
14
  ---
15
 
16
+ Copied from https://huggingface.co/marduk-ra/F5-TTS-German, added trained duration model on emilia dataset
17
 
18
+ Inference with https://github.com/lucasnewman/f5-tts-mlx
19
+ ```bash
20
+ python -m f5_tts_mlx.generate --model "eamag/f5-tts-mlx-german" \
21
+ --text "The quick brown fox jumped over the lazy dog." \
22
+ --ref-audio /path/to/audio.wav \
23
+ --ref-text "This is the caption for the reference audio."
24
  ```
25
  Github: https://github.com/SWivid/F5-TTS
26
  Paper: [F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching](https://huggingface.co/papers/2410.06885)
duration_v2.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:25c4701cf765a9175d4fac624b39baab9714cffb41a73465438326952a23348e
3
- size 86238439
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:16e7cbdeec894fa5430ad459da7efc64ee77964e04fa46738e13641319cce80a
3
+ size 87633072
model_430000b.safetensors DELETED
@@ -1,3 +0,0 @@
1
- version https://git-lfs.github.com/spec/v1
2
- oid sha256:ef497c10082f1e16aaf7af1fe540f4f676af187aaad662e907955ce581c1e53a
3
- size 1349621553
 
 
 
 
model_90000b.safetensors DELETED
@@ -1,3 +0,0 @@
1
- version https://git-lfs.github.com/spec/v1
2
- oid sha256:bb1b6b1ca893849591918fd266b16fa4f3c4d1197da236b5498315406db829ad
3
- size 1349621553