remove models and add 3k duration de
- README.md +8 -4
- duration_v2.safetensors +2 -2
- model_430000b.safetensors +0 -3
- model_90000b.safetensors +0 -3
README.md
CHANGED
@@ -10,13 +10,17 @@ tags:
 - tts
 - audio
 - german
 ---


-Inference with .
-```
-
-
 ```
 Github: https://github.com/SWivid/F5-TTS
 Paper: [F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching](https://huggingface.co/papers/2410.06885)

 - tts
 - audio
 - german
+- mlx
 ---

+Copied from https://huggingface.co/marduk-ra/F5-TTS-German, added trained duration model on emilia dataset

+Inference with https://github.com/lucasnewman/f5-tts-mlx
+```bash
+python -m f5_tts_mlx.generate --model "eamag/f5-tts-mlx-german" \
+--text "The quick brown fox jumped over the lazy dog." \
+--ref-audio /path/to/audio.wav \
+--ref-text "This is the caption for the reference audio."
 ```
 Github: https://github.com/SWivid/F5-TTS
 Paper: [F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching](https://huggingface.co/papers/2410.06885)
duration_v2.safetensors
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
-size

 version https://git-lfs.github.com/spec/v1
+oid sha256:16e7cbdeec894fa5430ad459da7efc64ee77964e04fa46738e13641319cce80a
+size 87633072
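The new pointer for duration_v2.safetensors records a SHA-256 digest and a byte size, so a downloaded copy can be checked against it. The following is a minimal sketch, assuming the file has already been fetched to a local path; the helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical and not part of git-lfs or this repository.

```python
import hashlib
from pathlib import Path

# Pointer text for duration_v2.safetensors as added in this commit.
POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:16e7cbdeec894fa5430ad459da7efc64ee77964e04fa46738e13641319cce80a
size 87633072
"""


def parse_lfs_pointer(text: str) -> dict:
    """Split each 'key value' line of a Git LFS pointer into a dict entry."""
    fields = {}
    for line in text.splitlines():
        key, _, value = line.strip().partition(" ")
        if key:
            fields[key] = value
    return fields


def matches_pointer(path, pointer_text: str) -> bool:
    """Return True if the file at `path` has the size and SHA-256 in the pointer."""
    fields = parse_lfs_pointer(pointer_text)
    algo, _, expected = fields["oid"].partition(":")
    if algo != "sha256":
        raise ValueError(f"unsupported oid algorithm: {algo!r}")
    file_path = Path(path)
    # Cheap check first: a size mismatch means hashing is unnecessary.
    if file_path.stat().st_size != int(fields["size"]):
        return False
    digest = hashlib.sha256()
    with file_path.open("rb") as fh:
        # Hash in 1 MiB chunks so large weight files are not loaded at once.
        for chunk in iter(lambda: fh.read(1 << 20), b""):
            digest.update(chunk)
    return digest.hexdigest() == expected
```

Calling `matches_pointer("duration_v2.safetensors", POINTER)` on a local download (the path here is an assumption) would then indicate whether the copy matches this commit's pointer.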
model_430000b.safetensors
DELETED
@@ -1,3 +0,0 @@
-version https://git-lfs.github.com/spec/v1
-oid sha256:ef497c10082f1e16aaf7af1fe540f4f676af187aaad662e907955ce581c1e53a
-size 1349621553
model_90000b.safetensors
DELETED
@@ -1,3 +0,0 @@
-version https://git-lfs.github.com/spec/v1
-oid sha256:bb1b6b1ca893849591918fd266b16fa4f3c4d1197da236b5498315406db829ad
-size 1349621553