HKUST-Audio
commited on
Update README.md
Browse files
README.md
CHANGED
@@ -3,6 +3,13 @@ license: cc-by-4.0
|
|
3 |
tags:
|
4 |
- Speech tokenizer
|
5 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
6 |
# Getting Started with XCodec2 on Hugging Face
|
7 |
XCodec2 is a speech tokenizer that offers the following key features:
|
8 |
|
@@ -47,4 +54,6 @@ with torch.no_grad():
|
|
47 |
|
48 |
sf.write("reconstructed.wav", recon_wav[0, 0, :].numpy(), sr)
|
49 |
print("Done! Check reconstructed.wav")
|
50 |
-
```
|
|
|
|
|
|
3 |
tags:
|
4 |
- Speech tokenizer
|
5 |
---
|
6 |
+
|
7 |
+
|
8 |
+
## Paper
|
9 |
+
LLaSA: Scaling Train Time and Test Time Compute for LLaMA based Speech Synthesis (Comming soon)
|
10 |
+
|
11 |
+
|
12 |
+
|
13 |
# Getting Started with XCodec2 on Hugging Face
|
14 |
XCodec2 is a speech tokenizer that offers the following key features:
|
15 |
|
|
|
54 |
|
55 |
sf.write("reconstructed.wav", recon_wav[0, 0, :].numpy(), sr)
|
56 |
print("Done! Check reconstructed.wav")
|
57 |
+
```
|
58 |
+
|
59 |
+
If you want to train your own xcodec2 or require large-scale code extraction, the code is released [here](https://github.com/zhenye234/X-Codec-2.0).
|