alibabasglab commited on
Commit
c5200a2
·
verified ·
1 Parent(s): 5f3864b

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +4 -2
README.md CHANGED
@@ -3,7 +3,7 @@ license: apache-2.0
3
  ---
4
  # Introduction
5
 
6
- The MossFormer2_SR_48K model weights for 48 kHz speech super-resolution in [ClearerVoice-Studio](https://github.com/modelscope/ClearerVoice-Studio/tree/main) repo.
7
 
8
  This model is trained on large scale datasets inclduing open-sourced and private data.
9
 
@@ -64,4 +64,6 @@ myClearVoice(input_path='samples/scp/audio_samples_sr.scp', online_write=True, o
64
  ```
65
 
66
  Model Limitations: The current speech super-resolution model is trained on a clean speech dataset and is designed to work with clean speech inputs. For speech super-resolution on noisy speech audio,
67
- we recommend using our 'MossFormer2_SE_48K' model for speech enhancement first, followed by 'MossFormer2_SR_48K' for speech super-resolution.
 
 
 
3
  ---
4
  # Introduction
5
 
6
+ The MossFormer2_SR_48K model weights for 48 kHz speech super-resolution [1] provdied in [ClearerVoice-Studio](https://github.com/modelscope/ClearerVoice-Studio/tree/main) repo.
7
 
8
  This model is trained on large scale datasets inclduing open-sourced and private data.
9
 
 
64
  ```
65
 
66
  Model Limitations: The current speech super-resolution model is trained on a clean speech dataset and is designed to work with clean speech inputs. For speech super-resolution on noisy speech audio,
67
+ we recommend using our 'MossFormer2_SE_48K' model for speech enhancement first, followed by 'MossFormer2_SR_48K' for speech super-resolution.
68
+
69
+ [1] Shengkui Zhao, Kun Zhou, Zexu Pan, Yukun Ma, Chong Zhang, and Bin Ma, "HiFi-SR: A Unified Generative Transformer-Convolutional Adversarial Network for High-Fidelity Speech Super-Resolution", ICASSP 2025.