Update README.md
Browse files
README.md
CHANGED
@@ -11,7 +11,9 @@ widget:
|
|
11 |
|
12 |
## Model description
|
13 |
|
14 |
-
The model is
|
|
|
|
|
15 |
|
16 |
## How to use
|
17 |
|
@@ -76,4 +78,12 @@ python3 scripts/convert_gpt2_from_uer_to_huggingface.py --input_model_path lyric
|
|
76 |
pages={241},
|
77 |
year={2019}
|
78 |
}
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
79 |
```
|
|
|
11 |
|
12 |
## Model description
|
13 |
|
14 |
+
The model is pre-trained by [UER-py](https://github.com/dbiir/UER-py/), which is introduced in [this paper](https://arxiv.org/abs/1909.05658). Besides, the model could also be pre-trained by [TencentPretrain](https://github.com/Tencent/TencentPretrain) introduced in [this paper](https://arxiv.org/abs/2212.06385), which inherits UER-py to support models with parameters above one billion, and extends it to a multimodal pre-training framework.
|
15 |
+
|
16 |
+
The model is used to generate Chinese lyrics. You can download the model from the [UER-py Modelzoo page](https://github.com/dbiir/UER-py/wiki/Modelzoo), or [GPT2-Chinese Github page](https://github.com/Morizeyao/GPT2-Chinese), or via HuggingFace from the link [gpt2-chinese-lyric](https://huggingface.co/uer/gpt2-chinese-lyric)
|
17 |
|
18 |
## How to use
|
19 |
|
|
|
78 |
pages={241},
|
79 |
year={2019}
|
80 |
}
|
81 |
+
|
82 |
+
@article{zhao2023tencentpretrain,
|
83 |
+
title={TencentPretrain: A Scalable and Flexible Toolkit for Pre-training Models of Different Modalities},
|
84 |
+
author={Zhao, Zhe and Li, Yudong and Hou, Cheng and Zhao, Jing and others},
|
85 |
+
journal={ACL 2023},
|
86 |
+
pages={217},
|
87 |
+
year={2023}
|
88 |
+
}
|
89 |
```
|