Update README.md
Browse files
README.md
CHANGED
@@ -19,7 +19,7 @@ Github:https://github.com/CrazyBoyM/llama3-Chinese-chat
|
|
19 |
放出训练配方细节供网友参考分享:
|
20 |
DPO(beta 0.5) + lora rank128, alpha256 + 打开"lm_head", "input_layernorm", "post_attention_layernorm", "norm"层训练.
|
21 |
特点:偏好中文和emoji表情,且不损伤原instruct版模型能力。实测中文DPO版问答性能体验超过现在市面上任何llama3中文微调版 (微调会破坏llama3原版能力,导致遗忘)
|
22 |
-

|
23 |
|
24 |
### 模型部署
|
25 |
网页脚本文件:https://github.com/CrazyBoyM/llama3-Chinese-chat/blob/main/deploy/web_streamlit_for_instruct_v2.py
|
|
|
19 |
放出训练配方细节供网友参考分享:
|
20 |
DPO(beta 0.5) + lora rank128, alpha256 + 打开"lm_head", "input_layernorm", "post_attention_layernorm", "norm"层训练.
|
21 |
特点:偏好中文和emoji表情,且不损伤原instruct版模型能力。实测中文DPO版问答性能体验超过现在市面上任何llama3中文微调版 (微调会破坏llama3原版能力,导致遗忘)
|
22 |
+

|
23 |
|
24 |
### 模型部署
|
25 |
网页脚本文件:https://github.com/CrazyBoyM/llama3-Chinese-chat/blob/main/deploy/web_streamlit_for_instruct_v2.py
|