SuperkingbasSKB
commited on
Commit
•
615caba
1
Parent(s):
ba742be
Update README.md
Browse files
README.md
CHANGED
@@ -25,21 +25,15 @@ constrained generation, and reasoning tasks.is a Thai 🇹🇠& China 🇨🇳
|
|
25 |
|
26 |
For release notes, please see our [blog](https://medium.com/@superkingbasskb/openthaillm-prebuilt-release-f1b0e22be6a5).
|
27 |
|
28 |
-
|
29 |
|
|
|
30 |
The code of Qwen2.5 has been in the latest Hugging face `transformers` and we advise you to use the latest version of `transformers`.
|
31 |
|
32 |
With `transformers<4.37.0`, you will encounter the following error:
|
33 |
```
|
34 |
KeyError: 'qwen2'
|
35 |
-
|
36 |
-
## Training details
|
37 |
-
|
38 |
-
We pretrained the models with a large amount of data, and we post-trained the models with both supervised finetuning and direct preference optimization.
|
39 |
-
|
40 |
-
## Implementation
|
41 |
-
|
42 |
-
Here provides a code snippet with `apply_chat_template` to show you how to load the tokenizer and model and how to generate contents.
|
43 |
|
44 |
## Evaluation Performance
|
45 |
| Model | ONET | IC | TGAT | TPAT-1 | A-Level | Average (ThaiExam) | M3Exam (1 shot) | MMLU |
|
|
|
25 |
|
26 |
For release notes, please see our [blog](https://medium.com/@superkingbasskb/openthaillm-prebuilt-release-f1b0e22be6a5).
|
27 |
|
28 |
+
**We do not recommend using base language models for conversations.** Instead, you can apply post-training, e.g., SFT, RLHF, continued pretraining, etc., on this model.
|
29 |
|
30 |
+
## Requirements
|
31 |
The code of Qwen2.5 has been in the latest Hugging face `transformers` and we advise you to use the latest version of `transformers`.
|
32 |
|
33 |
With `transformers<4.37.0`, you will encounter the following error:
|
34 |
```
|
35 |
KeyError: 'qwen2'
|
36 |
+
```
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
37 |
|
38 |
## Evaluation Performance
|
39 |
| Model | ONET | IC | TGAT | TPAT-1 | A-Level | Average (ThaiExam) | M3Exam (1 shot) | MMLU |
|