Update README.md
README.md
@@ -32,27 +32,26 @@ https://github.com/OpenThaiGPT/openthaigpt1.5_api_examples
## Benchmark on [OpenThaiGPT Eval](https://huggingface.co/datasets/openthaigpt/openthaigpt_eval)
** Please take a look at ``openthaigpt/openthaigpt1.5-14b-instruct`` for this model's evaluation result.
-| **Exam names** | **
-
-| **01_a_level** |
-| **02_tgat** |
-| **03_tpat1** |
-| **04_investment_consult** | 76.00%
-| **05_facebook_beleble_th_200** |
-| **06_xcopa_th_200** |
-| **07_xnli2.0_th_200** |
-| **08_onet_m3_thai** |
-| **09_onet_m3_social** |
-| **10_onet_m3_math** |
-| **11_onet_m3_science** |
-| **12_onet_m3_english** |
-| **13_onet_m6_thai** |
-| **14_onet_m6_math** |
-| **15_onet_m6_social** |
-| **16_onet_m6_science** |
-| **17_onet_m6_english** |
-| **Micro Average** |
-
+| **Exam names** | **scb10x/llama-3-typhoon-v1.5x-70b-instruct** | **Qwen/Qwen2.5-14B-Instruct** | **openthaigpt/openthaigpt1.5-14b** | **openthaigpt/openthaigpt1.5-72b** |
+|--------------------------------|-----------------------------------------------|-------------------------------|------------------------------------|------------------------------------|
+| **01_a_level** | 59.17% | 61.67% | 65.00% | 76.67% |
+| **02_tgat** | 46.00% | 44.00% | 50.00% | 46.00% |
+| **03_tpat1** | 52.50% | 60.00% | 52.50% | 55.00% |
+| **04_investment_consult** | 60.00% | 76.00% | 72.00% | 72.00% |
+| **05_facebook_beleble_th_200** | 87.50% | 84.50% | 87.00% | 90.00% |
+| **06_xcopa_th_200** | 84.50% | 85.00% | 86.50% | 90.50% |
+| **07_xnli2.0_th_200** | 62.50% | 69.50% | 64.50% | 70.50% |
+| **08_onet_m3_thai** | 76.00% | 76.00% | 84.00% | 84.00% |
+| **09_onet_m3_social** | 95.00% | 90.00% | 90.00% | 95.00% |
+| **10_onet_m3_math** | 43.75% | 43.75% | 12.50% | 37.50% |
+| **11_onet_m3_science** | 53.85% | 50.00% | 53.85% | 73.08% |
+| **12_onet_m3_english** | 93.33% | 93.33% | 93.33% | 96.67% |
+| **13_onet_m6_thai** | 55.38% | 52.31% | 56.92% | 56.92% |
+| **14_onet_m6_math** | 41.18% | 23.53% | 41.18% | 41.18% |
+| **15_onet_m6_social** | 67.27% | 60.00% | 61.82% | 65.45% |
+| **16_onet_m6_science** | 50.00% | 50.00% | 57.14% | 67.86% |
+| **17_onet_m6_english** | 73.08% | 82.69% | 78.85% | 90.38% |
+| **Micro Average** | 69.97% | 71.00% | <b style="color:blue">71.51</b> | 76.73% |

Thai language multiple choice exams, Test on unseen test set, Zero-shot learning. Benchmark source code and exams information: https://github.com/OpenThaiGPT/openthaigpt_eval
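
The README does not say how the "Micro Average" row is computed. Under the usual definition, a micro average pools every question across all exams (total correct divided by total asked), so larger exams weigh more than smaller ones, while a macro average is the plain mean of the per-exam percentages. The sketch below illustrates that difference only; the function names and the `(correct, total)` counts are hypothetical and are not taken from the benchmark or from the `openthaigpt_eval` code.

```python
from typing import Dict, Tuple

# Maps an exam name to (number answered correctly, number of questions).
Results = Dict[str, Tuple[int, int]]


def micro_average(results: Results) -> float:
    """Pool all questions: total correct / total asked, as a percentage."""
    correct = sum(c for c, _ in results.values())
    total = sum(n for _, n in results.values())
    if total == 0:
        raise ValueError("no questions to average over")
    return 100.0 * correct / total


def macro_average(results: Results) -> float:
    """Unweighted mean of the per-exam accuracies, as a percentage."""
    if not results:
        raise ValueError("no exams to average over")
    return sum(100.0 * c / n for c, n in results.values()) / len(results)


# Hypothetical counts for illustration only (NOT benchmark data).
example: Results = {
    "small_exam": (7, 16),
    "large_exam": (180, 200),
}

print(f"micro: {micro_average(example):.2f}%")
print(f"macro: {macro_average(example):.2f}%")
```

With these made-up counts the micro average sits close to the large exam's score, whereas the macro average lands midway between the two, which is why the two figures should not be compared directly.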