openthaigpt
/

openthaigpt1.5-14b-instruct

@@ -32,31 +32,31 @@ https://github.com/OpenThaiGPT/openthaigpt1.5_api_examples
 ## Benchmark on [OpenThaiGPT Eval](https://huggingface.co/datasets/openthaigpt/openthaigpt_eval)
 ** Please take a look at ``openthaigpt/openthaigpt1.5-7b-instruct`` for this model's evaluation result.
-| **Exam names**                 | **scb10x/llama-3-typhoon-v1.5x-8b-instruct** | **meta-llama/Llama-3.1-7B-Instruct** | **Qwen/Qwen2.5-7B-Instruct_stat** | **openthaigpt/openthaigpt1.5-7b** |
-|:------------------------------:|:--------------------------------------------:|:------------------------------------:|:---------------------------------:|:---------------------------------:|
-| **01_a_level**                 | 46.67%                                       | 47.50%                               | 58.33%                            | 60.00%                            |
-| **02_tgat**                    | 32.00%                                       | 36.00%                               | 32.00%                            | 36.00%                            |
-| **03_tpat1**                   | 52.50%                                       | 55.00%                               | 57.50%                            | 57.50%                            |
-| **04_investment_consult**      | 56.00%                                       | 48.00%                               | 68.00%                            | 76.00%                            |
-| **05_facebook_beleble_th_200** | 78.00%                                       | 73.00%                               | 79.00%                            | 81.00%                            |
-| **06_xcopa_th_200**            | 79.50%                                       | 69.00%                               | 80.50%                            | 81.00%                            |
-| **07_xnli2.0_th_200**          | 56.50%                                       | 55.00%                               | 53.00%                            | 54.50%                            |
-| **08_onet_m3_thai**            | 48.00%                                       | 32.00%                               | 72.00%                            | 64.00%                            |
-| **09_onet_m3_social**          | 75.00%                                       | 50.00%                               | 90.00%                            | 80.00%                            |
-| **10_onet_m3_math**            | 25.00%                                       | 18.75%                               | 31.25%                            | 31.25%                            |
-| **11_onet_m3_science**         | 46.15%                                       | 42.31%                               | 46.15%                            | 46.15%                            |
-| **12_onet_m3_english**         | 70.00%                                       | 76.67%                               | 86.67%                            | 83.33%                            |
-| **13_onet_m6_thai**            | 47.69%                                       | 29.23%                               | 46.15%                            | 53.85%                            |
-| **14_onet_m6_math**            | 29.41%                                       | 17.65%                               | 29.41%                            | 29.41%                            |
-| **15_onet_m6_social**          | 50.91%                                       | 43.64%                               | 56.36%                            | 58.18%                            |
-| **16_onet_m6_science**         | 42.86%                                       | 32.14%                               | 57.14%                            | 57.14%                            |
-| **17_onet_m6_english**         | 65.38%                                       | 71.15%                               | 78.85%                            | 80.77%                            |
-| **Micro Average**              | 60.65%                                       | 55.60%                               | 64.41%                            | <b style="color:blue">65.78%</b>                            |
 Thai language multiple choice exams, Test on unseen test set, Zero-shot learning. Benchmark source code and exams information: https://github.com/OpenThaiGPT/openthaigpt_eval
-(Updated on: 30 September 2024)
 ## Benchmark on [scb10x/thai_exam](https://huggingface.co/datasets/scb10x/thai_exam)
@@ -315,7 +315,7 @@ print(response)
 | **Number of Parameters** | **FP 16 bits** | **8 bits (Quantized)** | **4 bits (Quantized)** | **Example Graphic Card for 4 bits** |
 |------------------|----------------|------------------------|------------------------|---------------------------------------------|
 | **7b**           | 24 GB          | 12 GB                  | 6 GB                   | Nvidia RTX 4060 8GB                         |
-| **13b**          | 48 GB          | 24 GB                  | 12 GB                  | Nvidia RTX 4070 16GB                        |
 | **72b**          | 192 GB         | 96 GB                  | 48 GB                  | Nvidia RTX 4090 24GB x 2 cards              |
 ### Authors

 ## Benchmark on [OpenThaiGPT Eval](https://huggingface.co/datasets/openthaigpt/openthaigpt_eval)
 ** Please take a look at ``openthaigpt/openthaigpt1.5-7b-instruct`` for this model's evaluation result.
+| **Exam names**                 | **openthaigpt/openthaigpt1.5-7b** | **openthaigpt/openthaigpt1.5-14b** | **openthaigpt/openthaigpt1.5-72b** |
+|--------------------------------|-----------------------------------|------------------------------------|------------------------------------|
+| **01_a_level**                 | 60.00%                            | 65.00%                             | 76.67%                             |
+| **02_tgat**                    | 36.00%                            | 50.00%                             | 46.00%                             |
+| **03_tpat1**                   | 57.50%                            | 52.50%                             | 55.00%                             |
+| **04_investment_consult**      | 76.00%                            | 72.00%                             | 72.00%                             |
+| **05_facebook_beleble_th_200** | 81.00%                            | 87.00%                             | 90.00%                             |
+| **06_xcopa_th_200**            | 81.00%                            | 86.50%                             | 90.50%                             |
+| **07_xnli2.0_th_200**          | 54.50%                            | 64.50%                             | 70.50%                             |
+| **08_onet_m3_thai**            | 64.00%                            | 84.00%                             | 84.00%                             |
+| **09_onet_m3_social**          | 80.00%                            | 90.00%                             | 95.00%                             |
+| **10_onet_m3_math**            | 31.25%                            | 12.50%                             | 37.50%                             |
+| **11_onet_m3_science**         | 46.15%                            | 53.85%                             | 73.08%                             |
+| **12_onet_m3_english**         | 83.33%                            | 93.33%                             | 96.67%                             |
+| **13_onet_m6_thai**            | 53.85%                            | 56.92%                             | 56.92%                             |
+| **14_onet_m6_math**            | 29.41%                            | 41.18%                             | 41.18%                             |
+| **15_onet_m6_social**          | 58.18%                            | 61.82%                             | 65.45%                             |
+| **16_onet_m6_science**         | 57.14%                            | 57.14%                             | 67.86%                             |
+| **17_onet_m6_english**         | 80.77%                            | 78.85%                             | 90.38%                             |
+| **Micro Average**              | 65.78%                            | <b style="color:blue">71.51%</b>                             | 76.73%                             |
 Thai language multiple choice exams, Test on unseen test set, Zero-shot learning. Benchmark source code and exams information: https://github.com/OpenThaiGPT/openthaigpt_eval
+(Updated on: 13 October 2024)
 ## Benchmark on [scb10x/thai_exam](https://huggingface.co/datasets/scb10x/thai_exam)
 | **Number of Parameters** | **FP 16 bits** | **8 bits (Quantized)** | **4 bits (Quantized)** | **Example Graphic Card for 4 bits** |
 |------------------|----------------|------------------------|------------------------|---------------------------------------------|
 | **7b**           | 24 GB          | 12 GB                  | 6 GB                   | Nvidia RTX 4060 8GB                         |
+| **14b**          | 48 GB          | 24 GB                  | 12 GB                  | Nvidia RTX 4070 16GB                        |
 | **72b**          | 192 GB         | 96 GB                  | 48 GB                  | Nvidia RTX 4090 24GB x 2 cards              |
 ### Authors