Update benchmark scores with Llama-2-Ko-7b 40B (20k)
README.md CHANGED

@@ -75,7 +75,7 @@ TBD
 
 ### COPA (F1)
 
-<img src=https://user-images.githubusercontent.com/11323660/
+<img src=https://user-images.githubusercontent.com/11323660/255575809-c037bc6e-0566-436a-a6c1-2329ac92187a.png style="max-width: 700px; width: 100%" />
 
 | Model | 0-shot | 5-shot | 10-shot | 50-shot |
 | --- | --- | --- | --- | --- |
@@ -88,12 +88,13 @@ TBD
 | https://huggingface.co/EleutherAI/polyglot-ko-12.8b | 0.7937 | 0.8108 | 0.8037 | 0.8369 |
 | Llama-2 Original 7B* | 0.562033 | 0.575982 | 0.576216 | 0.595532 |
 | Llama-2-Ko-7b 20B (10k) | 0.738780 | 0.762639 | 0.780761 | 0.797863 |
+| Llama-2-Ko-7b 40B (20k) | 0.743630 | 0.792716 | 0.803746 | 0.825944 |
 
 *Llama-2 Original 7B used https://huggingface.co/meta-llama/Llama-2-7b-hf (No tokenizer updated)
 
 ### HellaSwag (F1)
 
-<img src=https://user-images.githubusercontent.com/11323660/
+<img src=https://user-images.githubusercontent.com/11323660/255576090-a2bfc1ae-d117-44b7-9f7b-262e41179ec1.png style="max-width: 700px; width: 100%" />
 
 | Model | 0-shot | 5-shot | 10-shot | 50-shot |
 | --- | --- | --- | --- | --- |
@@ -106,12 +107,13 @@ TBD
 | https://huggingface.co/EleutherAI/polyglot-ko-12.8b | 0.5954 | 0.6306 | 0.6098 | 0.6118 |
 | Llama-2 Original 7B* | 0.415390 | 0.431382 | 0.421342 | 0.442003 |
 | Llama-2-Ko-7b 20B (10k) | 0.451757 | 0.466751 | 0.472607 | 0.482776 |
+| Llama-2-Ko-7b 40B (20k) | 0.456246 | 0.465665 | 0.469810 | 0.477374 |
 
 *Llama-2 Original 7B used https://huggingface.co/meta-llama/Llama-2-7b-hf (No tokenizer updated)
 
 ### BoolQ (F1)
 
-<img src=https://user-images.githubusercontent.com/11323660/
+<img src=https://user-images.githubusercontent.com/11323660/255576343-5d847a6f-3b6a-41a7-af37-0f11940a5ea4.png style="max-width: 700px; width: 100%" />
 
 | Model | 0-shot | 5-shot | 10-shot | 50-shot |
 | --- | --- | --- | --- | --- |
@@ -124,12 +126,13 @@ TBD
 | https://huggingface.co/EleutherAI/polyglot-ko-12.8b | 0.4818 | 0.6041 | 0.6289 | 0.6448 |
 | Llama-2 Original 7B* | 0.352050 | 0.563238 | 0.474788 | 0.419222 |
 | Llama-2-Ko-7b 20B (10k) | 0.360656 | 0.679743 | 0.680109 | 0.662152 |
+| Llama-2-Ko-7b 40B (20k) | 0.578640 | 0.697747 | 0.708358 | 0.714423 |
 
 *Llama-2 Original 7B used https://huggingface.co/meta-llama/Llama-2-7b-hf (No tokenizer updated)
 
 ### SentiNeg (F1)
 
-<img src=https://user-images.githubusercontent.com/11323660/
+<img src=https://user-images.githubusercontent.com/11323660/255576572-b005a81d-fa4d-4709-b48a-f0fe4eed17a3.png style="max-width: 700px; width: 100%" />
 
 | Model | 0-shot | 5-shot | 10-shot | 50-shot |
 | --- | --- | --- | --- | --- |
@@ -142,7 +145,7 @@ TBD
 | https://huggingface.co/EleutherAI/polyglot-ko-12.8b | 0.9117 | 0.9015 | 0.9345 | 0.9723 |
 | Llama-2 Original 7B* | 0.347502 | 0.529124 | 0.480641 | 0.788457 |
 | Llama-2-Ko-7b 20B (10k) | 0.485546 | 0.829503 | 0.871141 | 0.851253 |
-
+| Llama-2-Ko-7b 40B (20k) | 0.459447 | 0.761079 | 0.727611 | 0.936988 |
 *Llama-2 Original 7B used https://huggingface.co/meta-llama/Llama-2-7b-hf (No tokenizer updated)
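The rows added in this commit can be compared against the previous checkpoint directly. A minimal sketch, assuming nothing beyond the F1 values in the tables above (copied verbatim; the per-benchmark mean-delta aggregation and all names here are our own, not part of the repository):

```python
# Compare Llama-2-Ko-7b checkpoints: 20B tokens (10k steps) vs 40B tokens (20k steps).
# F1 scores are copied verbatim from the benchmark tables in this commit.

SHOTS = ("0-shot", "5-shot", "10-shot", "50-shot")

scores = {
    "COPA": {
        "20B": (0.738780, 0.762639, 0.780761, 0.797863),
        "40B": (0.743630, 0.792716, 0.803746, 0.825944),
    },
    "HellaSwag": {
        "20B": (0.451757, 0.466751, 0.472607, 0.482776),
        "40B": (0.456246, 0.465665, 0.469810, 0.477374),
    },
    "BoolQ": {
        "20B": (0.360656, 0.679743, 0.680109, 0.662152),
        "40B": (0.578640, 0.697747, 0.708358, 0.714423),
    },
    "SentiNeg": {
        "20B": (0.485546, 0.829503, 0.871141, 0.851253),
        "40B": (0.459447, 0.761079, 0.727611, 0.936988),
    },
}

def mean_delta(bench: str) -> float:
    """Mean F1 change across shot settings, 40B checkpoint minus 20B checkpoint."""
    a = scores[bench]["20B"]
    b = scores[bench]["40B"]
    return sum(y - x for x, y in zip(a, b)) / len(SHOTS)

for bench in scores:
    print(f"{bench}: {mean_delta(bench):+.4f} mean F1 delta (40B vs 20B)")
```

On these numbers, the extra 20B tokens help COPA and BoolQ at every shot count, while HellaSwag stays roughly flat and SentiNeg is mixed (lower at 0/5/10-shot, higher at 50-shot).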