Update README.md
Browse files
README.md
CHANGED
@@ -9,7 +9,7 @@ license: llama3
|
|
9 |
|
10 |
**Llama-3-Typhoon-1.5X-8B-instruct** is an 8 billion parameter instruct model designed for Thai 🇹🇠language. It demonstrates competitive performance with GPT-3.5-turbo, and is optimized for **production** environments, **Retrieval-Augmented Generation (RAG), constrained generation**, and **reasoning** tasks.
|
11 |
|
12 |
-
Built on Typhoon 1.5 8B and Llama 3 8B Instruct. This model is a result of our experiment on cross-lingual transfer
|
13 |
|
14 |
Remark: To acknowledge Meta's efforts in creating the foundation model and comply with the license, we explicitly include "llama-3" in the model name.
|
15 |
|
|
|
9 |
|
10 |
**Llama-3-Typhoon-1.5X-8B-instruct** is an 8 billion parameter instruct model designed for Thai 🇹🇠language. It demonstrates competitive performance with GPT-3.5-turbo, and is optimized for **production** environments, **Retrieval-Augmented Generation (RAG), constrained generation**, and **reasoning** tasks.
|
11 |
|
12 |
+
Built on Typhoon 1.5 8B and Llama 3 8B Instruct. This model is a result of our experiment on **cross-lingual transfer**. It utilizes the [task-arithmetic model editing](https://arxiv.org/abs/2212.04089) technique, combining the Thai understanding capability of Typhoon with the human alignment performance of Llama 3 Instruct.
|
13 |
|
14 |
Remark: To acknowledge Meta's efforts in creating the foundation model and comply with the license, we explicitly include "llama-3" in the model name.
|
15 |
|