File size: 1,129 Bytes

f4bfc9e
 
 
bbb49f4
f4bfc9e
 
 
 
 
 
 
 
 
 
 
 
 
b24d4c3
f4bfc9e
 
 
ac47d48
f4bfc9e

---
library_name: transformers
license: apache-2.0
base_model: meta-llama/Meta-Llama-3-8B-Instruct
datasets:
- Saxo/total_ko_train_set_1_without_wiki_with_orca
language:
- ko
- en
- ja
- zh
pipeline_tag: text-generation
---

# Model Card for Model ID

<div align="center">
<img src="http://www.linkbricks.com/wp-content/uploads/2024/11/fulllogo.png" />
</div>


AI 와 빅데이터 분석 전문 기업인 Linkbricks의 데이터사이언티스트인 지윤성(Saxo) 이사가 meta-llama/Meta-Llama-3-8B를 베이스모델로 GCP상의 H100-80G 8개를 통해 SFT-DPO 훈련한 한글 기반 LLAMA3-8b 4개의 MoE(Mixture of Expert)모델.
토크나이저는 라마3랑 동일하며 한글 VOCA 확장은 하지 않은 버전 입니다.
일반질의응답(채팅)-의료-군사-코딩 특화 LLM을 통합

Dr. Yunsung Ji (Saxo), a data scientist at Linkbricks, a company specializing in AI and big data analytics, trained the meta-llama/Meta-Llama-3-8B base model on 8 H100-60Gs on GCP for 4 hours of instructional training (8000 Tokens).
Accelerate, Deepspeed Zero-3 libraries were used. 

www.linkbricks.com, www.linkbricks.vc