iljoo's picture
Update README.md
e66be8a verified
---
license: apache-2.0
datasets:
- iljoo/chatbot_dataset1
base_model:
- MLP-KTLim/llama-3-Korean-Bllossom-8B
---
### Base model
MLP-KTLim/llama-3-Korean-Bllossom-8B
### Dataset
1) AIHub : ํ•œ๊ตญ์–ด ์„ฑ๋Šฅ์ด ๊ฐœ์„ ๋œ ์ดˆ๊ฑฐ๋Œ€ AI ์–ธ์–ด๋ชจ๋ธ ๊ฐœ๋ฐœ ๋ฐ ๋ฐ์ดํ„ฐ
- ๋ฐ์ดํ„ฐ ์†Œ๊ฐœ : ํ•œ๊ตญ์–ด ๋ง๋ญ‰์น˜ ๋ฐ์ดํ„ฐ 20์–ต ์–ด์ ˆ/310๋งŒ ๊ฑด๊ณผ RLHF ๋ฐ์ดํ„ฐ 7๋งŒ 7์ฒœ๊ฑด์œผ๋กœ ๊ตฌ์„ฑ ๋จ
- RLHF ๋ฐ์ดํ„ฐ > SFT ๋ฐ์ดํ„ฐ 10,579๊ฐœ ๋ฐ์ดํ„ฐ ํ™œ์šฉ
2) ์žฌ์ •์ •๋ณด AI ๊ฒ€์ƒ‰ ์•Œ๊ณ ๋ฆฌ์ฆ˜ ๊ฒฝ์ง„๋Œ€ํšŒ ๋ฐ์ดํ„ฐ
- ๋ฐ์ดํ„ฐ ์†Œ๊ฐœ : ์ค‘์•™์ •๋ถ€ ์žฌ์ • ์ •๋ณด ๊ด€๋ จ ์งˆ์˜ ์‘๋‹ต ๋ฐ์ดํ„ฐ
- train dataset : 498๊ฐœ ๋ฐ์ดํ„ฐ ํ™œ์šฉ
### Data structure
- Data format
- {instruction:, response:}
### Parameter
[2024.10.24]
- QLoRA : Rank 4
- max_length : 512
- Epoch : 10