Update README.md
README.md CHANGED
@@ -64,6 +64,12 @@ All notebooks are **beginner friendly**! Add your dataset, click "Run All", and
- \* Kaggle has 2x T4s, but we use 1. Due to overhead, 1x T4 is 5x faster.

## Model Summary
+
+### Unsloth bug fixes:
+1. Padding and EOS tokens are the same - fixed this
+2. Chat template had extra EOS token - removed this
+3. EOS token should be <|end|> not <|endoftext|>
+4. Changed unk_token to �

Phi-4-mini-instruct is a lightweight open model built upon synthetic data and filtered publicly available websites - with a focus on high-quality, reasoning dense data. The model belongs to the Phi-4 model family and supports 128K token context length. The model underwent an enhancement process, incorporating both supervised fine-tuning and direct preference optimization to support precise instruction adherence and robust safety measures.
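A minimal sketch of how the four tokenizer fixes above could be verified after download, assuming the fixed tokenizer is published under the repo id `unsloth/Phi-4-mini-instruct` (repo id assumed here for illustration) and loaded with Hugging Face `transformers`:

```python
# Sketch: sanity-check the tokenizer fixes described in the commit.
# Repo id and expected token values are assumptions based on the list above.
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("unsloth/Phi-4-mini-instruct")

# Fix 1: padding token should no longer equal the EOS token
print("pad:", tokenizer.pad_token, "| eos:", tokenizer.eos_token)
assert tokenizer.pad_token != tokenizer.eos_token

# Fix 3: EOS token should be <|end|>, not <|endoftext|>
assert tokenizer.eos_token == "<|end|>"

# Fix 4: unk_token should be the replacement character �
assert tokenizer.unk_token == "\ufffd"

# Fix 2: render the chat template and check no extra EOS is appended
rendered = tokenizer.apply_chat_template(
    [{"role": "user", "content": "Hello"}],
    tokenize=False,
    add_generation_prompt=True,
)
print(rendered)
```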