---
license: apache-2.0
---

dpo-phi2 is an instruction-tuned model derived from microsoft/phi-2. It was fine-tuned with direct preference optimization (DPO) on the argilla/distilabel-intel-orca-dpo-pairs dataset.
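
To illustrate what DPO optimizes, here is a minimal sketch of the standard DPO loss for a single preference pair. It is not the training code used for this model: the function name `dpo_loss`, the example log-probabilities, and the value `beta=0.1` are assumptions chosen for illustration, since the card does not state the training hyperparameters.

```python
import math


def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """Standard DPO loss for one (chosen, rejected) pair.

    Each argument is the summed log-probability of a full response under
    either the policy being trained or the frozen reference model.
    beta=0.1 is an illustrative default, not a value taken from this card.
    """
    # How much more the policy favors each response than the reference does.
    chosen_logratio = policy_chosen_logp - ref_chosen_logp
    rejected_logratio = policy_rejected_logp - ref_rejected_logp

    # Scaled preference margin; positive when the policy prefers "chosen".
    x = beta * (chosen_logratio - rejected_logratio)

    # -log(sigmoid(x)), written in a numerically stable form.
    if x >= 0:
        return math.log1p(math.exp(-x))
    return -x + math.log1p(math.exp(x))


if __name__ == "__main__":
    # Hypothetical log-probabilities: the policy has shifted toward "chosen".
    print(dpo_loss(-8.0, -12.0, -10.0, -10.0))
```

The loss equals `log(2)` when the policy and the reference model agree, and it falls below that as the policy assigns relatively more probability to the chosen response than the reference does.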