RyanYr/self-correct_Llama-3.2-3B-Instruct_metaMathQA_dpo_iter5 • Text Generation • Updated 26 days ago • 555
RyanYr/self-correct_Llama-3.2-3B-Instruct_metaMathQA_dpo_iter4_OpenMathIt2_iter1 • Text Generation • Updated 24 days ago • 28
RyanYr/self-correct_Llama-3.2-3B-Instruct_metaMathQA_dpo_iter4_metaMathQA_dpo_iter5 • Text Generation • Updated 24 days ago • 30
RyanYr/self-correct_Llama-3.2-3B-Instruct_metaMathQA_dpo_iter5_lr1e-7 • Text Generation • Updated 24 days ago • 33
RyanYr/self-correct_Llama-3.2-3B-Instruct_metaMathQA_dpo_iter5_lr3e-7 • Text Generation • Updated 23 days ago • 34