plaguss/Llama-3.1-8B-Instruct-FineTome-APO-zero-6epoch-rmsprop Text Generation • Updated 29 days ago • 25
plaguss/Llama-3.1-8B-Instruct-FineTome-APO-zero-12epoch-rmsprop-2048 Text Generation • Updated 28 days ago • 39
RyanYr/self-correct_Llama-3.2-3B-Instruct_metaMathQA_dpo_iter1 Text Generation • Updated 25 days ago • 328
RyanYr/self-correct_Llama-3.2-3B-Instruct_metaMathQA_dpo_iter2 Text Generation • Updated 27 days ago • 73
RyanYr/self-correct_Llama-3.2-3B-Instruct_metaMathQA_dpo_iter3 Text Generation • Updated 26 days ago • 90
RyanYr/self-correct_Llama-3.2-3B-Instruct_metaMathQA_dpo_iter2-only2nd-2e-7 Text Generation • Updated 26 days ago • 48
RyanYr/self-correct_Llama-3.2-3B-Instruct_metaMathQA_dpo_iter2-only2nd-4e-7 Text Generation • Updated 26 days ago • 32
RyanYr/self-correct_Llama-3.2-3B-Instruct_metaMathQA_dpo_iter2-only2nd-6e-7 Text Generation • Updated 26 days ago • 29
RyanYr/self-correct_Llama-3.1-8B-Instruct_metaMathQA_dpo_iter1 Text Generation • Updated 25 days ago • 26
RyanYr/self-correct_Llama-3.2-3B-Instruct_metaMathQA_dpo_iter4 Text Generation • Updated 24 days ago • 162
RyanYr/self-correct_Llama-3.2-3B-Instruct_metaMathQA_dpo_iter5 Text Generation • Updated 24 days ago • 534
RyanYr/self-correct_Llama-3.2-3B-Instruct_metaMathQA_dpo_iter6 Text Generation • Updated 23 days ago • 70
RichardErkhov/RyanYr_-_self-correct_Llama-3.2-3B-Instruct_metaMathQA_dpo_iter3-gguf Updated 23 days ago • 655
RichardErkhov/RyanYr_-_self-correct_Llama-3.2-3B-Instruct_metaMathQA_dpo_iter2-gguf Updated 23 days ago • 817
RichardErkhov/RyanYr_-_self-correct_Llama-3.2-3B-Instruct_metaMathQA_dpo_iter4-gguf Updated 23 days ago • 651