MoM: Linear Sequence Modeling with Mixture-of-Memories Paper • 2502.13685 • Published 1 day ago • 21
Test-Time Preference Optimization: On-the-Fly Alignment via Iterative Textual Feedback Paper • 2501.12895 • Published 29 days ago • 56
ringos/output_Llama-3.1-8B-simpleqa-0_1000-m_generation-n_128-t_1.0-k_50-p_0.95-l_128 Updated Dec 25, 2024 • 108
Scaling test-time compute 📈 Enhance math problem solving by scaling test-time compute Running • 516
ringos/output_Llama-3.1-8B-simpleqa-0_-1-m_generation-n_128-t_1.0-k_50-p_0.95-l_128 Updated Dec 17, 2024 • 100
ringos/output_Mistral-Nemo-Base-2407-simpleqa-0_1000-m_generation-n_32-t_1.0-k_40-p_0.9-l_128 Viewer • Updated Dec 2, 2024 • 216 • 225
ringos/bio-detailed-Llama-3.1-8B-gemma2-rm-gold_True-n_32 Viewer • Updated Nov 13, 2024 • 371 • 67
ringos/ultrafeedback_binarized-vanilla-filtered_as_Llama_n32 Viewer • Updated Nov 13, 2024 • 58.2k • 49