Demystifying Long Chain-of-Thought Reasoning in LLMs Paper • 2502.03373 • Published 4 days ago • 41
Pangea Collection A Fully Open Multilingual Multimodal LLM for 39 Languages • 26 items • Updated 8 days ago • 18
Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate Paper • 2501.17703 • Published 11 days ago • 51