REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models Paper • 2501.03262 • Published Jan 4 • 90
MoritzLaurer/mDeBERTa-v3-base-xnli-multilingual-nli-2mil7 Zero-Shot Classification • Updated Apr 11, 2024 • 65.6k • 309
Running on CPU Upgrade 47 47 OpenLLM Turkish leaderboard v0.2 🥇 Browse and submit model evaluations in LLM benchmarks
InternVL2.5-MPO Collection Enhancing the Reasoning Ability of MLLMs via Mixed Preference Optimization • 16 items • Updated 7 days ago • 26
No More Adam: Learning Rate Scaling at Initialization is All You Need Paper • 2412.11768 • Published Dec 16, 2024 • 41