Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs Paper • 2502.12982 • Published 7 days ago • 11
SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models Paper • 2502.09604 • Published 12 days ago • 32
Great Models Think Alike and this Undermines AI Oversight Paper • 2502.04313 • Published 19 days ago • 30
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published 21 days ago • 192
Optimizing Large Language Model Training Using FP4 Quantization Paper • 2501.17116 • Published 28 days ago • 36
BlockPruner: Fine-grained Pruning for Large Language Models Paper • 2406.10594 • Published Jun 15, 2024
Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models Paper • 2501.13629 • Published Jan 23 • 44
Step-KTO: Optimizing Mathematical Reasoning through Stepwise Binary Feedback Paper • 2501.10799 • Published Jan 18 • 15
EpiCoder: Encompassing Diversity and Complexity in Code Generation Paper • 2501.04694 • Published Jan 8 • 14
Ensembling Large Language Models with Process Reward-Guided Tree Search for Better Complex Reasoning Paper • 2412.15797 • Published Dec 20, 2024 • 18