Distributed Training: Train BART/T5 for Summarization using 🤗 Transformers and Amazon SageMaker Apr 8, 2021
LLM Reasoning Papers Collection Papers to improve reasoning capabilities of LLMs • 17 items • Updated 2 days ago • 89
Large Language Monkeys: Scaling Inference Compute with Repeated Sampling Paper • 2407.21787 • Published Jul 31 • 12
LLM Reasoning Papers Collection Papers to improve reasoning capabilities of LLMs • 17 items • Updated 2 days ago • 89
Post-Training Releases November 2024 Collection Includes papers with post-training sides from best open-models from November, including OpenCoder, SmolLM-v2, Orca Agent Instruct, Tülü 3 • 3 items • Updated Nov 23
Hymba: A Hybrid-head Architecture for Small Language Models Paper • 2411.13676 • Published Nov 20 • 38