SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published 27 days ago • 196
Tulu 3 Models Collection All models released with Tulu 3 -- state of the art open post-training recipes. • 11 items • Updated 19 days ago • 92
Article Use Models from the Hugging Face Hub in LM Studio By yagilb • Nov 28, 2024 • 138
CAT4D: Create Anything in 4D with Multi-View Video Diffusion Models Paper • 2411.18613 • Published Nov 27, 2024 • 52
TÜLU 3: Pushing Frontiers in Open Language Model Post-Training Paper • 2411.15124 • Published Nov 22, 2024 • 59
Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models Paper • 2411.04996 • Published Nov 7, 2024 • 51
LLMs Know More Than They Show: On the Intrinsic Representation of LLM Hallucinations Paper • 2410.02707 • Published Oct 3, 2024 • 48
LLaVA-3D: A Simple yet Effective Pathway to Empowering LMMs with 3D-awareness Paper • 2409.18125 • Published Sep 26, 2024 • 34
SplatFields: Neural Gaussian Splats for Sparse 3D and 4D Reconstruction Paper • 2409.11211 • Published Sep 17, 2024 • 9
Implicit Neural Representations with Fourier Kolmogorov-Arnold Networks Paper • 2409.09323 • Published Sep 14, 2024 • 5