SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features Paper β’ 2502.14786 β’ Published 5 days ago β’ 115
On Domain-Specific Post-Training for Multimodal Large Language Models Paper β’ 2411.19930 β’ Published Nov 29, 2024 β’ 27
SmolVLM 256M & 500M Collection Collection for models & demos for even smoller SmolVLM release β’ 12 items β’ Updated 5 days ago β’ 69
How to Synthesize Text Data without Model Collapse? Paper β’ 2412.14689 β’ Published Dec 19, 2024 β’ 50
view article Article Use Models from the Hugging Face Hub in LM Studio By yagilb β’ Nov 28, 2024 β’ 138
Multilingual LLM Evaluation Collection Multilingual Evaluation Benchmarks β’ 6 items β’ Updated Dec 13, 2024 β’ 10
Moshi v0.1 Release Collection MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi β’ 13 items β’ Updated Sep 18, 2024 β’ 227
LLMs for Extremely Low-Resource Finno-Ugric Languages Paper β’ 2410.18902 β’ Published Oct 24, 2024 β’ 2
MaLA-LM Collection MaLA-LM: Massive Language Adaptation of Large Language Models β’ 7 items β’ Updated Oct 7, 2024 β’ 1
4M Models Collection Multimodal models from https://4m.epfl.ch/ β’ 14 items β’ Updated Jun 14, 2024 β’ 31
AIMv2 Collection A collection of AIMv2 vision encoders that supports a number of resolutions, native resolution, and a distilled checkpoint. β’ 19 items β’ Updated Nov 22, 2024 β’ 73
LLM2CLIP Collection LLM2CLIP makes SOTA pretrained CLIP modal more SOTA ever. β’ 10 items β’ Updated Jan 8 β’ 55
view article Article Releasing the largest multilingual open pretraining dataset By Pclanglais and 2 others β’ Nov 13, 2024 β’ 99
view article Article Decoding Strategies in Large Language Models By mlabonne β’ Oct 29, 2024 β’ 44
GLiClass Collection Generalist and Light-weighted Models for Zero-shot Text Classification β’ 13 items β’ Updated Sep 17, 2024 β’ 14