view article Article Model2Vec: Distill a Small Fast Model from any Sentence Transformer By Pringled β’ Oct 14, 2024 β’ 65
Tulu 3 Models Collection All models released with Tulu 3 -- state of the art open post-training recipes. β’ 10 items β’ Updated 5 days ago β’ 84
Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate Paper β’ 2501.17703 β’ Published 5 days ago β’ 45
view article Article Train 400x faster Static Embedding Models with Sentence Transformers 20 days ago β’ 132
Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 β’ 3 items β’ Updated 8 days ago β’ 311
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper β’ 2501.12948 β’ Published 12 days ago β’ 284
Critical Tokens Matter: Token-Level Contrastive Estimation Enhence LLM's Reasoning Capability Paper β’ 2411.19943 β’ Published Nov 29, 2024 β’ 57
OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation Paper β’ 2412.02592 β’ Published Dec 3, 2024 β’ 22
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking Paper β’ 2501.04519 β’ Published 26 days ago β’ 252
LLM4SR: A Survey on Large Language Models for Scientific Research Paper β’ 2501.04306 β’ Published 27 days ago β’ 33
Agent Laboratory: Using LLM Agents as Research Assistants Paper β’ 2501.04227 β’ Published 27 days ago β’ 84
LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token Paper β’ 2501.03895 β’ Published 27 days ago β’ 48
Personalized Graph-Based Retrieval for Large Language Models Paper β’ 2501.02157 β’ Published about 1 month ago β’ 28
OneKE: A Dockerized Schema-Guided LLM Agent-based Knowledge Extraction System Paper β’ 2412.20005 β’ Published Dec 28, 2024 β’ 17
πͺ SmolLM Collection A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos β’ 12 items β’ Updated Dec 22, 2024 β’ 213
view article Article β΄οΈ ScreenSpot-Pro: GUI Grounding for Professional High-Resolution Computer Use By Ziyang β’ Jan 3 β’ 13
CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings Paper β’ 2501.01257 β’ Published Jan 2 β’ 48
view article Article Introducing Observers: AI Observability with Hugging Face datasets through a lightweight SDK By davidberenstein1957 β’ Nov 21, 2024 β’ 35
view article Article πΊπ¦ββ¬ LLM Comparison/Test: DeepSeek-V3, QVQ-72B-Preview, Falcon3 10B, Llama 3.3 70B, Nemotron 70B in my updated MMLU-Pro CS benchmark By wolfram β’ Jan 2 β’ 39