DeepSeek-R1-Distill Collection This is a collection of Llama and Qwen-based models ranging from 1.5B to 70B parameters with are distilled from DeepSeek's new R1 models. • 6 items • Updated 20 days ago