rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking Paper • 2501.04519 • Published 11 days ago • 232
The deepseek-ai/DeepSeek-V3 model is very good! I have been playing with it and found it is really good at one-shotting a pretty good landing page. You can play with it here: https://deepseek-artifacts.vercel.app All the responses get saved in the cfahlgren1/react-code-instructions dataset. Hopefully we can build one of the biggest, highest-quality frontend datasets on the hub 💪
MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases Paper • 2402.14905 • Published Feb 22, 2024 • 127
MambaByte: Token-free Selective State Space Model Paper • 2401.13660 • Published Jan 24, 2024 • 53