LeMaterial: an open source initiative to accelerate materials discovery and research Dec 10, 2024 β’ 36
Tulu 3 Models Collection All models released with Tulu 3 -- state of the art open post-training recipes. β’ 10 items β’ Updated 4 days ago β’ 82
view article Article MiniMax-01 is Now Open-Source: Scaling Lightning Attention for the AI Agent Era By MiniMax-AI β’ 19 days ago β’ 40
DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning Paper β’ 2406.11896 β’ Published Jun 14, 2024 β’ 20
view article Article πΊπ¦ββ¬ LLM Comparison/Test: DeepSeek-V3, QVQ-72B-Preview, Falcon3 10B, Llama 3.3 70B, Nemotron 70B in my updated MMLU-Pro CS benchmark By wolfram β’ Jan 2 β’ 39
Phi-3 Collection Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. β’ 26 items β’ Updated 26 days ago β’ 550
Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction Paper β’ 2412.04454 β’ Published Dec 5, 2024 β’ 60
Falcon3 Collection Falcon3 family of Open Foundation Models is a set of pretrained and instruct LLMs ranging from 1B to 10B parameters. β’ 40 items β’ Updated 26 days ago β’ 80
view article Article FineWeb2-C: Help Build Better Language Models in Your Language By davanstrien β’ Dec 23, 2024 β’ 18
TabuLa-8B Collection Training, eval suite, and model from the paper "Large Scale Transfer Learning for Tabular Data via Language Modeling" https://arxiv.org/abs/2406.12031 β’ 4 items β’ Updated Jun 19, 2024 β’ 11
LiFT: Leveraging Human Feedback for Text-to-Video Model Alignment Paper β’ 2412.04814 β’ Published Dec 6, 2024 β’ 45
Solving Quantitative Reasoning Problems with Language Models Paper β’ 2206.14858 β’ Published Jun 29, 2022 β’ 1
GUI agents Collection A collection of papers on GUI agents β’ 3 items β’ Updated Dec 14, 2024 β’ 5
AgentTrek: Agent Trajectory Synthesis via Guiding Replay with Web Tutorials Paper β’ 2412.09605 β’ Published Dec 12, 2024 β’ 28