Elie Bakouch

eliebak

AI & ML interests

Training LLMs @ 🤗

Recent Activity

liked a Space about 5 hours ago

bigcode/bigcode-models-leaderboard

commented on their article about 7 hours ago

Open-R1: a fully open reproduction of DeepSeek-R1

eliebak's activity

upvoted an article 1 day ago

Open-R1: Update #1

Article • 2 days ago • 181

upvoted an article 3 days ago

Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial

Article • 4 days ago • 22

upvoted 2 articles 5 days ago

Mastering Long Contexts in LLMs with KVPress

Article • 12 days ago • 57

How biased is Whisper ? Evaluating Whisper Models for Robustness to Diverse English Accents

Article • 5 days ago • 12

upvoted a paper 5 days ago

Exploring the sustainable scaling of AI dilemma: A projective study of corporations' AI environmental impacts

Paper • 2501.14334 • Published 11 days ago • 15

upvoted a paper 6 days ago

MinMo: A Multimodal Large Language Model for Seamless Voice Interaction

Paper • 2501.06282 • Published 24 days ago • 42

upvoted an article 6 days ago

Welcome to Inference Providers on the Hub 🔥

Article • 7 days ago • 220

upvoted an article 7 days ago

Open-R1: a fully open reproduction of DeepSeek-R1

Article • 7 days ago • 587

upvoted an article 14 days ago

Yay! Organizations can now publish blog Articles

Article • 14 days ago • 30

upvoted an article 19 days ago

MiniMax-01 is Now Open-Source: Scaling Lightning Attention for the AI Agent Era

Article • 20 days ago • 40

upvoted a collection 21 days ago

SmolLM2

Collection

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 15 items • Updated Dec 22, 2024 • 211

upvoted a paper 25 days ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published 26 days ago • 252

upvoted a collection 27 days ago

DolphinLabeled Datasets

Collection

Eric Hartford has added labels to help you filter datasets, for your pleasure. • 5 items • Updated 28 days ago • 11

upvoted 2 papers about 1 month ago

2 OLMo 2 Furious

Paper • 2501.00656 • Published Dec 31, 2024 • 16

YuLan-Mini: An Open Data-efficient Language Model

Paper • 2412.17743 • Published Dec 23, 2024 • 64

upvoted an article about 1 month ago

🌁#81: Key AI Concepts to Follow in 2025

Article • Dec 23, 2024 • 24

upvoted a paper about 2 months ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 345

upvoted 2 papers 3 months ago

RedPajama: an Open Dataset for Training Large Language Models

Paper • 2411.12372 • Published Nov 19, 2024 • 50

Balancing Pipeline Parallelism with Vocabulary Parallelism

Paper • 2411.05288 • Published Nov 8, 2024 • 19

upvoted a collection 4 months ago

LoLCATS

Collection

Linearizing LLMs with high quality and efficiency. We linearize the full Llama 3.1 model family -- 8b, 70b, 405b -- for the first time! • 4 items • Updated Oct 14, 2024 • 15

Articles

Open-R1: a fully open reproduction of DeepSeek-R1

Diving into MiniMax01 405B MoE

SmolVLM - small yet mighty Vision Language Model

SmolLM - blazingly fast and remarkably powerful
