Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
205516.8
TFLOPS
187
52
67
Leandro von Werra
lvwerra
Follow
Csplk's profile picture
AriaRen's profile picture
nhanho2707's profile picture
325 followers
·
57 following
https://github.com/lvwerra
lvwerra
lvwerra
AI & ML interests
NLP and RL
Recent Activity
liked
a Space
about 24 hours ago
andrewrreed/closed-vs-open-arena-elo
liked
a Space
6 days ago
nanotron/ultrascale-playbook
published
a Space
6 days ago
nanotron/ultrascale-playbook
View all activity
Organizations
Articles
29
Article
50
DABStep: Data Agent Benchmark for Multi-step Reasoning
Article
287
Open-R1: Update #1
View all Articles
Papers
14
arxiv:
2502.02737
arxiv:
2501.08365
arxiv:
2410.24198
arxiv:
2406.17557
Expand 14 papers
spaces
21
Sort: Recently updated
Running
1
Executor
📚
Sleeping
3d Bench Viz
📈
Running
7
3d
🔥
Visualize 3D parallelism configuration
Running
10
Train LLMs
⚡
Calculate training cost and model efficiency
Sleeping
Text Source Viz
👁
Runtime error
20
Harm Space
⚡
Expand 21 spaces
models
33
Sort: Recently updated
lvwerra/the-tokenizer-v1
Updated
Feb 12, 2024
•
1
lvwerra/sc2
Updated
Feb 11, 2024
•
2
lvwerra/starcoder-98k-no-regex-no-digits
Updated
Sep 29, 2023
lvwerra/starcoder-393k
Updated
Sep 28, 2023
lvwerra/starcoder-196k
Updated
Sep 28, 2023
lvwerra/starcoder-98k
Updated
Sep 27, 2023
lvwerra/starcoder-24k
Updated
Sep 27, 2023
lvwerra/starcoder-12k
Updated
Sep 27, 2023
lvwerra/starcoder-6k
Updated
Sep 27, 2023
lvwerra/starcoderbase-gsm8k
Text Generation
•
Updated
Aug 30, 2023
•
26
Expand 33 models
datasets
22
Sort: Recently updated
lvwerra/dabstep
Viewer
•
Updated
21 days ago
•
3
•
4.3k
lvwerra/needle-llama3-16x524k
Viewer
•
Updated
Apr 26, 2024
•
1.41k
•
326
•
1
lvwerra/needle-llama3-16x65k
Viewer
•
Updated
Apr 26, 2024
•
1.41k
•
114
•
1
lvwerra/needle-llama3-16x8k
Viewer
•
Updated
Apr 26, 2024
•
1.41k
•
81
•
1
lvwerra/needle-llama3-16x512
Viewer
•
Updated
Apr 26, 2024
•
1.41k
•
56
•
1
lvwerra/admin
Viewer
•
Updated
Mar 6, 2024
•
1
•
463
lvwerra/stack-exchange-paired
Viewer
•
Updated
Mar 13, 2023
•
31.3M
•
3.05k
•
143
lvwerra/git-commits-clean
Updated
Mar 2, 2023
•
6
lvwerra/changeit
Viewer
•
Updated
Jan 8, 2023
•
31
•
234
lvwerra/code-ml
Viewer
•
Updated
Jan 4, 2023
•
1.5k
•
30
Expand 22 datasets