Hugging Face
2140.8 TFLOPS
Quentin Gallouédec
qgallouedec
126 followers · 75 following
QGallouedec
qgallouedec
qgallouedec
qgallouedec.bsky.social
Recent Activity
Updated a dataset about 3 hours ago: trl-lib/documentation-images
Updated a dataset 1 day ago: qgallouedec/trl-metrics
Upvoted a paper 5 days ago: Presumed Cultural Identity: How Names Shape LLM Responses
Articles (4)
Open-R1: Update #1 • 287
Visualize and understand GPU memory in PyTorch • 188
Papers (4)
arxiv:2402.09844
arxiv:2402.03046
arxiv:2208.14928
arxiv:2106.13687
Spaces (1)
Train Memory 📈 • Running • 9
Generate memory usage forecast for model training
Models (715)
qgallouedec/Qwen2.5-0.5B-GRPO-main • Text Generation • Updated 6 days ago • 4
qgallouedec/gemma-2-2B-it-thinking-function_calling • Updated 7 days ago
qgallouedec/Qwen2.5-0.5B-GRPO-2873 • Updated 8 days ago
qgallouedec/Qwen2.5-0.5B-GRPO-2776-next • Updated 13 days ago
qgallouedec/Qwen2.5-1.5B-Open-R1-GRPO • Text Generation • Updated 17 days ago • 9
qgallouedec/Qwen2.5-32B-Open-R1-GRPO • Updated 19 days ago • 1
qgallouedec/Qwen2.5-14B-Open-R1-GRPO • Updated 19 days ago
qgallouedec/Qwen2.5-7B-Open-R1-GRPO • Updated 19 days ago
qgallouedec/Qwen2-0.5B-GRPO • Updated Jan 19
qgallouedec/tiny-Qwen2ForSequenceClassification-2.5 • Text Classification • Updated Jan 14 • 12
Datasets (67)
qgallouedec/trl-metrics • Viewer • Updated 1 day ago • 86.1k • 3.43k • 1
qgallouedec/prm800k • Viewer • Updated Dec 17, 2024 • 41.2k • 141 • 3
qgallouedec/ultrafeedback-prompt • Viewer • Updated Sep 9, 2024 • 60.9k • 71
qgallouedec/ultrafeedback-gpt-3.5-turbo-helpfulness • Viewer • Updated Sep 9, 2024 • 16.6k • 95
qgallouedec/lm-human-preferences-descriptiveness • Viewer • Updated Sep 9, 2024 • 6.26k • 61
qgallouedec/lm-human-preferences-sentiment • Viewer • Updated Sep 9, 2024 • 6.26k • 74
qgallouedec/tldr-preference • Viewer • Updated Sep 9, 2024 • 179k • 72
qgallouedec/tldr • Viewer • Updated Sep 9, 2024 • 130k • 72
qgallouedec/hh-rlhf-helpful-base • Viewer • Updated Sep 5, 2024 • 46.2k • 63
qgallouedec/hh-rlhf-helpful-base-trl-style • Viewer • Updated Sep 5, 2024 • 46.2k • 87