Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
nbeerbower
's Collections
Dumplings
abliteration loras
DPO
bruphin
flammen
llama 3 experiments
Nemo
DPO
updated
27 days ago
Various useful datasets with preference optimization
Upvote
4
jondurbin/gutenberg-dpo-v0.1
Viewer
•
Updated
Jan 12, 2024
•
918
•
1.05k
•
134
nbeerbower/gutenberg2-dpo
Viewer
•
Updated
Nov 16, 2024
•
293
•
111
•
19
jondurbin/truthy-dpo-v0.1
Viewer
•
Updated
Jan 11, 2024
•
1.02k
•
422
•
133
kyujinpy/orca_math_dpo
Viewer
•
Updated
Apr 12, 2024
•
15.3k
•
64
•
18
antiven0m/physical-reasoning-dpo
Viewer
•
Updated
8 days ago
•
899
•
97
•
10
flammenai/MahouMix-v1
Viewer
•
Updated
May 30, 2024
•
267
•
61
•
4
flammenai/Date-DPO-NoAsterisks
Viewer
•
Updated
Sep 18, 2024
•
330
•
65
•
4
nbeerbower/Arkhaios-DPO
Viewer
•
Updated
Nov 12, 2024
•
222
•
121
•
8
nbeerbower/Purpura-DPO
Viewer
•
Updated
Nov 12, 2024
•
230
•
89
•
8
nbeerbower/Schule-DPO
Viewer
•
Updated
Nov 16, 2024
•
34
•
86
•
1
HumanLLMs/Human-Like-DPO-Dataset
Viewer
•
Updated
Jan 12
•
10.9k
•
2.61k
•
196
nbeerbower/gutenberg-moderne-dpo
Viewer
•
Updated
Nov 17, 2024
•
346
•
109
•
2
nbeerbower/reddit-dpo
Viewer
•
Updated
18 days ago
•
76.9k
•
175
•
1
Atsunori/HelpSteer2-DPO
Viewer
•
Updated
Jul 11, 2024
•
7.59k
•
87
•
6
abacusai/MetaMath_DPO_FewShot
Viewer
•
Updated
Feb 26, 2024
•
395k
•
144
•
26
nbeerbower/GreatFirewall-DPO
Viewer
•
Updated
28 days ago
•
492
•
197
•
6
Upvote
4
Share collection
View history
Collection guide
Browse collections