Pairwise Difficulty Rankings for Easy-to-Hard Datasets
Mucong's Organization
university
AI & ML interests
Machine Learning
Organization Card
Mucong's Organization
Collections
1
spaces
1
models
85
mcding-org/CorrectDPO-Model-DDP_Pm3B_U0_beta0.10r0.30rho0.20
Updated
mcding-org/CorrectDPO-Model-DDP_L8B_U0_beta0.10r0.30rho0.20
Updated
mcding-org/CorrectDPO-Model-DPO_L8B_U0_beta0.10
Updated
mcding-org/CorrectDPO-Model-DPO_Pm3B_U0_beta0.10
Updated
mcding-org/CorrectDPO-Model-SFT_Pm3B_U0
Updated
mcding-org/CorrectDPO-Model-SFT_L8B_U0
Updated
mcding-org/CorrectDPO-Model-DPR_Q0.5B_PP10_beta0.10g0.10gamma0.50
Updated
mcding-org/CorrectDPO-Model-DPR_Q0.5B_PP10_beta0.10g0.20gamma0.50
Updated
mcding-org/CorrectDPO-Model-DDP_Q0.5B_PP10_beta0.10r0.10rho0.50
Updated
mcding-org/CorrectDPO-Model-DDP_Q0.5B_PP10_beta0.10r0.10rho0.40
Updated
datasets
89
mcding-org/Easy2Hard-IRT-tune
Updated
•
80
mcding-org/Easy2Hard-Winogrande-GPT
Updated
•
145
mcding-org/Easy2Hard-ARC-GPT
Updated
•
132
mcding-org/Easy2Hard-GSM8K-GPT
Updated
•
85
mcding-org/CorrectDPO-Eval-DPO_L8B_U0_beta0.10
Updated
•
37
mcding-org/CorrectDPO-Eval-DPO_Pm3B_U0_beta0.10
Updated
•
36
mcding-org/CorrectDPO-Eval-DDP_L8B_U0_beta0.10r0.30rho0.20
Updated
•
36
mcding-org/CorrectDPO-Eval-DDP_Pm3B_U0_beta0.10r0.30rho0.20
Updated
•
37
mcding-org/CorrectDPO-Dataset-U0
Updated
•
36
mcding-org/CorrectDPO-Dataset-U30
Updated
•
37