arxiv:2412.13670
Mingzhe Du
Elfsong
AI & ML interests
Code Generation / Preference Alignment / Bias Mitigation
Recent Activity
updated
a dataset
about 23 hours ago
Elfsong/CrowdEval_R
published
a dataset
1 day ago
Elfsong/CrowdEval_R
upvoted
a
paper
4 days ago
GuardReasoner: Towards Reasoning-based LLM Safeguards
Organizations
Papers
2
spaces
5
models
17
Elfsong/Phi-4-14B-Instruct-sft
Text Generation
•
Updated
•
2
Elfsong/Llama-3.1-8B-Instruct-sft
Text Generation
•
Updated
•
63
•
1
Elfsong/Phi-3.5-4B-instruct-sft
Text Generation
•
Updated
•
11
•
1
Elfsong/Llama-3.3-70B-Instruct-dpo
Text Generation
•
Updated
•
15
Elfsong/Llama-3.3-70B-Instruct-stf
Text Generation
•
Updated
•
68
Elfsong/Llama-3.1-8B-Instruct-dpo
Text Generation
•
Updated
•
24
Elfsong/mouadsfilter
Text2Text Generation
•
Updated
•
100
Elfsong/dpo
Updated
Elfsong/debias_model
Updated
Elfsong/my_awesome_model
Updated
datasets
68
Elfsong/CrowdEval_R
Updated
•
9
Elfsong/apps_generation
Updated
•
600
Elfsong/Venus_t
Viewer
•
Updated
•
2.08k
•
644
Elfsong/Venus_KTO
Viewer
•
Updated
•
631k
•
31
Elfsong/Venus_SFT
Viewer
•
Updated
•
276k
•
44
Elfsong/Venus_DPO
Viewer
•
Updated
•
127k
•
27
Elfsong/Llama-3.3-70B-Instruct-sft-response
Viewer
•
Updated
•
256
•
36
Elfsong/Llama-3.3-70B-Instruct-dpo-response
Viewer
•
Updated
•
256
•
36
Elfsong/Llama-3.3-70B-Instruct-response
Viewer
•
Updated
•
256
•
41
Elfsong/Llama-3.1-8B-Instruct-dpo-response
Viewer
•
Updated
•
256
•
38