Mingzhe Du

Elfsong

https://mingzhe.space

AI & ML interests

Code Generation / Preference Alignment / Bias Mitigation

Recent Activity

updated a dataset about 8 hours ago

Elfsong/DenseRuntime

published a dataset about 8 hours ago

Elfsong/DenseRuntime

updated a model 1 day ago

Elfsong/Qwen2-0.5B-Instruct-grpo-dir

Papers 4

arxiv:2503.01295

arxiv:2412.13670

arxiv:2409.09464

arxiv:2402.07844

Spaces 6

Monolith

Sandbox for Code Generation

Lucky Reactor

Manage server activation and termination

Committee

Venus Annotation System

Gcp Allocator

Mingzhe

Models 24

Elfsong/Qwen2-0.5B-Instruct-grpo-dir

Updated 1 day ago

Elfsong/DeepSeek-R1-Distill-Qwen-32B-GRPO-test

Updated 1 day ago

Elfsong/Qwen2-0.5B-GRPO-test

Updated 2 days ago

Elfsong/Llama-3.1-8B-Instruct-QG-SFT-Model

Text Generation • Updated 6 days ago • 105

Elfsong/Llama-3.3-70B-Instruct-QG-SFT-Adapter

Updated 17 days ago

Elfsong/Llama-3.1-8B-Instruct-QG-DPO-Model

Text Generation • Updated 18 days ago • 18

Elfsong/Llama-3.1-8B-Instruct-QG-SFT-Adapter

Updated 18 days ago • 100

Elfsong/Phi-4-14B-Instruct-sft

Text Generation • Updated Jan 18 • 9

Elfsong/Llama-3.1-8B-Instruct-sft

Text Generation • Updated Jan 18 • 93 • 1

Elfsong/Phi-3.5-4B-instruct-sft

Text Generation • Updated Jan 18 • 23 • 1

Datasets 71

Elfsong/DenseRuntime

Updated about 8 hours ago

Elfsong/Llama8B-SFT-Questions

Viewer • Updated 4 days ago • 3.57k • 21

Elfsong/Llama8B-DPO-Questions

Viewer • Updated 4 days ago • 3.47k • 22

Elfsong/Llama8B-DPO-QG

Updated 4 days ago • 166

Elfsong/Llama8B-SFT-QG

Updated 4 days ago • 102

Elfsong/apps_generation

Updated Jan 31 • 10.4k

Elfsong/Venus_t

Viewer • Updated Jan 22 • 2.08k • 480

Elfsong/Venus_KTO

Viewer • Updated Jan 18 • 631k • 67

Elfsong/Venus_SFT

Viewer • Updated Jan 18 • 276k • 69

Elfsong/Venus_DPO

Viewer • Updated Jan 18 • 127k • 61