arxiv:2501.11873
Zekun Wang
kugwzk
Recent Activity
authored a paper 5 days ago
Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models
authored a paper 12 days ago
OpenCSG Chinese Corpus: A Series of High-quality Chinese Datasets for LLM Training