arxiv:2501.13629
Ziyue Yang
ziyueyang37
AI & ML interests
None yet
Recent Activity
authored
a paper
about 15 hours ago
Sigma: Differential Rescaling of Query, Key and Value for Efficient
Language Models
Organizations
Papers
1
models
None public yet
datasets
None public yet