arxiv:2501.13629
Feiyang Chen
PhilipChen
AI & ML interests
None yet
Recent Activity
authored a paper about 15 hours ago
Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models
updated a model 2 months ago
PhilipChen/llama
updated a model 2 months ago
PhilipChen/LLM1
Organizations
None yet