章其涛
Joker114339
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 20 hours ago
Sigma: Differential Rescaling of Query, Key and Value for Efficient
Language Models
Organizations
None yet
models
None public yet
datasets
None public yet