Simeng Sun

simsun131

https://people.cs.umass.edu/~simengsun/

AI & ML interests

Language Modeling, Machine Translation

Recent Activity

liked a Space 12 days ago

HuggingFaceH4/blogpost-scaling-test-time-compute

simsun131's activity

upvoted 2 papers about 2 months ago

Hymba: A Hybrid-head Architecture for Small Language Models

Paper • 2411.13676 • Published Nov 20, 2024 • 40

Star Attention: Efficient LLM Inference over Long Sequences

Paper • 2411.17116 • Published Nov 26, 2024 • 48

upvoted a paper 9 months ago

RULER: What's the Real Context Size of Your Long-Context Language Models?

Paper • 2404.06654 • Published Apr 9, 2024 • 34