realbenpope's Collections
Embeddings
Diffusion Language Models
Steady state model
Visual reasoning
Reasoning
Memory Tokens
In context learning
Recurrent architecture
MoE
Small LMs
Recurrent architecture
Updated Aug 16
Layerwise Recurrent Router for Mixture-of-Experts
Paper • 2408.06793 • Published Aug 13 • 30