-
Language Modeling Is Compression
Paper • 2309.10668 • Published • 83 -
SlimPajama-DC: Understanding Data Combinations for LLM Training
Paper • 2309.10818 • Published • 10 -
Sorted LLaMA: Unlocking the Potential of Intermediate Layers of Large Language Models for Dynamic Inference Using Sorted Fine-Tuning (SoFT)
Paper • 2309.08968 • Published • 22 -
Contrastive Decoding Improves Reasoning in Large Language Models
Paper • 2309.09117 • Published • 37
Lei Wang
demolei
AI & ML interests
LLMs
Recent Activity
upvoted
a
paper
about 4 hours ago
O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning
upvoted
a
paper
about 5 hours ago
Kimi k1.5: Scaling Reinforcement Learning with LLMs
upvoted
a
paper
about 5 hours ago
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via
Reinforcement Learning
Organizations
Collections
3
datasets
None public yet