LayerSkip Collection Models continually pretrained using LayerSkip - https://arxiv.org/abs/2404.16710 • 8 items • Updated Nov 21, 2024 • 47
Llama3.1 ELM Turbo Collection Collection of ELM Turbo model cards based on meta-llama/Meta-Llama-3.1-8B-Instruct • 10 items • Updated Jul 30, 2024 • 2
LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding Paper • 2404.16710 • Published Apr 25, 2024 • 78