LayerSkip Collection Models continually pretrained using LayerSkip - https://arxiv.org/abs/2404.16710 • 8 items • Updated Nov 21, 2024 • 47
Llama3.1 ELM Turbo Collection Collection of ELM Turbo model cards based on meta-llama/Meta-Llama-3.1-8B-Instruct • 10 items • Updated Jul 30, 2024 • 2
Running • 934 • Can You Run It? LLM version: Determine GPU requirements for large language models
LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding Paper • 2404.16710 • Published Apr 25, 2024 • 78