Any plans to support a Qwen2ForCausalRM architecture contribution to Llama.cpp? I would really like to try this model out, given the space requirements its not feasible without quantization. Appreciate your input and thank you for reading.
· Sign up or log in to comment