Llama.cpp suppfor for Qwen2ForCausalRM?

#1
by twoxfh - opened

Any plans to support a Qwen2ForCausalRM architecture contribution to Llama.cpp? I would really like to try this model out, given the space requirements its not feasible without quantization. Appreciate your input and thank you for reading.

Sign up or log in to comment