Llama.cpp suppfor for Qwen2ForCausalRM?

by twoxfh - opened 19 days ago

19 days ago

Any plans to support a Qwen2ForCausalRM architecture contribution to Llama.cpp? I would really like to try this model out, given the space requirements its not feasible without quantization. Appreciate your input and thank you for reading.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment