This model is based on Llama-2-chat-hf and was fine-tuned with QLoRA (PEFT), then quantized to Q4_K with llama.cpp. There is no noticeable performance loss.
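For reference, the quantization step described above roughly follows the standard llama.cpp workflow. This is a sketch, not the exact commands used for this model: the checkpoint directory and output filenames are illustrative, and script/binary names may differ slightly between llama.cpp versions.

```shell
# Convert the merged fine-tuned HF checkpoint to GGUF at FP16.
# "./llama-2-chat-finetuned" is a placeholder for the merged model directory.
python convert_hf_to_gguf.py ./llama-2-chat-finetuned \
    --outfile model-f16.gguf --outtype f16

# Quantize the FP16 GGUF to a Q4_K variant (Q4_K_M shown here).
./llama-quantize model-f16.gguf model-q4_k_m.gguf Q4_K_M
```

Note that the QLoRA adapter must be merged back into the base weights before conversion, since GGUF conversion operates on a full checkpoint rather than a PEFT adapter.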