Which version is this quant of?
#1
by
SaisExperiments
- opened
Which version is this quant of? It's mentioned that slerp is more dumb but the model card below says it's using the slerp method instead of linear like on the original repo.
SaisExperiments
changed discussion status to
closed
SaisExperiments
changed discussion status to
open
That's weird. It should be of the Linear model because that's the one I quanted off of. The 8bpw shows as linear but the 6bpw and the 4bpw show as Slerp? But neither of them were slerp and I used the same filepath to quant them so they should all be linear.