How did you manage to quantize this model?
#1
by
SaffalPoosh
- opened
the method in readme is loading all the checkpoints, can you please suggest how did you manage to quantize it and on what GPU?
the method in readme is loading all the checkpoints, can you please suggest how did you manage to quantize it and on what GPU?