Do ZeroGPU inference GPUs not support 8-bit/4-bit models?

#22
by Ayushnangia - opened
ZeroGPU Explorers org

I have been trying to run the 4-bit version of a model, but at inference it always gives me this error.

What should I do?

(Attached a screenshot of the error, since it is too long to paste.)

image.png

Here is the Space's link: https://huggingface.co/spaces/Ayushnangia/Try_mixtral

ZeroGPU Explorers org

I have the same problem here; this may be intentional: https://huggingface.co/spaces/archit11/Llama-3-70B/discussions/1

ZeroGPU Explorers org

https://huggingface.co/spaces/eswardivi/AIO_Chat_llama3_8B

With the BNB config in the above Space, I am able to use both 4-bit and 8-bit.
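
Roughly, that setup looks like the following. This is a minimal sketch, not the exact code from the linked Space: the model id, prompt handling, and generation settings are placeholder assumptions.

```python
# Minimal ZeroGPU-style sketch: load a model with a BitsAndBytesConfig and
# run generation inside a @spaces.GPU-decorated function.
# Placeholder assumptions: model_id, max_new_tokens, and the generate() helper.
import spaces
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

model_id = "meta-llama/Meta-Llama-3-8B-Instruct"  # placeholder model id

# 4-bit quantization config; use load_in_8bit=True instead for 8-bit
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    quantization_config=bnb_config,
    device_map="auto",
)

@spaces.GPU
def generate(prompt: str) -> str:
    # Tokenize on the model's device and decode the generated continuation
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    output = model.generate(**inputs, max_new_tokens=256)
    return tokenizer.decode(output[0], skip_special_tokens=True)
```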

ZeroGPU Explorers org

image.png
It seems like the quantization config is fixed by the model?
