GPTQ 4bit 128g
#7 opened by pszemraj
Hi! In case anyone finds it useful, I made a 4-bit quantized version of this model using GPTQ, with 7,500 examples from the Open Assistant dataset as calibration data to guide the quantization. Check it out below; there's a demo/usage guide on the model card:
https://huggingface.co/pszemraj/stablelm-7b-sft-v7e3-autogptq-4bit-128g
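For anyone curious what a workflow like this looks like, here is a minimal sketch using the AutoGPTQ library. The base model ID, dataset split, and prompt handling below are my assumptions for illustration, not the exact script used for the linked checkpoint:

```python
# Sketch of a 4-bit, group-size-128 GPTQ quantization guided by
# Open Assistant calibration examples. Model/dataset IDs and the
# text field used are assumptions, not the author's actual settings.
from datasets import load_dataset
from transformers import AutoTokenizer
from auto_gptq import AutoGPTQForCausalLM, BaseQuantizeConfig

base_model = "OpenAssistant/stablelm-7b-sft-v7-epoch-3"  # assumed base checkpoint
tokenizer = AutoTokenizer.from_pretrained(base_model)

# ~7,500 calibration examples drawn from the Open Assistant data
# (the split and field selection here are illustrative).
ds = load_dataset("OpenAssistant/oasst1", split="train").shuffle(seed=42).select(range(7500))
examples = [
    tokenizer(row["text"], truncation=True, max_length=2048, return_tensors="pt")
    for row in ds
]
examples = [
    {"input_ids": e["input_ids"], "attention_mask": e["attention_mask"]}
    for e in examples
]

# 4-bit weights with a group size of 128, as in the title.
quantize_config = BaseQuantizeConfig(bits=4, group_size=128, desc_act=False)

model = AutoGPTQForCausalLM.from_pretrained(base_model, quantize_config)
model.quantize(examples)  # GPTQ calibration pass over the examples
model.save_quantized("stablelm-7b-sft-v7e3-autogptq-4bit-128g", use_safetensors=True)
```

Loading for inference would then go through `AutoGPTQForCausalLM.from_quantized(...)`, but the model card linked above is the place to look for the actual usage guide.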