AutoGPTQ 4bit 128 groupsize

Original weight : https://huggingface.co/gorilla-llm/gorilla-falcon-7b-hf-v0

Downloads last month
9
Inference Examples
Inference API (serverless) does not yet support model repos that contain custom code.