LLM Model, Can I run it?
*This does not support gated or private repos
GPU (optional)
r.text())).querySelector('table')))" />
Model (unquantized)
r.json())).models.filter(m => !m.id.includes('GGUF') && !m.id.includes('AWQ') && !m.id.includes('GPTQ') && !m.id.includes('exl2'));" :aria-expanded="open" :aria-controls="$id('model-typeahead')" x-model="value" class="flex justify-between items-center gap-2 w-full" />
Context Size
Quant Format
Quant format
GGUF
EXL2
GPTQ (coming soon)
BPW
KV Cache
16 bit
8 bit
4 bit
Quantization Size
Q4_K_S
Batch Size
Submit
Model Size (GB)
4.20
Context Size (GB)
6.90
Total Size (GB)
420.69