Edit model card

TinyLlama-1.1B-Chat-v1.0

This repository contains quantized versions of the model from the original repository: TinyLlama/TinyLlama-1.1B-Chat-v1.0.

Name Quantization Method Size (GB)
tinyllama-1.1b-chat-v1.0.Q8_0.gguf q8_0 1.09
Downloads last month
6
GGUF
Model size
1.1B params
Architecture
llama

8-bit

Inference Examples
Inference API (serverless) has been turned off for this model.

Model tree for pbatralx/TinyLlama-1.1B-Chat-v1.0-GGUF

Quantized
(67)
this model