GGUF
English
Inference Endpoints
Edit model card

Just some GGUF v2 quantizations of the model TinyLlama/tinyLlama-intermediate-checkpoints Step 480K pretrained on 1T of tokens.

q2_k, q4_0, q4_1, q5_0, q5_1, q8_0 and f16.

Downloads last month
101
GGUF
Model size
1.1B params
Architecture
llama

2-bit

4-bit

5-bit

8-bit

16-bit

Inference API
Unable to determine this model's library. Check the docs .

Datasets used to train Aryanne/TinyLlama-1.1B-step-480K-1007B-gguf