pbatralx
/

TinyLlama-1.1B-Chat-v1.0-GGUF

Edit model card

TinyLlama-1.1B-Chat-v1.0

This repository contains quantized versions of the model from the original repository: TinyLlama/TinyLlama-1.1B-Chat-v1.0.

Name	Quantization Method	Size (GB)
tinyllama-1.1b-chat-v1.0.Q8_0.gguf	q8_0	1.09

GGUF

Model size

1.1B params

Architecture

llama

8-bit

Inference Examples

Inference API (serverless) has been turned off for this model.

Base model

Quantized

(67)

this model