DopeyTinyLlama-1.1B-v1

An experimental DPO finetune of SmarTinyLlama with Alpaca-QLoRA

Datasets

Trained on bagel style DPO datasets

Uses chatml style prompt template

Safetensors

Model size

1.1B params

Tensor type

FP16

Inference Examples

Inference API (serverless) is not available, repository is disabled.

Merges

Quantizations