MarcoroCapy-7B

This model is a DPO fine tune of mlabonne/Marcoro14-7B-slerp on argilla/distilabel-capybara-dpo-7k-binarized

Process

Realigned the chat template to ChatML
Completed 1 Epoch
5e-5 learning rate
Training time was about 4.5 hours on 1 H100
Cost was ~$20

GGUF

TODO

Evaluations

TODO

Downloads last month: 7

Safetensors

Model size

7.24B params

Tensor type

FP16

Inference Providers NEW

Text Generation

This model is not currently available via any of the supported third-party Inference Providers, and the model is not deployed on the HF Inference API.

Model tree for macadeliccc/MarcoroCapy-7B

Quantizations

2 models