Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Yofuria
/
Mistral-7B-base-simpo-qlora
like
1
PEFT
Safetensors
HuggingFaceH4/ultrafeedback_binarized
mistral
alignment-handbook
trl
dpo
Generated from Trainer
4-bit precision
bitsandbytes
Model card
Files
Files and versions
Community
Train
Use this model
main
Mistral-7B-base-simpo-qlora
/
adapter_config.json
Commit History
Model save
9749e4f
verified
Yofuria
commited on
Jul 4, 2024
Training in progress, step 3000
b5b3a25
verified
Yofuria
commited on
Jul 4, 2024
Training in progress, step 100
d10c0cb
verified
Yofuria
commited on
Jul 3, 2024