Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
NicholasCorrado
/
zephyr-7b-uf-rlced-conifer-group-dpo-2e-alr-0.01
like
0
Text Generation
Transformers
Safetensors
data/zephyr_uf_rlced_conifer_ref
mistral
alignment-handbook
trl
dpo
Generated from Trainer
conversational
text-generation-inference
Inference Endpoints
License:
apache-2.0
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
zephyr-7b-uf-rlced-conifer-group-dpo-2e-alr-0.01
/
model-00002-of-00003.safetensors
Commit History
Training in progress, step 1440
d4ff4b2
verified
NicholasCorrado
commited on
Sep 11
Training in progress, step 1080
4320ce1
verified
NicholasCorrado
commited on
Sep 11
Training in progress, step 720
a2668c8
verified
NicholasCorrado
commited on
Sep 11
Training in progress, step 360
1855552
verified
NicholasCorrado
commited on
Sep 11