Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
micost
/
ft-smollm-135M-instruct-on-hf-ultrafeedback_rob
like
0
TensorBoard
Safetensors
llama
trl
orpo
Generated from Trainer
License:
apache-2.0
Model card
Files
Files and versions
Metrics
Training metrics
Community
Train
b1fb271
ft-smollm-135M-instruct-on-hf-ultrafeedback_rob
Commit History
Upload tokenizer
b1fb271
verified
micost
commited on
Oct 22, 2024
End of training
148f966
verified
micost
commited on
Oct 22, 2024
Upload tokenizer
c6807f0
verified
micost
commited on
Oct 22, 2024
Upload tokenizer
e9c551c
verified
micost
commited on
Oct 22, 2024
initial commit
3a2cec5
verified
micost
commited on
Oct 22, 2024