alexredna/TinyLlama-1.1B-Chat-v1.0-reasoning-v2-dpo
Tags: Text Generation, Transformers, TensorBoard, Safetensors, llama, trl, dpo, Generated from Trainer, conversational, text-generation-inference, Inference Endpoints
License: apache-2.0
refs/pr/1
TinyLlama-1.1B-Chat-v1.0-reasoning-v2-dpo/runs/Jan06_23-31-48_df4a4afb4442
Commit History
Model save, f61da97, alexredna committed on Jan 7
Training in progress, step 4650, 03bd415, alexredna committed on Jan 7
Training in progress, step 4500, 5fd9216, alexredna committed on Jan 7
Training in progress, step 4350, 1d47531, alexredna committed on Jan 7
Training in progress, step 4200, 87949fb, alexredna committed on Jan 7
Training in progress, step 3900, 3ea1410, alexredna committed on Jan 7
Training in progress, step 3600, c1b8a63, alexredna committed on Jan 7
Training in progress, step 3450, e49cc58, alexredna committed on Jan 7
Training in progress, step 3300, c95b42a, alexredna committed on Jan 7
Training in progress, step 3150, 08d8e2f, alexredna committed on Jan 7
Training in progress, step 3000, 28a4cb7, alexredna committed on Jan 7
Training in progress, step 2700, 73152fb, alexredna committed on Jan 7
Training in progress, step 2400, ade1ee9, alexredna committed on Jan 7
Training in progress, step 2100, 98b6160, alexredna committed on Jan 7
Training in progress, step 1950, 2eefb1b, alexredna committed on Jan 7
Training in progress, step 1800, e4be78d, alexredna committed on Jan 7
Training in progress, step 1650, ae400f7, alexredna committed on Jan 7
Training in progress, step 1350, 8103b9a, alexredna committed on Jan 7
Training in progress, step 1200, b8d1e34, alexredna committed on Jan 7
Training in progress, step 1050, ce7e0da, alexredna committed on Jan 7
Training in progress, step 900, 8e15c82, alexredna committed on Jan 7
Training in progress, step 600, c80cf22, alexredna committed on Jan 7
Training in progress, step 450, fd0bbab, alexredna committed on Jan 7
Training in progress, step 300, 55bf1eb, alexredna committed on Jan 7