mlfoundations-dev/oh-mistral-bs1024_lr5_00E-06_schedulercosine_with_min_lr_warmup1_00E-01_minlr5_00E-07 Text Generation • Updated 1 day ago • 7
mlfoundations-dev/oh-mistral-bs512_lr2_00E-06_schedulercosine_with_min_lr_warmup5_00E-02_minlr5_00E-07 Text Generation • Updated 1 day ago • 1
mlfoundations-dev/oh-mistral-bs512_lr5_00E-06_schedulercosine_with_min_lr_warmup1_00E-01_minlr5_00E-07 Text Generation • Updated 1 day ago • 1
mlfoundations-dev/oh-mistral-bs4096_lr5_00E-06_schedulercosine_with_min_lr_warmup1_00E-01_minlr5_00E-07 Text Generation • Updated about 22 hours ago
mlfoundations-dev/oh-mistral-bs4096_lr2_00E-06_schedulercosine_with_min_lr_warmup1_00E-01_minlr5_00E-07 Text Generation • Updated about 22 hours ago
mlfoundations-dev/oh-mistral-bs4096_lr2_00E-06_schedulerconstant_warmup1_00E-01_minlr Text Generation • Updated about 22 hours ago
mlfoundations-dev/oh-mistral-bs2048_lr5_00E-06_schedulercosine_with_min_lr_warmup1_00E-01_minlr5_00E-07 Text Generation • Updated about 21 hours ago
mlfoundations-dev/oh-mistral-bs2048_lr2_00E-06_schedulercosine_with_min_lr_warmup1_00E-01_minlr5_00E-07 Text Generation • Updated about 21 hours ago • 8
mlfoundations-dev/oh-mistral-bs2048_lr2_00E-06_schedulerconstant_warmup1_00E-01_minlr Text Generation • Updated about 21 hours ago • 8
mlfoundations-dev/oh-mistral-bs1024_lr2_00E-06_schedulercosine_with_min_lr_warmup1_00E-01_minlr5_00E-07 Text Generation • Updated about 19 hours ago
mlfoundations-dev/oh-mistral-bs1024_lr2_00E-06_schedulerconstant_warmup1_00E-01_minlr Text Generation • Updated about 19 hours ago
mlfoundations-dev/oh-mistral-bs512_lr2_00E-06_schedulerconstant_warmup1_00E-01_minlr Text Generation • Updated about 16 hours ago
mlfoundations-dev/oh-mistral-bs512_lr2_00E-06_schedulercosine_with_min_lr_warmup1_00E-01_minlr5_00E-07 Text Generation • Updated about 16 hours ago
mlfoundations-dev/llama3-1_8b_mlfoundations-dev-stackexchange_poker Text Generation • Updated about 14 hours ago
mlfoundations-dev/llama3-1_8b_mlfoundations-dev-stackexchange_politics Text Generation • Updated about 11 hours ago • 4
mlfoundations-dev/llama3-1_8b_mlfoundations-dev-stackexchange_proofassistants Text Generation • Updated about 12 hours ago
mlfoundations-dev/llama3-1_8b_mlfoundations-dev-stackexchange_puzzling Text Generation • Updated about 8 hours ago
mlfoundations-dev/llama3-1_8b_mlfoundations-dev-stackexchange_quantumcomputing Text Generation • Updated about 8 hours ago
mlfoundations-dev/llama3-1_8b_mlfoundations-dev-stackexchange_reverseengineering Text Generation • Updated about 8 hours ago
mlfoundations-dev/llama3-1_8b_mlfoundations-dev-stackexchange_scicomp Text Generation • Updated about 4 hours ago
mlfoundations-dev/llama3-1_8b_mlfoundations-dev-stackexchange_scifi Text Generation • Updated about 4 hours ago