CharlesLi/llama_3_sky_safe_o1_llama_3_70B_default_1000_100_full Text Generation • Updated 26 days ago • 9
CharlesLi/llama_3_sky_safe_o1_llama_3_70B_default_1000_500_full Text Generation • Updated 26 days ago • 9
CharlesLi/llama_3_sky_safe_o1_llama_3_70B_default_1000_1000_full Text Generation • Updated 26 days ago • 8
CharlesLi/llama_3_sky_safe_o1_llama_3_70B_default_4000_100_full Text Generation • Updated 26 days ago • 8
CharlesLi/llama_3_sky_safe_o1_llama_3_70B_default_4000_500_full Text Generation • Updated 26 days ago • 9
CharlesLi/llama_3_sky_safe_o1_llama_3_70B_default_4000_1000_full Text Generation • Updated 26 days ago • 8
CharlesLi/llama_3_sky_safe_o1_llama_3_70B_reflect_1000_100_full Text Generation • Updated 26 days ago • 9
CharlesLi/llama_3_sky_safe_o1_llama_3_70B_reflect_1000_500_full Text Generation • Updated 26 days ago • 9
CharlesLi/llama_3_sky_safe_o1_llama_3_70B_reflect_1000_1000_full Text Generation • Updated 26 days ago • 9
CharlesLi/llama_3_sky_safe_o1_llama_3_70B_reflect_4000_100_full Text Generation • Updated 26 days ago • 9
CharlesLi/llama_3_sky_safe_o1_llama_3_70B_reflect_4000_500_full Text Generation • Updated 26 days ago • 9
CharlesLi/llama_3_sky_safe_o1_llama_3_70B_reflect_4000_1000_full Text Generation • Updated 26 days ago • 9
RLHF-And-Friends/RM-UltrafeedbackBinarized-Llama-3.1-8B-Instruct-Q4-LoRA8-Batch-16-Tok-1024 Updated 23 days ago
tttx/l3.1-8b-inst-fft-induction-barc-heavy-200k-old-200k-lr1e-5-ep3 Text Generation • Updated 17 days ago • 8