LangAGI-Lab/qwen-7b-verified-7k-rejection-sampling-alpaca-format Viewer • Updated about 21 hours ago • 7.38k • 10
LangAGI-Lab/qwen-7b-verified-7k-rejection-sampling-alpaca-format Viewer • Updated about 21 hours ago • 7.38k • 10
LangAGI-Lab/train-rl-o1-mini-annotated-math-numina-10k-numeric-answer Viewer • Updated 3 days ago • 10k • 93
LangAGI-Lab/train-rl-o1-mini-annotated-math-numina-10k-numeric-answer Viewer • Updated 3 days ago • 10k • 93