view article Article Optimizing Pretraining Data Mixes with LLM-Estimated Utility By WillHeld • 4 days ago • 2
Tristan/dclm-perplexity-correlations-spearmanr-no-samp-410m Text Generation • Updated Nov 22, 2024 • 4
Tristan/dclm-perplexity-correlations-spearmanr-no-samp-160m Text Generation • Updated Nov 22, 2024 • 5
Tristan/dclm-perplexity-correlations-160m-target-to-be-bad Text Generation • Updated Nov 19, 2024 • 115