Malaysian CausalLM Collection Trained on 21B tokens, 91GB of cleaned texts, able to understand standard Malay, local Malay, local Mandarin, Manglish, and local Tamil. • 4 items • Updated Sep 22 • 1