Malaysian CausalLM Collection. Trained on 21B tokens, 91GB of cleaned text; able to understand standard Malay, local Malay, local Mandarin, Manglish, and local Tamil. • 4 items • Updated Dec 23, 2024