Malaysian Seq2Seq
Collection
Trained on 17B tokens, 81GB of cleaned texts, able to understand standard Malay, local Malay, local Mandarin, Manglish, and local Tamil.
•
8 items
•
Updated
README at https://github.com/mesolitica/malaya/tree/5.1/pretrained-model/nanoT5