Shiksha: A Technical Domain focused Translation Dataset and Model for Indian Languages Paper • 2412.09025 • Published Dec 12, 2024
Bloom Library: Multimodal Datasets in 300+ Languages for a Variety of Downstream Tasks Paper • 2210.14712 • Published Oct 26, 2022