Replicable Benchmarking of Neural Machine Translation (NMT) on Low-Resource Local Languages in Indonesia Paper • 2311.00998 • Published Nov 2, 2023
SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages Paper • 2406.10118 • Published Jun 14, 2024
IndoToxic2024: A Demographically-Enriched Dataset of Hate Speech and Toxicity Types for Indonesian Language Paper • 2406.19349 • Published Jun 27, 2024
MetaMetrics: Calibrating Metrics For Generation Tasks Using Human Preferences Paper • 2410.02381 • Published Oct 3, 2024
WorldCuisines: A Massive-Scale Benchmark for Multilingual and Multicultural Visual Question Answering on Global Cuisines Paper • 2410.12705 • Published Oct 16, 2024