Deep Ignorance Collection This collection contains the model and data artifacts from O'Brien et al. (2025). https://deepignorance.ai • 32 items • Updated 4 days ago • 4
Dayhoff Atlas Collection The models and datasets that comprise the Dayhoff Atlas • 10 items • Updated 19 days ago • 7
TabArena: A Living Benchmark for Machine Learning on Tabular Data Paper • 2506.16791 • Published Jun 20 • 3
view article Article Accelerating AI for Drug Discovery: Ginkgo’s GDPx Functional Genomics and GDPa Antibody Developability Dataset Series By cgeorgiaw and 1 other • Jun 24 • 15
Will It Still Be True Tomorrow? Multilingual Evergreen Question Classification to Improve Trustworthy QA Paper • 2505.21115 • Published May 27 • 139
Pretraining Language Models for Diachronic Linguistic Change Discovery Paper • 2504.05523 • Published Apr 7 • 6
view article Article Welcome Llama 4 Maverick & Scout on Hugging Face! By burtenshaw and 6 others • Apr 5 • 146
Dolphin: A Large-Scale Automatic Speech Recognition Model for Eastern Languages Paper • 2503.20212 • Published Mar 26 • 6
Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models Paper • 2503.09573 • Published Mar 12 • 73
Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia Paper • 2503.07920 • Published Mar 10 • 100
Emergent Misalignment: Narrow finetuning can produce broadly misaligned LLMs Paper • 2502.17424 • Published Feb 24 • 4
SEA-HELM: Southeast Asian Holistic Evaluation of Language Models Paper • 2502.14301 • Published Feb 20 • 2