ModernBERT Collection Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated 7 days ago • 93
Multilingual LLM Evaluation Collection Multilingual Evaluation Benchmarks • 6 items • Updated 13 days ago • 8
Uhura Collection Contains Benchmark datasets for Arc-Easy and Truthful-QA collected through human translation of existing datasets • 2 items • Updated Nov 24 • 1