Holmes: Benchmark the Linguistic Competence of Language Models Paper • 2404.18923 • Published Apr 29
JuStRank: Benchmarking LLM Judges for System Ranking Paper • 2412.09569 • Published 12 days ago • 19
Bamba Collection Collection of Bamba - hybrid Mamba2 model architecture based models trained on open data • 8 items • Updated 7 days ago • 16