Facebert pbelcak/UltraFastBERT-1x11-long Updated Nov 22, 2023 • 74 • 75 Exponentially Faster Language Modelling Paper • 2311.10770 • Published Nov 15, 2023 • 117
Sparse MoE mistralai/Mixtral-8x7B-Instruct-v0.1 Text Generation • Updated Aug 19, 2024 • 4.05M • • 4.26k