Load 4bit models 4x faster Collection Native bitsandbytes 4bit pre quantized models β’ 25 items β’ Updated about 18 hours ago β’ 50
view article Article makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch By AviSoori1x β’ May 7 β’ 42