EXL2 quants of Sheared-LLaMA-2.7B from princeton-nlp. This is a pruned and further pre-trained version of Llama2-7B
2.50 bits per weight
2.70 bits per weight
3.00 bits per weight
3.50 bits per weight
4.00 bits per weight
4.50 bits per weight
5.00 bits per weight
6.00 bits per weight