opt-125m-gptq2 / quantize_config.json

Commit History

AutoGPTQ model for facebook/opt-125m: 4 bits, gr128, desc_act=False
186b371
verified

iproskurina committed on
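
For orientation, the settings named in the commit message above can be sketched as the kind of JSON that a quantize_config.json file typically holds. This is only an illustrative reconstruction from the commit message: reading "gr128" as a group size of 128 is an assumption, the key names follow the common AutoGPTQ convention (bits, group_size, desc_act), and the actual file in this repository may contain additional fields that are not shown here.

```python
import json

# Hypothetical reconstruction based only on the commit message;
# the real quantize_config.json may include more fields.
quantize_config = {
    "bits": 4,           # "4 bits" in the commit message
    "group_size": 128,   # assumed meaning of "gr128"
    "desc_act": False,   # "desc_act=False" in the commit message
}


def to_json(config: dict) -> str:
    """Serialize the config dict to pretty-printed JSON text."""
    return json.dumps(config, indent=2)


if __name__ == "__main__":
    print(to_json(quantize_config))
```

The dictionary is kept to the three values the commit message states so that nothing beyond the source is implied.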