Unable to load: size mismatches
```
RuntimeError: Error(s) in loading state_dict for LlamaForCausalLM:
size mismatch for model.layers.0.self_attn.k_proj.qzeros: copying a param with shape torch.Size([128, 512]) from checkpoint, the shape in current model is torch.Size([32, 512]).
```
And a whole bunch more like it. It might be the model or it might be my code (loading the quantized 7B with 128 groupsize works fine), but since I'm still learning, I can't say for sure.
Sounds like it's trying to load it as a 128 group size model? This one is 32g.
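You can actually read that off the traceback: the first dimension of `qzeros` is one row per quantization group, i.e. `in_features / groupsize`. A quick sanity check, assuming the usual LLaMA-7B hidden size of 4096 for `k_proj`:

```python
# qzeros has one row per quantization group: in_features / groupsize.
# Assuming the LLaMA-7B hidden size of 4096 for k_proj:
in_features = 4096

print(in_features // 32)   # 128 -> matches the checkpoint shape (a 32g model)
print(in_features // 128)  # 32  -> matches what your code built (a 128g model)
```

So the checkpoint is 32g, but the model your code constructed expects 128g.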
How are you loading this? A UI, or the GPTQ library by itself? Try renaming the model file to "4bit-32g.safetensors" and loading again.
And you are absolutely right, of course. I use a modified version of the _load_quant function from oobabooga, and while I did set the groupsize to 32 in my startup parameters, it seems it doesn't get passed along somewhere. Sorry about that, completely my fault.
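For anyone else who lands here: a minimal sketch of what the fix looks like. The call below follows the GPTQ-for-LLaMa style load_quant; the exact argument names, order, and import path vary between forks and webui versions, so treat the paths and names here as placeholders and check your own loader.

```python
# Minimal sketch, assuming a GPTQ-for-LLaMa style load_quant.
# Import path and paths below are illustrative, not exact.
from llama import load_quant  # from the GPTQ-for-LLaMa repo (assumed layout)

model = load_quant(
    "models/llama-13b",              # base model dir with the HF config (placeholder)
    "models/4bit-32g.safetensors",   # the quantized checkpoint from this repo
    wbits=4,
    groupsize=32,                    # must match how the checkpoint was quantized
)
```

The key point is simply that `groupsize` has to reach the function that builds the quantized layers; if it falls back to a default (often -1 or 128) anywhere along the way, you get exactly the size mismatch above.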