Update quant params structure

#2
by nickfraser - opened

Updates the following:

  • Add <weight|input>_zp_dtype to quant_param.json to differentiate between exported versions
  • Update input/weight zero-points to be int8 (not uint8)
  • Update the math model and tests to incorporate the above changes
  • Remove SmoothQuant multipliers from layers that aren't quantized
  • Upload new quant_param.json
  • Upload new params.safetensors
  • Upload new example output out.safetensors
  • Confirm compliant FID of model (FID ∈ (23.0108, 23.9501)): 23.89
  • Confirm compliant CLIP score of model (CLIP ∈ (31.686, 31.813)): 31.86

Strikethrough items were updated outside this PR.

nickfraser changed pull request status to open
nickfraser changed pull request status to merged
Your need to confirm your account before you can post a new comment.

Sign up or log in to comment