Fix q8 weights (use uint8 for q8; int8 produces poor results)

#18
by Xenova HF staff - opened
Hugging Face TB Research org
edited 29 days ago

Slightly better, but not great. Will play around with other settings

Xenova changed pull request title from Upload fixed q8 ONNX models (reduce_range=True, per_channel=True) to Fix q8 weights (use uint8 for q8; int8 produces poor results)
Xenova changed pull request status to merged
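
For readers who want to try the settings named in the title change, here is a minimal sketch, assuming the weights are produced with ONNX Runtime's dynamic quantization (`onnxruntime.quantization.quantize_dynamic`); the thread does not say which tool was used. The helper names `q8_settings` and `quantize_q8` and the file paths are hypothetical. The thread also does not say whether `reduce_range=True` and `per_channel=True` were kept in the merged version, so treat them as optional.

```python
def q8_settings(use_uint8: bool = True) -> dict:
    """Collect the quantization options mentioned in the PR titles.

    Hypothetical helper: the option names mirror quantize_dynamic's keyword
    arguments, but nothing here is taken from the PR's actual script.
    """
    return {
        # uint8 weights are what the final PR title recommends for q8.
        "weight_type": "QUInt8" if use_uint8 else "QInt8",
        # From the original PR title; assumed, not confirmed for the merge.
        "per_channel": True,
        "reduce_range": True,
    }


def quantize_q8(src: str, dst: str, use_uint8: bool = True) -> None:
    """Dynamically quantize the ONNX model at `src` and write it to `dst`."""
    # Imported lazily so the settings helper works without onnxruntime installed.
    from onnxruntime.quantization import QuantType, quantize_dynamic

    settings = q8_settings(use_uint8)
    quantize_dynamic(
        model_input=src,
        model_output=dst,
        per_channel=settings["per_channel"],
        reduce_range=settings["reduce_range"],
        weight_type=getattr(QuantType, settings["weight_type"]),
    )
```

A call such as `quantize_q8("model.onnx", "model_quantized.onnx")` (placeholder paths) would write a uint8-weight model. Passing `use_uint8=False` gives the int8 variant for a side-by-side comparison.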
