Fix q8 weights (use uint8 for q8; int8 produces poor results) 4f13109 verified Xenova HF staff commited on Nov 26, 2024
Upload fixed q8 ONNX models (reduce_range=True, per_channel=True) 06633a3 verified Xenova HF staff commited on Nov 26, 2024
Upload optimized ONNX weights (deduplicated) (#17) b36fc77 verified Xenova HF staff commited on Nov 26, 2024