stan-hua's picture
Push folder to HuggingFace Hub
d4d31a0 verified
raw
history blame contribute delete
179 Bytes
DEFAULT_stage:
DEFAULT_modifiers:
SmoothQuantModifier: {smoothing_strength: 0.8}
QuantizationModifier:
ignore: [lm_head]
targets: Linear
scheme: W8A16