nm-testing
/

Meta-Llama-3-8B-Instruct-FP8-channel-output-activation-q_proj

compressed-tensors

Model card Files Files and versions

Meta-Llama-3-8B-Instruct-FP8-channel-output-activation-q_proj / recipe.yaml

horheynm's picture

Upload folder using huggingface_hub

d2c7533 verified 5 months ago

history blame contribute delete

331 Bytes

	quant_stage:
	quant_modifiers:
	QuantizationModifier:
	config_groups:
	fp8_attention_qkv_proj:
	weights: {num_bits: 8, type: float, strategy: tensor}
	output_activations: {num_bits: 8, type: float, strategy: channel, dynamic: false,
	symmetric: true}
	targets: ['re:.*q_proj']