amd-quark/llama-tiny-fp8-quark-quant-method
amd-quark/llama-tiny-fp8-quant-method
amd-quark/quark-assets
amd-quark/quark-legacy-int8 • 12
amd-quark/quark-legacy-fp8 • 44
amd-quark/quark-legacy-awq • 55
amd-quark/dummy-config-awq • 2.51k
amd-quark/llama-small-int4-per-group-sym-awq • 144
amd-quark/llama-tiny-int4-per-group-sym • 126
amd-quark/llama-tiny-w-fp8-a-fp8-o-fp8 • 122
amd-quark/llama-tiny-w-fp8-a-fp8 • 134
amd-quark/llama-tiny-w-int8-b-int8-per-tensor • 119
amd-quark/llama-tiny-w-int8-per-tensor • 133
amd-quark/test-qdq