sdxl-quant-fp8 / test_attn.py

Commit History

[math_model] Make it more obvious that softmax scale comes from the quantizer
db5a15b

nickfraser commited on

Added SDPA math model & test
3fea540

nickfraser commited on