Enhance speed by using nn.layernorm and nn.groupnorm (triton-lang/triton#5712) 9a09de9 verified zhiyuan8 committed 4 days ago