SANA 1.5: Efficient Scaling of Training-Time and Inference-Time Compute in Linear Diffusion Transformer
Paper • 2501.18427
2 ** search_round
) and repeat 1 - 3.
diffusers 🧨
bitsandbytes as the official backend but using others like torchao is already very simple.
enable_model_cpu_offload()
torch.compile()
them.
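
The fragment above mentions a pool of `2 ** search_round` and repeating steps 1 - 3. One way to read it is as a random search over starting noises in which the candidate pool doubles each round. The sketch below illustrates only that schedule and makes several assumptions not confirmed by the text: `search_round` starts at 1, the best candidate is carried across rounds, each "noise" is represented by an integer seed, and `toy_verifier` is a hypothetical stand-in for a real image-scoring model. No diffusion pipeline is involved.

```python
import random


def toy_verifier(noise: int) -> float:
    # Hypothetical scorer: a real setup would generate an image from the
    # noise and score it with a verifier model. Returns a value in [0, 1).
    return (noise % 1000) / 1000.0


def random_search(verifier, num_rounds: int, seed: int = 0):
    """Toy random search; the pool size in each round is 2 ** search_round."""
    rng = random.Random(seed)
    best_noise, best_score = None, float("-inf")
    pool_sizes = []
    for search_round in range(1, num_rounds + 1):  # assumption: rounds start at 1
        num_noises = 2 ** search_round
        pool_sizes.append(num_noises)
        # Sample this round's candidate noises (represented here by seeds).
        noises = [rng.randrange(2 ** 32) for _ in range(num_noises)]
        # Score every candidate and keep the best one seen so far.
        for noise in noises:
            score = verifier(noise)
            if score > best_score:
                best_noise, best_score = noise, score
    return best_noise, best_score, pool_sizes


best_noise, best_score, pool_sizes = random_search(toy_verifier, num_rounds=4)
print(pool_sizes)  # [2, 4, 8, 16]
```

Under these assumptions, four rounds evaluate 2 + 4 + 8 + 16 = 30 candidates in total, so the cost of the search roughly doubles with each extra round.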