GGUF for this one?

#5 by UnsignedLongshanks

Hey man, appreciate your work. This model flew completely under my radar, and probably everyone else's too given the size. I was curious whether it would be possible to convert this one to GGUF format? I reckon quants wouldn't be needed, since they probably wouldn't be as good as (or as popular as) the smaller Flux variants like Dev and Schnell, but even at full size this model is pretty similar in size to the 30B LLM I can run in GGUF format with KoboldCPP by offloading half its blocks to the second of my two RTX 4090s. I might have to dump CLIP/VAE to the CPU, but worse things could happen.
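To be clear about the split I mean, something like this in torch pseudocode (the module names are made up, just to illustrate the idea):

```python
# Made-up module layout, only showing the split I'm after: half the
# transformer blocks on each GPU, text encoder + VAE parked on CPU.
def place_model(model):
    blocks = list(model.blocks)  # hypothetical attribute
    half = len(blocks) // 2
    for i, block in enumerate(blocks):
        block.to("cuda:0" if i < half else "cuda:1")
    model.clip.to("cpu")  # prompt encoding can run on CPU
    model.vae.to("cpu")   # latent decoding too
```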

If you're too busy to do it yourself I'd be happy to try it; let me know if there are any tutorials you'd recommend, since this seems a bit too niche to find anything useful by googling. With llama.cpp it was pretty straightforward, but I imagine this would be different. Thanks!

Hi. This model was mostly just an experiment to test how well self-merges work, plus a very small amount of training to make it usable again. It's probably worse than base Flux in every metric except size lol.

I could've sworn someone put up some quants for this model on civitai before, but I can't find them now. If you want to take a shot at it, the tools folder in the ComfyUI-GGUF repo has most of the info you'll need to quantize it yourself. It's pretty much just the llama.cpp repo with a patch on top to make it handle image models.
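Roughly, the first step is just writing the safetensors weights back out as a GGUF file. Here's a minimal sketch of that using the `gguf` and `safetensors` pip packages (the filenames and the "flux" arch string are placeholders; the actual tools/convert.py handles more, like key fixes and keeping some tensors at higher precision):

```python
# Bare-bones safetensors -> GGUF dump. Placeholder filenames; the real
# conversion script in the ComfyUI-GGUF tools folder does more than this.
import torch
from safetensors.torch import load_file
from gguf import GGUFWriter

state_dict = load_file("flux-merge.safetensors")  # hypothetical path

writer = GGUFWriter("flux-merge-F16.gguf", arch="flux")
for name, tensor in state_dict.items():
    # Cast everything to F16 for the unquantized base file; the actual
    # quant formats get made from this file afterwards.
    writer.add_tensor(name, tensor.to(torch.float16).cpu().numpy())

writer.write_header_to_file()
writer.write_kv_data_to_file()
writer.write_tensors_to_file()
writer.close()
```

From there, if I remember right, you build llama.cpp with the repo's patch applied and run its quantize binary on the F16 file to get the smaller quant types.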
