FiTv2: Scalable and Improved Flexible Vision Transformer for Diffusion Model Paper • 2410.13925 • Published 11 days ago • 20
BiGR: Harnessing Binary Latent Codes for Image Generation and Improved Visual Representation Capabilities Paper • 2410.14672 • Published 10 days ago • 7
Scalable Ranked Preference Optimization for Text-to-Image Generation Paper • 2410.18013 • Published 5 days ago • 13