Charting and Navigating Hugging Face's Model Atlas Paper β’ 2503.10633 β’ Published about 17 hours ago β’ 22
Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models Paper β’ 2503.09573 β’ Published 1 day ago β’ 41
Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think Paper β’ 2502.20172 β’ Published 15 days ago β’ 27
FlexiDiT: Your Diffusion Transformer Can Easily Generate High-Quality Samples with Less Compute Paper β’ 2502.20126 β’ Published 15 days ago β’ 20
UniTok: A Unified Tokenizer for Visual Generation and Understanding Paper β’ 2502.20321 β’ Published 15 days ago β’ 29
view article Article Illustrating Reinforcement Learning from Human Feedback (RLHF) Dec 9, 2022 β’ 197
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference Paper β’ 2412.13663 β’ Published Dec 18, 2024 β’ 135
Running 535 535 Scaling test-time compute π Enhance math problem solving by scaling test-time compute
PaliGemma 2: A Family of Versatile VLMs for Transfer Paper β’ 2412.03555 β’ Published Dec 4, 2024 β’ 129