Seedream 2.0: A Native Chinese-English Bilingual Image Generation Foundation Model Paper • 2503.07703 • Published 3 days ago • 28
ProReflow: Progressive Reflow with Decomposed Velocity Paper • 2503.04824 • Published 9 days ago • 9
Unified Reward Model for Multimodal Understanding and Generation Paper • 2503.05236 • Published 6 days ago • 103
ART: Anonymous Region Transformer for Variable Multi-Layer Transparent Image Generation Paper • 2502.18364 • Published 16 days ago • 34
CineMaster: A 3D-Aware and Controllable Framework for Cinematic Text-to-Video Generation Paper • 2502.08639 • Published 29 days ago • 37
Critical Tokens Matter: Token-Level Contrastive Estimation Enhence LLM's Reasoning Capability Paper • 2411.19943 • Published Nov 29, 2024 • 58
StyleCrafter: Enhancing Stylized Text-to-Video Generation with Style Adapter Paper • 2312.00330 • Published Dec 1, 2023 • 13