Scaling RL to Long Videos
Efficient-Large-Model
community
SANA-1.5: Efficient Scaling of Training-Time and Inference-Time Compute in Linear Diffusion Transformer
- SANA 1.5: Efficient Scaling of Training-Time and Inference-Time Compute in Linear Diffusion Transformer
  Paper • 2501.18427 • Published • 20
- Efficient-Large-Model/SANA1.5_4.8B_1024px
  Text-to-Image • Updated • 80 • 22
- Efficient-Large-Model/SANA1.5_4.8B_1024px_diffusers
  Text-to-Image • Updated • 11
- Efficient-Large-Model/SANA1.5_1.6B_1024px
  Text-to-Image • Updated • 347 • 1
A series of VILA models that specialize in **long-context** abilities
- Efficient-Large-Model/NVILA-15B
  Text Generation • Updated • 35.9k • 20
- Efficient-Large-Model/NVILA-Lite-15B
  Text Generation • Updated • 186 • 4
- Efficient-Large-Model/NVILA-Lite-8B
  Text Generation • Updated • 5.16k • 2
- Efficient-Large-Model/NVILA-Lite-8B-stage2
  Text Generation • Updated • 9 • 1
SANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation
- SanaSprint • 412
  Ultra fast high quality image generation
- SANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation
  Paper • 2503.09641 • Published • 40
- Efficient-Large-Model/Sana_Sprint_1.6B_1024px
  Text-to-Image • Updated • 69 • 15
- Efficient-Large-Model/Sana_Sprint_0.6B_1024px
  Text-to-Image • Updated • 22 • 4
⚡️Sana: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
- Efficient-Large-Model/Sana_1600M_1024px
  Text-to-Image • Updated • 602 • 211
- Efficient-Large-Model/Sana_1600M_1024px_BF16
  Text-to-Image • Updated • 248 • 13
- Efficient-Large-Model/Sana_1600M_1024px_BF16_ControlNet_HED
  Text-to-Image • Updated • 37
- Efficient-Large-Model/Sana_600M_1024px_ControlNet_HED
  Text-to-Image • Updated • 45
- Efficient-Large-Model/Llama-3-VILA1.5-8B
  Text Generation • Updated • 17.8k • 35
- Efficient-Large-Model/VILA1.5-40b
  Text Generation • Updated • 982 • 17
- Efficient-Large-Model/VILA1.5-3b
  Text Generation • Updated • 8.63k • 29
- Efficient-Large-Model/VILA1.5-3b-AWQ
  Text Generation • Updated • 29 • 5