Samples generated by AnimateLCM-SVD-xt

Introduction

Consistency Distilled Stable Video Diffusion Image2Video-XT (SVD-xt) following the strategy proposed in AnimateLCM-paper. AnimateLCM-SVD-xt can generate good quality image-conditioned videos with 25 frames in 2~8 steps with 576x1024 resolutions.

Computation comparsion

AnimateLCM-SVD-xt can generally produces demos with good quality in 4 steps without requiring the classifier-free guidance, and therefore can save 25 x 2 / 4 = 12.5 times compuation resources compared with normal SVD models.

Demos

Alt text 1 Alt text 2 Alt text 3
2 steps, cfg=1 4 steps, cfg=1 8 steps, cfg=1
Alt text 1 Alt text 2 Alt text 3
2 steps, cfg=1 4 steps, cfg=1 8 steps, cfg=1
Alt text 1 Alt text 2 Alt text 3
2 steps, cfg=1 4 steps, cfg=1 8 steps, cfg=1
Alt text 1 Alt text 2 Alt text 3
2 steps, cfg=1 4 steps, cfg=1 8 steps, cfg=1
Alt text 1 Alt text 2 Alt text 3
2 steps, cfg=1 4 steps, cfg=1 8 steps, cfg=1

I have launched a gradio demo at AnimateLCM SVD space. Should you have any questions, please contact Fu-Yun Wang ([email protected]). I might respond a bit later. Thank you!

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.
The model cannot be deployed to the HF Inference API: The model has no library tag.

Spaces using wangfuyun/AnimateLCM-SVD-xt 6