---
pipeline_tag: image-to-video
---
<p align="center">
<img src="./demos/demo-01.gif" width="70%" />
<img src="./demos/demo-02.gif" width="70%" />
<img src="./demos/demo-03.gif" width="70%" />
</p>
<p align="center">Samples generated by AnimateLCM-SVD-xt</p>
## Introduction
AnimateLCM-SVD-xt is a consistency-distilled version of [Stable Video Diffusion Image2Video-XT (SVD-xt)](https://huggingface.co/stabilityai/stable-video-diffusion-img2vid-xt), following the strategy proposed in the [AnimateLCM paper](https://arxiv.org/abs/2402.00769).
AnimateLCM-SVD-xt can generate good-quality image-conditioned videos of 25 frames at 576x1024 resolution in 2 to 8 steps.
## Computation comparison
AnimateLCM-SVD-xt generally produces good-quality demos in 4 steps without classifier-free guidance. Assuming a normal SVD model runs 25 steps with classifier-free guidance (two forward passes per step), this saves about 25 x 2 / 4 = 12.5 times the computation.
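The arithmetic behind the 12.5x figure can be sketched as follows. This is only an illustration: the helper name `denoiser_evaluations` is hypothetical, the baseline is assumed to be 25 steps with classifier-free guidance counted as two forward passes per step, and only denoiser evaluations are counted (other costs, such as encoding and decoding, are ignored).

```python
def denoiser_evaluations(steps: int, uses_cfg: bool) -> int:
    """Count denoiser forward passes needed to sample one video.

    Hypothetical helper for illustration: classifier-free guidance is
    assumed to cost two forward passes per sampling step.
    """
    passes_per_step = 2 if uses_cfg else 1
    return steps * passes_per_step


# Assumed baseline: normal SVD, 25 steps with classifier-free guidance.
baseline = denoiser_evaluations(steps=25, uses_cfg=True)

# AnimateLCM-SVD-xt: 4 steps without classifier-free guidance.
distilled = denoiser_evaluations(steps=4, uses_cfg=False)

savings = baseline / distilled
print(f"{baseline} vs {distilled} evaluations: {savings}x fewer")
```

Under the same assumptions, the 2-step and 8-step settings would correspond to 25x and 6.25x fewer evaluations, respectively.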
## Demos
*[Demo grid: five samples, each shown at 2 steps, 4 steps, and 8 steps, all with cfg=1.]*

Please contact Fu-Yun Wang ([email protected]) for the inference code and the scheduler design.