matlok
's Collections
Models - Video
updated
VideoPrism: A Foundational Visual Encoder for Video Understanding
Paper
•
2402.13217
•
Published
•
24
EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with
Audio2Video Diffusion Model under Weak Conditions
Paper
•
2402.17485
•
Published
•
191
Qwen/Qwen-VL-Chat
Text Generation
•
Updated
•
22.1k
•
349
MovieLLM: Enhancing Long Video Understanding with AI-Generated Movies
Paper
•
2403.01422
•
Published
•
27
World Model on Million-Length Video And Language With RingAttention
Paper
•
2402.08268
•
Published
•
38
Valley: Video Assistant with Large Language model Enhanced abilitY
Paper
•
2306.07207
•
Published
•
2
Video-ChatGPT: Towards Detailed Video Understanding via Large Vision and
Language Models
Paper
•
2306.05424
•
Published
•
7
Lightricks/LTX-Video
Image-to-Video
•
Updated
•
93.2k
•
899
genmo/mochi-1-preview
Text-to-Video
•
Updated
•
42.9k
•
1.15k
FastVideo/FastHunyuan
Text-to-Video
•
Updated
•
938
•
160
FastVideo/FastMochi-diffusers
Text-to-Video
•
Updated
•
115
•
15