Collections

Discover the best community collections!

Collections including paper arxiv:2412.04432
VisionLM
Collection by about 8 hours ago
video LM
Collection by 4 days ago
Video
Collection by 9 days ago
Unified MLLM
Unified model that generate Text, Image, Video
Cognition
Perception and abstraction. Each modality is tokenized and embedded into vectors for model to comprehend.
video
Collection by about 9 hours ago
daily papers
Collection by 18 days ago