BIMBA: Selective-Scan Compression for Long-Range Video Question Answering • Paper • 2503.09590 • Published 14 days ago • 3
LLaVA-Video • Collection • Models focused on video understanding (previously known as LLaVA-NeXT-Video) • 8 items • Updated Feb 21 • 61