Marcus Gawronsky
marcusinthesky
AI & ML interests
Representation Learning
Organizations
Collections
9
-
MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models
Paper • 2410.10139 • Published • 48 -
MEGA-Bench: Scaling Multimodal Evaluation to over 500 Real-World Tasks
Paper • 2410.10563 • Published • 33 -
LiveXiv -- A Multi-Modal Live Benchmark Based on Arxiv Papers Content
Paper • 2410.10783 • Published • 25 -
TVBench: Redesigning Video-Language Evaluation
Paper • 2410.07752 • Published • 5
models
1
datasets
None public yet