What Do VLMs NOTICE? A Mechanistic Interpretability Pipeline for Noise-free Text-Image Corruption and Evaluation Paper • 2406.16320 • Published Jun 24, 2024 • 3
Forgotten Polygons: Multimodal Large Language Models are Shape-Blind Paper • 2502.15969 • Published 16 days ago • 2
GauFRe: Gaussian Deformation Fields for Real-time Dynamic Novel View Synthesis Paper • 2312.11458 • Published Dec 18, 2023 • 5
AntGPT: Can Large Language Models Help Long-term Action Anticipation from Videos? Paper • 2307.16368 • Published Jul 31, 2023 • 12