Phantom of Latent for Large Language and Vision Models Paper • 2409.14713 • Published Sep 23, 2024 • 28
Are Vision-Language Models Truly Understanding Multi-vision Sensor? Paper • 2412.20750 • Published 13 days ago • 19
Are Vision-Language Models Truly Understanding Multi-vision Sensor? Paper • 2412.20750 • Published 13 days ago • 19
Are Vision-Language Models Truly Understanding Multi-vision Sensor? Paper • 2412.20750 • Published 13 days ago • 19 • 2
VLsI: Verbalized Layers-to-Interactions from Large to Small Vision Language Models Paper • 2412.01822 • Published Dec 2, 2024 • 14
Phantom of Latent for Large Language and Vision Models Paper • 2409.14713 • Published Sep 23, 2024 • 28
SPARK: Multi-Vision Sensor Perception and Reasoning Benchmark for Large-scale Vision-Language Models Paper • 2408.12114 • Published Aug 22, 2024 • 13
SPARK: Multi-Vision Sensor Perception and Reasoning Benchmark for Large-scale Vision-Language Models Paper • 2408.12114 • Published Aug 22, 2024 • 13
SPARK: Multi-Vision Sensor Perception and Reasoning Benchmark for Large-scale Vision-Language Models Paper • 2408.12114 • Published Aug 22, 2024 • 13 • 3
TroL: Traversal of Layers for Large Language and Vision Models Paper • 2406.12246 • Published Jun 18, 2024 • 34
TroL Collection Super Efficient Large Language and Models surpassing GPT-4V! Let's say TroL • 4 items • Updated Jul 30, 2024 • 1
TroL: Traversal of Layers for Large Language and Vision Models Paper • 2406.12246 • Published Jun 18, 2024 • 34