ConvLLaVA A collection of ConvLLaVA models. ConvLLaVA: Hierarchical Backbones as Visual Encoder for Large Multimodal Models Paper • 2405.15738 • Published May 24, 2024 • 47 ConvLLaVA/ConvLLaVA-sft-768 Text Generation • Updated May 28, 2024 • 11 • 1 ConvLLaVA/ConvLLaVA-sft-1024 Text Generation • Updated May 28, 2024 • 3 ConvLLaVA/ConvLLaVA-sft-1536 Text Generation • Updated May 28, 2024 • 6
ConvLLaVA: Hierarchical Backbones as Visual Encoder for Large Multimodal Models Paper • 2405.15738 • Published May 24, 2024 • 47
ConvLLaVA A collection of ConvLLaVA models. ConvLLaVA: Hierarchical Backbones as Visual Encoder for Large Multimodal Models Paper • 2405.15738 • Published May 24, 2024 • 47 ConvLLaVA/ConvLLaVA-sft-768 Text Generation • Updated May 28, 2024 • 11 • 1 ConvLLaVA/ConvLLaVA-sft-1024 Text Generation • Updated May 28, 2024 • 3 ConvLLaVA/ConvLLaVA-sft-1536 Text Generation • Updated May 28, 2024 • 6
ConvLLaVA: Hierarchical Backbones as Visual Encoder for Large Multimodal Models Paper • 2405.15738 • Published May 24, 2024 • 47