OpenGVLab/InternVL-Chat-ViT-6B-Vicuna-13B-448px Visual Question Answering • Updated Aug 24, 2024 • 8 • 4
wiusdy/blip_pretrained_saffal_fashion_finetuning Visual Question Answering • 0.4B • Updated Feb 14, 2024 • 7
wiusdy/blip_pretrained_control_net_fashion_finetuning Visual Question Answering • 0.4B • Updated Feb 14, 2024 • 7
VladGK/ViLT_FT_Balanced_Binary_Abstract_Scenes Visual Question Answering • 0.1B • Updated Feb 17, 2024 • 3
VladGK/ViLT_Binary_Classifier_Abstract_Scenes Visual Question Answering • 0.1B • Updated Feb 18, 2024 • 3
ai2lumos/lumos_multimodal_ground_iterative-13B Visual Question Answering • Updated Mar 19, 2024 • 2 • 1