OpenGVLab/VideoChat2_HD_stage4_Mistral_7B_hf Video-Text-to-Text โข Updated 24 days ago โข 964 โข 3
Task Preference Optimization: Improving Multimodal Large Language Models with Vision Task Alignment Paper โข 2412.19326 โข Published 16 days ago โข 18
Task Preference Optimization: Improving Multimodal Large Language Models with Vision Task Alignment Paper โข 2412.19326 โข Published 16 days ago โข 18 โข 2