R1-V Towards the Aha Moment of Vision-Language Models MMInstruction/Clevr_CoGenT_TrainA_R1 Viewer • Updated Feb 13 • 37.8k • 206 • 46 MMInstruction/SuperClevr_Val Viewer • Updated Feb 18 • 5k • 16 • 1 MMInstruction/Clevr_CoGenT_ValA Viewer • Updated Feb 3 • 5k • 561 • 1 MMInstruction/Clevr_CoGenT_ValB Viewer • Updated Feb 3 • 5k • 16 • 2
R1-V Towards the Aha Moment of Vision-Language Models MMInstruction/Clevr_CoGenT_TrainA_R1 Viewer • Updated Feb 13 • 37.8k • 206 • 46 MMInstruction/SuperClevr_Val Viewer • Updated Feb 18 • 5k • 16 • 1 MMInstruction/Clevr_CoGenT_ValA Viewer • Updated Feb 3 • 5k • 561 • 1 MMInstruction/Clevr_CoGenT_ValB Viewer • Updated Feb 3 • 5k • 16 • 2