Mantis-VL/intern_vl_25_llava_next_700k_pretrain_packing_4096 Feature Extraction • Updated 4 days ago • 2
Mantis-VL/qwen2-vl-video-eval_st_r2k_bad8k_49152_regression Text Classification • Updated 22 days ago • 14
Mantis-VL/qwen2-vl-video-eval_st_r2k_bad5k_49152_regression Text Classification • Updated 22 days ago • 14
Mantis-VL/qwen2-vl-video-eval_st_bad8k_49152_regression Text Classification • Updated 22 days ago • 8
Mantis-VL/qwen2-vl-video-eval_st_bad5k_49152_regression Text Classification • Updated 25 days ago • 28
Mantis-VL/qwen2-vl-video-eval_st_bad8k_55296_regression Text Classification • Updated 26 days ago • 16
Mantis-VL/qwen2-vl-video-eval_st_bad5k_55296_regression Text Classification • Updated 26 days ago • 15
Mantis-VL/qwen2-vl-video-eval_st_r2k_bad8k_61440_regression Text Classification • Updated 26 days ago • 26
Mantis-VL/qwen2-vl-video-eval_st_r2k_bad5k_61440_regression Text Classification • Updated 26 days ago • 35
Mantis-VL/qwen2-vl-video-eval_st_bad5k_61440_regression Text Classification • Updated 26 days ago • 32
Mantis-VL/qwen2-vl-video-eval_st_bad8k_61440_regression Text Classification • Updated 26 days ago • 17
MEGA-Bench: Scaling Multimodal Evaluation to over 500 Real-World Tasks Paper • 2410.10563 • Published Oct 14, 2024 • 38
MantisScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation Paper • 2406.15252 • Published Jun 21, 2024 • 14
WildVision: Evaluating Vision-Language Models in the Wild with Human Preferences Paper • 2406.11069 • Published Jun 16, 2024 • 14