MultiRef ONE-Lab/MultiRef-benchmark Viewer • Updated Jun 11 • 4.94k • 306 ONE-Lab/MultiRef-dataset Viewer • Updated Jun 11 • 103k • 7 • 1
MLLM-as-a-Judge Benchmark of MLLM-as-a-Judge. MLLM-as-a-Judge: Assessing Multimodal LLM-as-a-Judge with Vision-Language Benchmark Paper • 2402.04788 • Published Feb 7, 2024 ONE-Lab/MLLM-as-a-Judge Viewer • Updated Oct 23, 2024 • 4.12k • 831 • 4
MLLM-as-a-Judge: Assessing Multimodal LLM-as-a-Judge with Vision-Language Benchmark Paper • 2402.04788 • Published Feb 7, 2024
GUI-World Models and datasets from paper GUI-World. GUI-WORLD: A Dataset for GUI-oriented Multimodal LLM-based Agents Paper • 2406.10819 • Published Jun 16, 2024 • 1 ONE-Lab/GUI-Vid Video-Text-to-Text • Updated Mar 26 • 5 ONE-Lab/GUI-World Preview • Updated Mar 26 • 3.91k • 31
GUI-WORLD: A Dataset for GUI-oriented Multimodal LLM-based Agents Paper • 2406.10819 • Published Jun 16, 2024 • 1
MixSet Benchmark dataset and model checkpoints of paper "LLM-as-a-Coauthor: Can Mixed Human-Written and Machine-Generated Text Be Detected?" LLM-as-a-Coauthor: Can Mixed Human-Written and Machine-Generated Text Be Detected? Paper • 2401.05952 • Published Jan 11, 2024 ONE-Lab/MixSet Viewer • Updated Apr 13, 2024 • 3.6k • 40 • 2
LLM-as-a-Coauthor: Can Mixed Human-Written and Machine-Generated Text Be Detected? Paper • 2401.05952 • Published Jan 11, 2024
LiveVQA Dataset, benchmark and model checkpoints from paper LiveVQA. ONE-Lab/LiveVQA-new Preview • Updated 18 days ago • 174 • 1 fmy666/livevqa-benchmark Viewer • Updated Jun 11 • 2 • 361 • 1 LiveVQA: Live Visual Knowledge Seeking Paper • 2504.05288 • Published Apr 7 • 15 ONE-Lab/LiveVQA-2025 Preview • Updated May 16 • 24 • 1
MultiRef ONE-Lab/MultiRef-benchmark Viewer • Updated Jun 11 • 4.94k • 306 ONE-Lab/MultiRef-dataset Viewer • Updated Jun 11 • 103k • 7 • 1
MixSet Benchmark dataset and model checkpoints of paper "LLM-as-a-Coauthor: Can Mixed Human-Written and Machine-Generated Text Be Detected?" LLM-as-a-Coauthor: Can Mixed Human-Written and Machine-Generated Text Be Detected? Paper • 2401.05952 • Published Jan 11, 2024 ONE-Lab/MixSet Viewer • Updated Apr 13, 2024 • 3.6k • 40 • 2
LLM-as-a-Coauthor: Can Mixed Human-Written and Machine-Generated Text Be Detected? Paper • 2401.05952 • Published Jan 11, 2024
MLLM-as-a-Judge Benchmark of MLLM-as-a-Judge. MLLM-as-a-Judge: Assessing Multimodal LLM-as-a-Judge with Vision-Language Benchmark Paper • 2402.04788 • Published Feb 7, 2024 ONE-Lab/MLLM-as-a-Judge Viewer • Updated Oct 23, 2024 • 4.12k • 831 • 4
MLLM-as-a-Judge: Assessing Multimodal LLM-as-a-Judge with Vision-Language Benchmark Paper • 2402.04788 • Published Feb 7, 2024
LiveVQA Dataset, benchmark and model checkpoints from paper LiveVQA. ONE-Lab/LiveVQA-new Preview • Updated 18 days ago • 174 • 1 fmy666/livevqa-benchmark Viewer • Updated Jun 11 • 2 • 361 • 1 LiveVQA: Live Visual Knowledge Seeking Paper • 2504.05288 • Published Apr 7 • 15 ONE-Lab/LiveVQA-2025 Preview • Updated May 16 • 24 • 1
GUI-World Models and datasets from paper GUI-World. GUI-WORLD: A Dataset for GUI-oriented Multimodal LLM-based Agents Paper • 2406.10819 • Published Jun 16, 2024 • 1 ONE-Lab/GUI-Vid Video-Text-to-Text • Updated Mar 26 • 5 ONE-Lab/GUI-World Preview • Updated Mar 26 • 3.91k • 31
GUI-WORLD: A Dataset for GUI-oriented Multimodal LLM-based Agents Paper • 2406.10819 • Published Jun 16, 2024 • 1