Is your benchmark truly adversarial? AdvScore: Evaluating Human-Grounded Adversarialness Paper • 2406.16342 • Published Jun 24, 2024
BenTo: Benchmark Task Reduction with In-Context Transferability Paper • 2410.13804 • Published Oct 17, 2024 • 20
CAIMIRA Paper & Data Collection Question Answering Datasets on Quizbowl questions and their progressive clues from various competitions. • 5 items • Updated Nov 9, 2024 • 1
CAIMIRA Paper & Data Collection Question Answering Datasets on Quizbowl questions and their progressive clues from various competitions. • 5 items • Updated Nov 9, 2024 • 1
Do great minds think alike? Investigating Human-AI Complementarity in Question Answering with CAIMIRA Paper • 2410.06524 • Published Oct 9, 2024 • 4
Do great minds think alike? Investigating Human-AI Complementarity in Question Answering with CAIMIRA Paper • 2410.06524 • Published Oct 9, 2024 • 4
MATE: Multi-view Attention for Table Transformer Efficiency Paper • 2109.04312 • Published Sep 9, 2021
CAIMIRA Paper & Data Collection Question Answering Datasets on Quizbowl questions and their progressive clues from various competitions. • 5 items • Updated Nov 9, 2024 • 1
CAIMIRA Paper & Data Collection Question Answering Datasets on Quizbowl questions and their progressive clues from various competitions. • 5 items • Updated Nov 9, 2024 • 1
CAIMIRA Paper & Data Collection Question Answering Datasets on Quizbowl questions and their progressive clues from various competitions. • 5 items • Updated Nov 9, 2024 • 1