Efficient Retrieval of Temporal Event Sequences from Textual Descriptions Paper • 2410.14043 • Published Oct 17
EconLogicQA: A Question-Answering Benchmark for Evaluating Large Language Models in Economic Sequential Reasoning Paper • 2405.07938 • Published May 13
SecQA: A Concise Question-Answering Dataset for Evaluating Large Language Models in Computer Security Paper • 2312.15838 • Published Dec 26, 2023