CODESIM: Multi-Agent Code Generation and Problem Solving through Simulation-Driven Planning and Debugging Paper • 2502.05664 • Published 5 days ago • 20
MapEval: A Map-Based Evaluation of Geo-Spatial Reasoning in Foundation Models Paper • 2501.00316 • Published Dec 31, 2024 • 22
MapQaTor: A System for Efficient Annotation of Map Query Datasets Paper • 2412.21015 • Published Dec 30, 2024 • 10