arxiv:2411.13543
Maciej Wolczyk
rahid
AI & ML interests
None yet
Recent Activity
authored
a paper
about 2 months ago
BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games
Organizations
Papers
1
models
None public yet
datasets
None public yet