AlphaMaze: Enhancing Large Language Models' Spatial Intelligence via GRPO
Paper
•
2502.14669
•
Published
•
11
Great job guys, reasoning bringing so many potential!
we also have similiar idea! but only applied for maze