InfiGUI-G1: Advancing GUI Grounding with Adaptive Exploration Policy Optimization Paper โข 2508.05731 โข Published 10 days ago โข 25
InfiGUI-G1: Advancing GUI Grounding with Adaptive Exploration Policy Optimization Paper โข 2508.05731 โข Published 10 days ago โข 25
InfiGUI-R1: Advancing Multimodal GUI Agents from Reactive Actors to Deliberative Reasoners Paper โข 2504.14239 โข Published Apr 19 โข 14
InfiGUI-R1: Advancing Multimodal GUI Agents from Reactive Actors to Deliberative Reasoners Paper โข 2504.14239 โข Published Apr 19 โข 14
InfiR : Crafting Effective Small Language Models and Multimodal Small Language Models in Reasoning Paper โข 2502.11573 โข Published Feb 17 โข 8
Visual Anchors Are Strong Information Aggregators For Multimodal Large Language Model Paper โข 2405.17815 โข Published May 28, 2024
InfiGUIAgent: A Multimodal Generalist GUI Agent with Native Reasoning and Reflection Paper โข 2501.04575 โข Published Jan 8 โข 24
InfiGUIAgent: A Multimodal Generalist GUI Agent with Native Reasoning and Reflection Paper โข 2501.04575 โข Published Jan 8 โข 24
TรLU 3: Pushing Frontiers in Open Language Model Post-Training Paper โข 2411.15124 โข Published Nov 22, 2024 โข 65
DreamClear: High-Capacity Real-World Image Restoration with Privacy-Safe Dataset Curation Paper โข 2410.18666 โข Published Oct 24, 2024 โข 19
DreamClear: High-Capacity Real-World Image Restoration with Privacy-Safe Dataset Curation Paper โข 2410.18666 โข Published Oct 24, 2024 โข 19
DreamClear: High-Capacity Real-World Image Restoration with Privacy-Safe Dataset Curation Paper โข 2410.18666 โข Published Oct 24, 2024 โข 19 โข 3
InfiMM-WebMath-40B: Advancing Multimodal Pre-Training for Enhanced Mathematical Reasoning Paper โข 2409.12568 โข Published Sep 19, 2024 โข 51