SeeClick: Harnessing GUI Grounding for Advanced Visual GUI Agents Paper • 2401.10935 • Published Jan 17, 2024 • 4
Interactive Evolution: A Neural-Symbolic Self-Training Framework For Large Language Models Paper • 2406.11736 • Published Jun 17, 2024 • 5
Vision-Language Models Can Self-Improve Reasoning via Reflection Paper • 2411.00855 • Published Oct 30, 2024 • 5
OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis Paper • 2412.19723 • Published Dec 27, 2024 • 82
AgentStore: Scalable Integration of Heterogeneous Agents As Specialized Generalist Computer Assistant Paper • 2410.18603 • Published Oct 24, 2024 • 32
Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling Paper • 2412.05271 • Published Dec 6, 2024 • 130