OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis Paper • 2412.19723 • Published 16 days ago • 78
Diving into Self-Evolving Training for Multimodal Reasoning Paper • 2412.17451 • Published 21 days ago • 42
Pretraining in Deep Reinforcement Learning: A Survey Paper • 2211.03959 • Published Nov 8, 2022 • 1
VLFeedback: A Large-Scale AI Feedback Dataset for Large Vision-Language Models Alignment Paper • 2410.09421 • Published Oct 12, 2024
VLRewardBench: A Challenging Benchmark for Vision-Language Generative Reward Models Paper • 2411.17451 • Published Nov 26, 2024 • 10