GTR: Guided Thought Reinforcement Prevents Thought Collapse in RL-based VLM Agent Training • Paper • arXiv:2503.08525 • Published 2 days ago
Query of CC: Unearthing Large Scale Domain-Specific Knowledge from Public Corpora • Paper • arXiv:2401.14624 • Published Jan 26, 2024