view article Article Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models Jun 24, 2024 • 190
view article Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM 15 days ago • 346
SEAP: Training-free Sparse Expert Activation Pruning Unlock the Brainpower of Large Language Models Paper • 2503.07605 • Published 16 days ago • 65
view article Article A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality 23 days ago • 69
MM-Eureka: Exploring Visual Aha Moment with Rule-based Large-scale Reinforcement Learning Paper • 2503.07365 • Published 16 days ago • 54
Feature-Level Insights into Artificial Text Detection with Sparse Autoencoders Paper • 2503.03601 • Published 21 days ago • 216
Logic-RL: Unleashing LLM Reasoning with Rule-Based Reinforcement Learning Paper • 2502.14768 • Published Feb 20 • 47
WebGames: Challenging General-Purpose Web-Browsing AI Agents Paper • 2502.18356 • Published 29 days ago • 12
AlphaMaze: Enhancing Large Language Models' Spatial Intelligence via GRPO Paper • 2502.14669 • Published Feb 20 • 12
view article Article Illustrating Reinforcement Learning from Human Feedback (RLHF) Dec 9, 2022 • 210
SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models Paper • 2502.09604 • Published Feb 13 • 34
Fino1: On the Transferability of Reasoning Enhanced LLMs to Finance Paper • 2502.08127 • Published Feb 12 • 52
Scaling Pre-training to One Hundred Billion Data for Vision Language Models Paper • 2502.07617 • Published Feb 11 • 29