Collections
Collections including paper arxiv:2412.09871
- RobustFT: Robust Supervised Fine-tuning for Large Language Models under Noisy Response
  Paper • 2412.14922 • Published • 90
- B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners
  Paper • 2412.17256 • Published • 48
- OpenAI o1 System Card
  Paper • 2412.16720 • Published • 34
- Revisiting In-Context Learning with Long Context Language Models
  Paper • 2412.16926 • Published • 33