Creative Reasoning Assistant Collection Thinking creative model collection • 4 items • Updated 27 days ago • 1
Minitron Collection A family of compressed models obtained via pruning and knowledge distillation • 12 items • Updated 7 days ago • 60
Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences Paper • 2404.03715 • Published Apr 4, 2024 • 61