DeepHermes Collection Preview models of hybrid reasoner Hermes series • 6 items • Updated about 6 hours ago • 13
Article Introducing EuroBERT: A High-Performance Multilingual Encoder Model By EuroBERT and 3 others • 4 days ago • 115
Q-Filters Collection Pre-computed Q-Filters for efficient KV cache compression. • 15 items • Updated 10 days ago • 6
olmOCR Collection olmOCR is a document recognition pipeline for efficiently converting documents into plain text. olmocr.allenai.org • 3 items • Updated about 7 hours ago • 92
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training Paper • 2501.17161 • Published Jan 28 • 108
WILDCHAT-50M: A Deep Dive Into the Role of Synthetic Data in Post-Training Paper • 2501.18511 • Published Jan 30 • 19
Tulu 3 Models Collection All models released with Tulu 3 -- state of the art open post-training recipes. • 11 items • Updated about 7 hours ago • 93