view article Article Introducing EuroBERT: A High-Performance Multilingual Encoder Model By EuroBERT and 3 others • about 4 hours ago • 45
Q-Filters Collection Pre-computed Q-Filters for efficient KV cache compression. • 15 items • Updated 7 days ago • 6
olmOCR Collection olmOCR is a document recognition pipeline for efficiently converting documents into plain text. olmocr.allenai.org • 3 items • Updated 12 days ago • 90
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training Paper • 2501.17161 • Published Jan 28 • 108
WILDCHAT-50M: A Deep Dive Into the Role of Synthetic Data in Post-Training Paper • 2501.18511 • Published Jan 30 • 19
Tulu 3 Models Collection All models released with Tulu 3 -- state of the art open post-training recipes. • 11 items • Updated 26 days ago • 93
Language Detection Collection StaticVectors models to detect language. Exports of FastText that run in NumPy without needing FastText • 2 items • Updated Jan 26 • 3