InternVL2.5-MPO Collection Enhancing the Reasoning Ability of MLLMs via Mixed Preference Optimization • 16 items • Updated about 12 hours ago • 20
NuNER: Entity Recognition Encoder Pre-training via LLM-Annotated Data Paper • 2402.15343 • Published Feb 23 • 13
ModernBERT Collection Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated 5 days ago • 88
Qwen2.5-Math Collection Math-specific model series based on Qwen2.5 • 9 items • Updated 27 days ago • 58
Common Models Collection The first generation of models pretrained on Common Corpus. • 5 items • Updated 19 days ago • 27
OpenScholar: Synthesizing Scientific Literature with Retrieval-augmented LMs Paper • 2411.14199 • Published Nov 21 • 28
Tulu 3 Datasets Collection All datasets released with Tulu 3 -- state of the art open post-training recipes. • 32 items • Updated 27 days ago • 62
OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset Paper • 2402.10176 • Published Feb 15 • 36
Awesome SFT datasets Collection A curated list of interesting datasets to fine-tune language models with. • 43 items • Updated Apr 12 • 123
OpenScholar_V1 Collection The set of models, index, data associated with the paper "OpenScholar: Synthesizing Scientific Literature with Retrieval-Augmented LMs". • 8 items • Updated Nov 22 • 30
Tulu 3 Models Collection All models released with Tulu 3 -- state of the art open post-training recipes. • 7 items • Updated 27 days ago • 29
Thinking LLMs: General Instruction Following with Thought Generation Paper • 2410.10630 • Published Oct 14 • 17
TPO: Aligning Large Language Models with Multi-branch & Multi-step Preference Trees Paper • 2410.12854 • Published Oct 10 • 1