An Empirical Study on Eliciting and Improving R1-like Reasoning Models Paper • 2503.04548 • Published 6 days ago • 8
How to Synthesize Text Data without Model Collapse? Paper • 2412.14689 • Published Dec 19, 2024 • 51 • 4
On Domain-Specific Post-Training for Multimodal Large Language Models Paper • 2411.19930 • Published Nov 29, 2024 • 27
On Domain-Specific Post-Training for Multimodal Large Language Models Paper • 2411.19930 • Published Nov 29, 2024 • 27
On Domain-Specific Post-Training for Multimodal Large Language Models Paper • 2411.19930 • Published Nov 29, 2024 • 27 • 3
instruction-pretrain/general-instruction-augmented-corpora Preview • Updated 11 days ago • 8.49k • 16
Adapting Large Language Models via Reading Comprehension Paper • 2309.09530 • Published Sep 18, 2023 • 77 • 3
Instruction Pre-Training: Language Models are Supervised Multitask Learners Paper • 2406.14491 • Published Jun 20, 2024 • 90 • 25
Instruction Pre-Training: Language Models are Supervised Multitask Learners Paper • 2406.14491 • Published Jun 20, 2024 • 90
instruction-pretrain/ft-instruction-synthesizer-collection Viewer • Updated 11 days ago • 249k • 571 • 62