An Open Recipe: Adapting Language-Specific LLMs to a Reasoning Model in One Day via Model Merging Paper • 2502.09056 • Published Feb 13 • 30
DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding Paper • 2412.10302 • Published Dec 13, 2024 • 17
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models Paper • 2402.03300 • Published Feb 5, 2024 • 110
Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 • 10 items • Updated 2 days ago • 416
Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Though Paper • 2501.04682 • Published Jan 8 • 94
Typhoon 2 Multimodal Collection Latest Official Multimodal ThaiLLM release by SCB 10X. • 3 items • Updated about 13 hours ago • 3
Typhoon 2 Text Collection Latest Official Text ThaiLLM release by SCB 10X. • 12 items • Updated about 13 hours ago • 4
Cephalo: Multi-Modal Vision-Language Models for Bio-Inspired Materials Analysis and Design Paper • 2405.19076 • Published May 29, 2024 • 2
SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models Paper • 2303.08896 • Published Mar 15, 2023 • 4
MQAG: Multiple-choice Question Answering and Generation for Assessing Information Consistency in Summarization Paper • 2301.12307 • Published Jan 28, 2023 • 3