Absolute Zero: Reinforced Self-play Reasoning with Zero Data Paper β’ 2505.03335 β’ Published May 6 β’ 182
view article Article Open-source DeepResearch β Freeing our search agents By m-ric and 4 others β’ Feb 4 β’ 1.28k
view article Article Mini-R1: Reproduce Deepseek R1 βaha momentβ a RL tutorial By open-r1 β’ Jan 31 β’ 50
view article Article Open-R1: a fully open reproduction of DeepSeek-R1 By eliebak and 2 others β’ Jan 28 β’ 877
nvidia/Llama-3.1-Nemotron-70B-Instruct-HF Text Generation β’ 71B β’ Updated Apr 13 β’ 101k β’ β’ 2.05k
Improving Long-Text Alignment for Text-to-Image Diffusion Models Paper β’ 2410.11817 β’ Published Oct 15, 2024 β’ 15
VidEgoThink: Assessing Egocentric Video Understanding Capabilities for Embodied AI Paper β’ 2410.11623 β’ Published Oct 15, 2024 β’ 50
bartowski/Vikhr-Nemo-12B-Instruct-R-21-09-24-GGUF Text Generation β’ 12B β’ Updated Sep 23, 2024 β’ 412 β’ 14
Vikhrmodels/Vikhr-Nemo-12B-Instruct-R-21-09-24 Text Generation β’ 12B β’ Updated Oct 25, 2024 β’ 726k β’ 126