Article Train 400x faster Static Embedding Models with Sentence Transformers 11 days ago • 121
Deepthink and Reasoning Collection Best for Deepthink and Reasoning • 14 items • Updated 1 day ago • 15
ModernBERT Collection Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated Dec 19, 2024 • 127
Bamba Collection Collection of Bamba - hybrid Mamba2 model architecture based models trained on open data • 8 items • Updated Dec 18, 2024 • 18
Falcon3 Collection Falcon3 family of Open Foundation Models is a set of pretrained and instruct LLMs ranging from 1B to 10B parameters. • 40 items • Updated 18 days ago • 81
PaliGemma 2 Release Collection Vision-Language Models available in multiple 3B, 10B and 28B variants. • 23 items • Updated Dec 13, 2024 • 132
Sailor2 Language Models Collection Sailing in South-East Asia with Inclusive Multilingual LLMs • 9 items • Updated Dec 3, 2024 • 22
Quantization Spaces on the Hub Collection A collection of spaces that allow you to quantize on the Hub • 4 items • Updated Nov 4, 2024 • 5
Tulu 3 Models Collection All models released with Tulu 3 -- state of the art open post-training recipes. • 7 items • Updated 20 days ago • 33
Thinking LLMs: General Instruction Following with Thought Generation Paper • 2410.10630 • Published Oct 14, 2024 • 18