Article Dynamic Intuition-Based Reasoning: A Novel Approach Toward Artificial General Intelligence By Veyllo • about 15 hours ago • 1
Article Benchmarking Assisted Generation with Gemma 3 and Qwen 2.5: A Code-First Guide By ariG23498 • 1 day ago • 1
Open LLM Leaderboard best models ❤️🔥 Collection A daily updated list of models with the best evaluations on the LLM leaderboard. • 65 items • Updated about 1 hour ago • 555
🧠 Reasoning datasets Collection Datasets with reasoning traces for math and code released by the community • 14 items • Updated 2 days ago • 100
Long Context - 16k,32k,64k,128k,200k,256k,512k,1000k Collection Q6/Q8 models here. Mixtrals/Mistral (and merges) generally have 32k context (not listed here). Please see the org model card for usage / templates. • 71 items • Updated 5 days ago • 12
Open-source speech datasets annotated using Data-Speech Collection Open-source annotated speech datasets ranging from 1,000 hours to 45,000 hours. • 11 items • Updated Aug 8, 2024 • 5
Sana Collection ⚡️Sana: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer • 21 items • Updated Feb 10 • 88
Tulu 3 Datasets Collection All datasets released with Tulu 3 -- state-of-the-art open post-training recipes. • 33 items • Updated about 1 month ago • 74
Qwen2 Collection Qwen2 language models, with pretrained and instruction-tuned models in 5 sizes: 0.5B, 1.5B, 7B, 57B-A14B, and 72B. • 39 items • Updated Nov 28, 2024 • 359
Gemma release Collection Groups the Gemma models released by the Google team. • 40 items • Updated 1 day ago • 330