Llama 3.2 Collection This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 11 items • Updated 27 days ago • 387
Qwen2.5-Coder Collection Code-specific model series based on Qwen2.5 • 14 items • Updated 28 days ago • 76
DataGemma Release Collection A series of pioneering open models that help ground LLMs in real-world data through Data Commons. • 2 items • Updated Sep 12 • 76
OLMoE Collection Artifacts for open mixture-of-experts language models. • 13 items • Updated 28 days ago • 24
🦅 🐍 FalconMamba 7B Collection This collection features the FalconMamba 7B base model, the instruction-tuned version, their 4-bit and GGUF variants, and the demo. • 15 items • Updated 13 days ago • 26
LLM Leaderboard best models ❤️🔥 Collection A daily uploaded list of models with best evaluations on the LLM leaderboard: • 264 items • Updated 6 days ago • 406
Nemotron 4 340B Collection Nemotron-4: open models for Synthetic Data Generation (SDG). Includes Base, Instruct, and Reward models. • 4 items • Updated 22 days ago • 156
Qwen2 Collection Qwen2 language models, including pretrained and instruction-tuned models of 5 sizes, including 0.5B, 1.5B, 7B, 57B-A14B, and 72B. • 39 items • Updated Sep 18 • 343
💥 Laser vs DoRA vs Daser vs LoRA Collection Comparison of different PEFT techniques of NeuralMonarch. • 4 items • Updated Mar 22 • 5
Kraken Demo Models Collection Find Demo Models of our Kraken Architecture here • 3 items • Updated about 6 hours ago • 1