Stronger Models are NOT Stronger Teachers for Instruction Tuning Paper • 2411.07133 • Published Nov 11, 2024 • 35
hugging-quants/Meta-Llama-3.1-405B-Instruct-AWQ-INT4 Text Generation • Updated Sep 13, 2024 • 335 • 37
Transformer Explainer: Interactive Learning of Text-Generative Models Paper • 2408.04619 • Published Aug 8, 2024 • 156