neuralmagic/Meta-Llama-3.1-405B-Instruct-quantized.w4a16 Text Generation • Updated Oct 10 • 2.79k • 12
neuralmagic/Llama-3.1-Nemotron-70B-Instruct-HF-FP8-dynamic Text Generation • Updated Oct 17 • 11.9k • 14