neuralmagic/Sparse-Llama-3.1-8B-ultrachat_200k-2of4-FP8-dynamic Text Generation • Updated 23 days ago • 52 • 1
neuralmagic/Sparse-Llama-3.1-8B-evolcodealpaca-2of4-FP8-dynamic Text Generation • Updated 23 days ago • 4
neuralmagic/Sparse-Llama-3.1-8B-gsm8k-2of4-FP8-dynamic Text Generation • Updated 23 days ago • 127 • 1
neuralmagic/Sparse-Llama-3.1-8B-gsm8k-2of4-quantized.w4a16 Text Generation • Updated 23 days ago • 53
neuralmagic/Sparse-Llama-3.1-8B-ultrachat_200k-2of4-quantized.w4a16 Text Generation • Updated 23 days ago • 209 • 3
neuralmagic/Sparse-Llama-3.1-8B-evolcodealpaca-2of4-quantized.w4a16 Text Generation • Updated 23 days ago • 18
neuralmagic/Meta-Llama-3.1-8B-Instruct-quantized.w4a16 Text Generation • Updated 25 days ago • 18.8k • 23
Sparse-Llama-3.1-2of4 Collection 2:4 sparse versions of Llama-3.1, including transfer learning • 10 items • Updated 24 days ago • 4