Nemotron models that have been converted and/or quantized to work well in vLLM
- mgoin/Nemotron-4-340B-Instruct-hf-FP8
  Text Generation • 341B • Updated • 8 • 3
- mgoin/Nemotron-4-340B-Base-hf-FP8
  Text Generation • 341B • Updated • 92 • 2
- mgoin/Nemotron-4-340B-Instruct-hf
  Text Generation • 341B • Updated • 13 • 4
- mgoin/Nemotron-4-340B-Base-hf
  Text Generation • 341B • Updated • 5 • 1