nvidia
/

Llama-3_1-Nemotron-51B-Instruct

Text Generation

Model card Files Files and versions Community

Llama-3_1-Nemotron-51B-Instruct

Commit History

Patching hf bug that creates wrong cache length if only inputs_embeds are passed to the model (#19)

cc9521a
verified

tomer-nv commited on 7 days ago

fixed cache over-alloc bug (#17)

20cc7f1
verified

tomer-nv commited on 12 days ago

DeciLMForCausalLM(DeciLMPreTrainedModel, GenerationMixin) for v4.50 (#16)

3209eec
verified

itlevy commited on 20 days ago

add batch_size attribute to VariableCache (#15)

e5d0706
verified

itlevy commited on 20 days ago

nvidia-open-model-license (#14)

1a151b4
verified

itlevy commited on 21 days ago

v4.46 support (#7)

e9d0db3
verified

itlevy commited on 24 days ago

v4.45 support (#6)

d311379
verified

itlevy commited on 24 days ago

transformers>=4.44.2, backward compat

b5dfaf4
verified

itlevy commited on 26 days ago