Commit History

Patching hf bug that creates wrong cache length if only inputs_embeds are passed to the model (#19)
cc9521a
verified

itlevy tomer-nv commited on

DeciLMForCausalLM(DeciLMPreTrainedModel, GenerationMixin) for v4.50 (#16)
3209eec
verified

itlevy commited on

add batch_size attribute to VariableCache (#15)
e5d0706
verified

itlevy commited on

nvidia-open-model-license (#14)
1a151b4
verified

itlevy commited on

v4.46 support (#7)
e9d0db3
verified

itlevy commited on

v4.45 support (#6)
d311379
verified

itlevy commited on

transformers>=4.44.2, backward compat
b5dfaf4
verified

itlevy commited on