itlevy, tomer-nv
Patch HF bug that creates a wrong cache length when only inputs_embeds is passed to the model (#19)
cc9521a verified