Fixes exceeding maximum sequence length when using generate(). 759d148 gugarosa committed on Nov 20, 2023
Fixes any potential overflow when calculating attention weights. b5c5161 gugarosa committed on Nov 16, 2023