bug with using Neobert for Sequence Classification
#9 opened 4 days ago
by
Romzzeess
make xformers an optional dependency
1
#6 opened 16 days ago
by
NyxKrage

Do you really use flash attention?
#5 opened 18 days ago
by
GinnM
Error with long text
#4 opened 21 days ago
by
hoan