Stop Token Shenanigans

#1
by Anduin1357 - opened

Somehow, token 128008 <|eom_id|> works way better as the EOT token than token 128009 <|eot_id|> and I have no idea why.

Depending on the model's config / tokenizer it can have multiple stop tokens.
I don't full understand it , but it is often a source of model issues.

Sign up or log in to comment