yo-extend-tokenizer / README.md

Llama tokenizer with an extended vocabulary:

  • 32000 Llama tokens
  • ~15000 Yoruba tokens
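
As a rough illustration of what extending a vocabulary means, here is a minimal sketch in plain Python. It assumes the new tokens are appended after the existing IDs and that tokens already present in the base vocabulary are skipped; this README does not state how the tokenizer was actually built, so `extend_vocab` and the toy vocabularies below are hypothetical stand-ins, not the author's method.

```python
def extend_vocab(base_vocab, new_tokens):
    """Return a copy of base_vocab with unseen tokens appended at the next free IDs.

    Hypothetical helper for illustration only; not part of any library.
    """
    vocab = dict(base_vocab)
    next_id = max(vocab.values()) + 1 if vocab else 0
    for tok in new_tokens:
        if tok not in vocab:  # keep the original ID for tokens the base already has
            vocab[tok] = next_id
            next_id += 1
    return vocab


# Toy stand-ins: the real base vocabulary has 32000 entries, not 4.
base = {"<unk>": 0, "<s>": 1, "</s>": 2, "▁the": 3}
yoruba = ["▁ọjọ́", "▁the", "▁àwọn"]  # "▁the" duplicates a base token

extended = extend_vocab(base, yoruba)
print(len(extended))
```

Under these assumptions the base IDs stay unchanged, so the combined size is at most 32000 plus the number of added tokens (roughly 47000 here if there is little overlap). A model paired with an extended tokenizer typically needs its embedding matrix resized to match, for example with `model.resize_token_embeddings(len(tokenizer))` in Hugging Face Transformers.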