yo-extend-tokenizer / README.md
nvassilyev's picture
Create README.md
dd98d75
Llama tokenizer with extended vocab:
- 32000 Llama tokens
- ~15000 Yoruba tokens