Update README.md
Browse files
README.md
CHANGED
@@ -5,6 +5,6 @@ tags: []
|
|
5 |
|
6 |
# Tokenizer
|
7 |
|
8 |
-
A tokenizer with a vocab size of
|
9 |
|
10 |
The tokenizer was trained on [LibriSpeech LM text](https://www.openslr.org/11/)
|
|
|
5 |
|
6 |
# Tokenizer
|
7 |
|
8 |
+
A tokenizer with a vocab size of 100k for [Intro to Deep Learning](https://deeplearning.cs.cmu.edu/F24/index.html) Homework 4 on Language Modelling and Automatic Speech Recognition.
|
9 |
|
10 |
The tokenizer was trained on [LibriSpeech LM text](https://www.openslr.org/11/)
|