Update README.md
Browse files
README.md
CHANGED
@@ -15,7 +15,7 @@ widget:
|
|
15 |
|
16 |
# bert-mini-amharic
|
17 |
|
18 |
-
This model has the same architecture as [bert-mini](https://huggingface.co/prajjwal1/bert-mini) and was pretrained from scratch using the Amharic subsets of the [oscar](https://huggingface.co/datasets/oscar) and [mc4](https://huggingface.co/datasets/mc4) datasets, on a total of `137 Million` tokens.
|
19 |
It achieves the following results on the evaluation set:
|
20 |
- `Loss: 3.57`
|
21 |
- `Perplexity: 35.52`
|
|
|
15 |
|
16 |
# bert-mini-amharic
|
17 |
|
18 |
+
This model has the same architecture as [bert-mini](https://huggingface.co/prajjwal1/bert-mini) and was pretrained from scratch using the Amharic subsets of the [oscar](https://huggingface.co/datasets/oscar) and [mc4](https://huggingface.co/datasets/mc4) datasets, on a total of `137 Million` tokens. The tokenizer was trained from scratch on the same text corpus, and had a vocabulary size of 24k.
|
19 |
It achieves the following results on the evaluation set:
|
20 |
- `Loss: 3.57`
|
21 |
- `Perplexity: 35.52`
|