rasyosef commited on
Commit
b1a3492
·
verified ·
1 Parent(s): e8158a7

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -15,7 +15,7 @@ widget:
15
 
16
  # bert-mini-amharic
17
 
18
- This model has the same architecture as [bert-mini](https://huggingface.co/prajjwal1/bert-mini) and was pretrained from scratch using the Amharic subsets of the [oscar](https://huggingface.co/datasets/oscar) and [mc4](https://huggingface.co/datasets/mc4) datasets, on a total of `137 Million` tokens.
19
  It achieves the following results on the evaluation set:
20
  - `Loss: 3.57`
21
  - `Perplexity: 35.52`
 
15
 
16
  # bert-mini-amharic
17
 
18
+ This model has the same architecture as [bert-mini](https://huggingface.co/prajjwal1/bert-mini) and was pretrained from scratch using the Amharic subsets of the [oscar](https://huggingface.co/datasets/oscar) and [mc4](https://huggingface.co/datasets/mc4) datasets, on a total of `137 Million` tokens. The tokenizer was trained from scratch on the same text corpus, and had a vocabulary size of 24k.
19
  It achieves the following results on the evaluation set:
20
  - `Loss: 3.57`
21
  - `Perplexity: 35.52`