Update README.md
Browse files
README.md
CHANGED
@@ -42,9 +42,11 @@ The model is trained using multiple Danish datasets and a context length of 512
|
|
42 |
|
43 |
The model weights are initialized from the English [bert-base-uncased model](https://huggingface.co/bert-base-uncased) with new word token embeddings created for Danish using [WECHSEL](https://github.com/CPJKU/wechsel).
|
44 |
|
45 |
-
Initially, only the word token embeddings are trained using
|
46 |
|
47 |
|
48 |
# Evaluation
|
49 |
|
50 |
-
|
|
|
|
|
|
42 |
|
43 |
The model weights are initialized from the English [bert-base-uncased model](https://huggingface.co/bert-base-uncased) with new word token embeddings created for Danish using [WECHSEL](https://github.com/CPJKU/wechsel).
|
44 |
|
45 |
+
Initially, only the word token embeddings are trained using 1.000.000 samples. Finally, the whole model is trained for 8 epochs.
|
46 |
|
47 |
|
48 |
# Evaluation
|
49 |
|
50 |
+
Performance of pretrained model was evaluated using [ScandEval](https://github.com/ScandEval/ScandEval).
|
51 |
+
|
52 |
+
RESULTS
|