Update README.md
Browse files
README.md
CHANGED
@@ -1,3 +1,26 @@
|
|
1 |
---
|
2 |
license: cc-by-nc-sa-4.0
|
|
|
|
|
3 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
---
|
2 |
license: cc-by-nc-sa-4.0
|
3 |
+
library_name: transformers
|
4 |
+
pipeline_tag: token-classification
|
5 |
---
|
6 |
+
### xlm-roberta-base for token classification, specifically fine-tuned for question-answer extraction for Finnish
|
7 |
+
|
8 |
+
This is the `xlm-roberta-base`, fine-tuned on manually annotated Finnish data, ChatGPT-annotated data and a semi-synthetic dataset based on the LFQA dataset.
|
9 |
+
### Hyperparameters
|
10 |
+
```
|
11 |
+
batch_size = 8
|
12 |
+
epochs = 10 (trained for less)
|
13 |
+
base_LM_model = "xlm-roberta-base"
|
14 |
+
max_seq_len = 512
|
15 |
+
learning_rate = 1e-5
|
16 |
+
```
|
17 |
+
### Performance
|
18 |
+
```
|
19 |
+
Accuracy = 0.85
|
20 |
+
Question F1 = 0.82
|
21 |
+
Answer F1 = 0.75
|
22 |
+
```
|
23 |
+
|
24 |
+
### Usage
|
25 |
+
|
26 |
+
Instructions on how to use the results will be added later.
|