annieske commited on
Commit
89e6ba4
·
1 Parent(s): 9501f45

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +23 -0
README.md CHANGED
@@ -1,3 +1,26 @@
1
  ---
2
  license: cc-by-nc-sa-4.0
 
 
3
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
  license: cc-by-nc-sa-4.0
3
+ library_name: transformers
4
+ pipeline_tag: token-classification
5
  ---
6
+ ### xlm-roberta-base for token classification, specifically fine-tuned for question-answer extraction for Finnish
7
+
8
+ This is the `xlm-roberta-base`, fine-tuned on manually annotated Finnish data, ChatGPT-annotated data and a semi-synthetic dataset based on the LFQA dataset.
9
+ ### Hyperparameters
10
+ ```
11
+ batch_size = 8
12
+ epochs = 10 (trained for less)
13
+ base_LM_model = "xlm-roberta-base"
14
+ max_seq_len = 512
15
+ learning_rate = 1e-5
16
+ ```
17
+ ### Performance
18
+ ```
19
+ Accuracy = 0.85
20
+ Question F1 = 0.82
21
+ Answer F1 = 0.75
22
+ ```
23
+
24
+ ### Usage
25
+
26
+ Instructions on how to use the results will be added later.