TurkuNLP
/

xlmr-qa-extraction-fi

Token Classification

Model card Files Files and versions Community

annieske commited on Nov 2, 2023

Commit

89e6ba4

·

1 Parent(s): 9501f45

Update README.md

Files changed (1) hide show

README.md +23 -0

README.md CHANGED Viewed

@@ -1,3 +1,26 @@
 ---
 license: cc-by-nc-sa-4.0
 ---

 ---
 license: cc-by-nc-sa-4.0
+library_name: transformers
+pipeline_tag: token-classification
 ---
+### xlm-roberta-base for token classification, specifically fine-tuned for question-answer extraction for Finnish
+This is the `xlm-roberta-base`, fine-tuned on manually annotated Finnish data, ChatGPT-annotated data and a semi-synthetic dataset based on the LFQA dataset.
+### Hyperparameters
+```
+batch_size = 8
+epochs = 10 (trained for less)
+base_LM_model = "xlm-roberta-base"
+max_seq_len = 512
+learning_rate = 1e-5
+```
+### Performance
+```
+Accuracy = 0.85
+Question F1 = 0.82
+Answer F1 = 0.75
+```
+### Usage
+Instructions on how to use the results will be added later.