sjrhuschlee's picture
Update README.md
4a0e5e6
|
raw
history blame
927 Bytes
metadata
language: en
datasets:
  - squad_v2
license: cc-by-4.0
tags:
  - deberta
  - deberta-v3

deberta-v3-base for QA

This is the deberta-v3-base model, fine-tuned using the SQuAD2.0 dataset. It's been trained on question-answer pairs, including unanswerable questions, for the task of Question Answering.

Overview

Language model: deberta-v3-base
Language: English
Downstream-task: Extractive QA
Training data: SQuAD 2.0
Eval data: SQuAD 2.0
Code: See an example QA pipeline on Haystack
Infrastructure:

Hyperparameters

batch_size = 12
n_epochs = 4
base_LM_model = "deberta-v3-base"
max_seq_len = 512
learning_rate = 2e-5
lr_schedule = LinearWarmup
warmup_proportion = 0.2
doc_stride=128
max_query_length=64