jmvcoelho commited on
Commit
de79d5e
·
1 Parent(s): 026bf3b

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +55 -0
README.md CHANGED
@@ -1,3 +1,58 @@
1
  ---
2
  license: wtfpl
 
 
 
 
 
3
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
  license: wtfpl
3
+ datasets:
4
+ - ms_marco
5
+ - squad
6
+ language:
7
+ - en
8
  ---
9
+ # Model
10
+
11
+ t5-base-msmarco-squad-query-generation-firstp-v2
12
+
13
+ Task: query generation
14
+ Architecture: T5
15
+
16
+ Base model: t5-base
17
+
18
+ Note: This is supposed to be a baseline model.
19
+
20
+
21
+ ## Prompt:
22
+
23
+ "Generate Query: {document}. Query:"
24
+
25
+ ## Sequence length:
26
+
27
+ 512 tokens
28
+
29
+ ## Training details
30
+
31
+ ### Hyperparameters
32
+
33
+ Batch size: 8;
34
+ Gradient acc: 8;
35
+ LR: 3e-4, linear scheduler, 400 warmup steps.
36
+
37
+
38
+ ### Data
39
+
40
+ Total: 252059 pairs (document, query)
41
+
42
+ From MARCO-V2: 165238
43
+ From SQuAD: 86821
44
+
45
+ The remaining queries from MARCO-V2 train split were not used.
46
+
47
+ ## Evaluation
48
+
49
+ This model is supposed to be used for data augmentation.
50
+ Hence, meaningful evaluation will come from downstream tasks.
51
+
52
+ MARCO-V2 Dev1:
53
+ BLEU: 0.105
54
+ ROUGE: 0.449
55
+
56
+ MARCO-V2 Dev2:
57
+ BLEU: 0.171
58
+ ROUGE: 0.503