jmvcoelho committed on
Commit
bee38d1
1 Parent(s): bde3349

Update README.md

Files changed (1)
  1. README.md +56 -0
README.md CHANGED
@@ -1,3 +1,59 @@
  ---
  license: wtfpl
+ datasets:
+ - ms_marco
+ - squad
+ language:
+ - en
  ---
+
+ # Model
+
+ t5-base-msmarco-squad-query-generation-longp-v2
+
+ Task: query generation
+ Architecture: LongT5
+
+ Base model: google/long-t5-tglobal-base
+
+ Note: This model is intended as a baseline.
+
+
+ ## Prompt:
+
+ "Generate Query: {document}. Query:"
+
+ ## Sequence length:
+
+ 1536 tokens
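
A minimal sketch of how the prompt and sequence length above might be used for inference with the `transformers` library. The repo id in `MODEL_ID` is a guess assembled from the commit author and the model name, `max_new_tokens=32` is an arbitrary choice, and treating 1536 as the maximum *input* length is an assumption; none of these are confirmed by the README.

```python
PROMPT_TEMPLATE = "Generate Query: {document}. Query:"  # prompt from the README
MAX_INPUT_TOKENS = 1536  # assumed to be the maximum input length

# Assumption: hypothetical Hub repo id (author name + model name).
MODEL_ID = "jmvcoelho/t5-base-msmarco-squad-query-generation-longp-v2"


def build_prompt(document: str) -> str:
    """Wrap a document in the prompt format given in the README."""
    return PROMPT_TEMPLATE.format(document=document)


def generate_query(document: str, model_id: str = MODEL_ID,
                   max_new_tokens: int = 32) -> str:
    """Generate one query for a document (downloads the model on first use)."""
    # Imported lazily so build_prompt works without transformers installed.
    from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForSeq2SeqLM.from_pretrained(model_id)
    # Truncation cuts from the end, so a very long document would lose the
    # trailing ". Query:" suffix; shorten the document first if that matters.
    inputs = tokenizer(
        build_prompt(document),
        truncation=True,
        max_length=MAX_INPUT_TOKENS,
        return_tensors="pt",
    )
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `generate_query("some passage text")` would return a single generated query string, provided the assumed repo id resolves.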
+
+ ## Training details
+
+ ### Hyperparameters
+
+ Batch size: 8;
+ Gradient accumulation steps: 8;
+ Learning rate: 3e-4, linear scheduler, 400 warmup steps.
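
To make the hyperparameters above concrete, here is a small sketch of the effective batch size and of a linear schedule with warmup. It assumes a single device, that the learning rate decays linearly to zero after warmup (the usual behaviour of a linear scheduler), and a hypothetical `total_steps` value, since the README does not state the number of training steps or epochs.

```python
BATCH_SIZE = 8          # from the README
GRAD_ACCUM_STEPS = 8    # from the README
PEAK_LR = 3e-4          # from the README
WARMUP_STEPS = 400      # from the README


def effective_batch_size(num_devices: int = 1) -> int:
    """Examples per optimizer step (num_devices=1 is an assumption)."""
    return BATCH_SIZE * GRAD_ACCUM_STEPS * num_devices


def linear_lr(step: int, total_steps: int) -> float:
    """Linear warmup to PEAK_LR, then linear decay to zero at total_steps.

    total_steps is hypothetical; the README does not report it.
    """
    if step < WARMUP_STEPS:
        return PEAK_LR * step / WARMUP_STEPS
    remaining = max(0, total_steps - step)
    return PEAK_LR * remaining / max(1, total_steps - WARMUP_STEPS)


if __name__ == "__main__":
    print(effective_batch_size())
    for step in (0, 200, 400, 2200, 4000):
        print(step, linear_lr(step, total_steps=4000))
```

Under the single-device assumption, each optimizer step sees 8 × 8 = 64 examples.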
+
+
+ ### Data
+
+ Total: 252059 (document, query) pairs
+
+ From MARCO-V2: 165238
+ From SQuAD: 86821
+
+ The remaining queries from the MARCO-V2 train split were not used.
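
As a quick arithmetic check on the counts above, the following sketch sums the two sources and derives each one's share of the training mix. The steps-per-epoch figure additionally assumes an effective batch of 64 examples per optimizer step (batch size 8 × gradient accumulation 8 on one device), which the README does not state explicitly.

```python
import math

MARCO_PAIRS = 165238  # from the README
SQUAD_PAIRS = 86821   # from the README

total_pairs = MARCO_PAIRS + SQUAD_PAIRS
marco_share = MARCO_PAIRS / total_pairs
squad_share = SQUAD_PAIRS / total_pairs

# Assumption: 64 examples per optimizer step (8 x 8, single device).
ASSUMED_EFFECTIVE_BATCH = 64
steps_per_epoch = math.ceil(total_pairs / ASSUMED_EFFECTIVE_BATCH)

if __name__ == "__main__":
    print(f"total pairs: {total_pairs}")
    print(f"MARCO-V2 share: {marco_share:.1%}, SQuAD share: {squad_share:.1%}")
    print(f"optimizer steps per epoch (assumed batch 64): {steps_per_epoch}")
```

The sum matches the reported total of 252059, with roughly two thirds of the pairs coming from MARCO-V2.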
+
+ ## Evaluation
+
+ This model is intended for data augmentation,
+ so meaningful evaluation will come from downstream tasks.
+
+ MARCO-V2 Dev1:
+ BLEU: 0.102
+ ROUGE: 0.447
+
+ MARCO-V2 Dev2:
+ BLEU: 0.1691
+ ROUGE: 0.5013