MUNasir commited on
Commit
9ee1fcb
1 Parent(s): 083adee

Added citation

Browse files
Files changed (1) hide show
  1. README.md +44 -33
README.md CHANGED
@@ -1,33 +1,44 @@
1
- #### Languages:
2
-
3
- - Source language: English
4
-
5
- - Source language: isiZulu
6
-
7
- #### Model Details:
8
-
9
- - model: transformer
10
-
11
- - Architecture: MarianMT
12
-
13
- - pre-processing: normalization + SentencePiece
14
-
15
- #### Pre-trained Model:
16
-
17
- - https://huggingface.co/Helsinki-NLP/opus-mt-en-xh
18
-
19
- #### Corpus:
20
-
21
- - Umsuka English-isiZulu Parallel Corpus (https://zenodo.org/record/5035171#.Yh5NIOhBy3A)
22
-
23
- #### Benchmark:
24
-
25
- | Benchmark | Train | Test |
26
- |-----------|-------|-------|
27
- | Umsuka | 17.61 | 13.73 |
28
-
29
- #### GitHub:
30
-
31
- - https://github.com/umair-nasir14/Geographical-Distance-Is-The-New-Hyperparameter
32
-
33
-
 
 
 
 
 
 
 
 
 
 
 
 
1
+ #### Languages:
2
+
3
+ - Source language: English
4
+
5
+ - Source language: isiZulu
6
+
7
+ #### Model Details:
8
+
9
+ - model: transformer
10
+
11
+ - Architecture: MarianMT
12
+
13
+ - pre-processing: normalization + SentencePiece
14
+
15
+ #### Pre-trained Model:
16
+
17
+ - https://huggingface.co/Helsinki-NLP/opus-mt-en-xh
18
+
19
+ #### Corpus:
20
+
21
+ - Umsuka English-isiZulu Parallel Corpus (https://zenodo.org/record/5035171#.Yh5NIOhBy3A)
22
+
23
+ #### Benchmark:
24
+
25
+ | Benchmark | Train | Test |
26
+ |-----------|-------|-------|
27
+ | Umsuka | 17.61 | 13.73 |
28
+
29
+ #### GitHub:
30
+
31
+ - https://github.com/umair-nasir14/Geographical-Distance-Is-The-New-Hyperparameter
32
+
33
+ #### Citation:
34
+
35
+ ```
36
+ @article{umair2022geographical,
37
+ title={Geographical Distance Is The New Hyperparameter: A Case Study Of Finding The Optimal Pre-trained Language For English-isiZulu Machine Translation},
38
+ author={Umair Nasir, Muhammad and Amos Mchechesi, Innocent},
39
+ journal={arXiv e-prints},
40
+ pages={arXiv--2205},
41
+ year={2022}
42
+ }
43
+ ```
44
+