File size: 1,729 Bytes
d44ca41
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
---
language:
- mt
- en
---
# Machine Translation (MT) Models

This repository hosts two machine translation models that translate Maltese sentences to English.

## Models

### Model 1: 
This model is sourced from [Kurt Abela's MSc Thesis repository](https://github.com/kurtabela/MSc-Thesis). If you utilize this model, kindly acknowledge it by citing the following:

```
@mastersthesis{kurt-abela-thesis,
  title={Machine Translation System for Low Resource Languages},
  author={Kurt Abela},
  school={University of Malta},
  address={Malta},
  year={2023},
}
```

### Model 2:
The second model is the machine translation model used for the iWSLT 2023 shared task submission. For citation purposes, please use:

```
@inproceedings{williams2023dfki,
  title={UM-DFKI Maltese Speech Translation},
  author={Williams, Aiden and Abela, Kurt and Kumar, Rishu and B{\"a}r, Martin and Billinghurst, Hannah and Micallef, Kurt and Samin, Ahnaf Mozib and DeMarco, Andrea and van der Plas, Lonneke and Borg, Claudia},
  booktitle={Proceedings of the 20th International Conference on Spoken Language Translation (IWSLT 2023)},
  pages={433--441},
  year={2023}
}
```

## How to Use

All models are [Fairseq](https://github.com/facebookresearch/fairseq) models. Each directory contains the best checkpoint (in the ./checkpoints/mt_en directory). The dictionaries are kept in the ./data_out/mt_en directory. The sentencepiece model and vocabulary can be found in the ./models/mt_en directory.

To use the models, we guide users to the [Fairseq documentation](https://fairseq.readthedocs.io/en/latest/getting_started.html).

## Contact

For any inquiries or issues regarding the models, feel free to [contact us](mailto:[email protected]).