|
--- |
|
pipeline_tag: translation |
|
language: multilingual |
|
library_name: transformers |
|
license: cc-by-nc-sa-4.0 |
|
--- |
|
|
|
<div align="center"> |
|
|
|
<h1 style="font-family: 'Arial', sans-serif; font-size: 28px; font-weight: bold; color: black;"> |
|
🛡️ Guardians of the Machine Translation Meta-Evaluation:<br> |
|
Sentinel Metrics Fall In! |
|
</h1> |
|
|
|
</div> |
|
|
|
<div style="display:flex; justify-content: center; align-items: center; flex-direction: row;"> |
|
<a href="https://2024.aclweb.org/"><img src="http://img.shields.io/badge/ACL-2024-4b44ce.svg"></a> |
|
<a href="https://aclanthology.org/2024.acl-long.856/"><img src="http://img.shields.io/badge/paper-ACL--anthology-B31B1B.svg"></a> |
|
<a href="https://arxiv.org/abs/2408.13831"><img src="https://img.shields.io/badge/arXiv-2408.13831-b31b1b.svg"></a> |
|
<a href="https://creativecommons.org/licenses/by-nc-sa/4.0/"><img src="https://img.shields.io/badge/License-CC%20BY--NC--SA%204.0-lightgrey.svg"></a> |
|
</div> |
|
<div style="display:flex; justify-content: center; align-items: center; flex-direction: row;"> |
|
<a href="https://huggingface.co/collections/sapienzanlp/mt-sentinel-metrics-66ab643b32aab06f3157e5c1"><img src="https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-Collection-FCD21D"></a> |
|
<a href="https://github.com/SapienzaNLP/guardians-mt-eval"><img src="https://img.shields.io/badge/GitHub-Repo-121013?logo=github&logoColor=white"></a> |
|
</div> |
|
|
|
This repository contains the **SENTINEL<sub>SRC</sub>** metric model pre-trained on Direct Assessments (DA) annotations and further fine-tuned on Multidimensional Quality Metrics (MQM) data. For details on how to use our sentinel metric models, check our [GitHub repository](https://github.com/SapienzaNLP/guardians-mt-eval). |
|
|
|
## Usage |
|
|
|
After having installed our repository package, you can use this model within Python in the following way: |
|
|
|
```python |
|
from sentinel_metric import download_model, load_from_checkpoint |
|
|
|
model_path = download_model("sapienzanlp/sentinel-src-mqm") |
|
model = load_from_checkpoint(model_path) |
|
|
|
data = [ |
|
{"src": "E.T.打电话回家"}, |
|
{"src": "托托,我感觉我们已经不在堪萨斯了"} |
|
] |
|
|
|
output = model.predict(data, batch_size=8, gpus=1) |
|
``` |
|
|
|
Output: |
|
```python |
|
# Segment scores |
|
>>> output.scores |
|
[0.6295096278190613, 0.46640336513519287] |
|
|
|
# System score |
|
>>> output.system_score |
|
0.5479564964771271 |
|
``` |
|
|
|
## Cite this work |
|
This work has been published at [ACL 2024 (Main Conference)](https://aclanthology.org/2024.acl-long.856/). If you use any part, please consider citing our paper as follows: |
|
|
|
```bibtex |
|
@inproceedings{perrella-etal-2024-guardians, |
|
title = "Guardians of the Machine Translation Meta-Evaluation: Sentinel Metrics Fall In!", |
|
author = "Perrella, Stefano and Proietti, Lorenzo and Scir{\`e}, Alessandro and Barba, Edoardo and Navigli, Roberto", |
|
editor = "Ku, Lun-Wei and Martins, Andre and Srikumar, Vivek", |
|
booktitle = "Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)", |
|
month = aug, |
|
year = "2024", |
|
address = "Bangkok, Thailand", publisher = "Association for Computational Linguistics", |
|
url = "https://aclanthology.org/2024.acl-long.856", |
|
pages = "16216--16244", |
|
} |
|
``` |
|
|
|
## License |
|
This work is licensed under [Creative Commons Attribution-ShareAlike-NonCommercial 4.0](https://creativecommons.org/licenses/by-nc-sa/4.0/). |