Add base model
Browse files
README.md
CHANGED
@@ -1,22 +1,24 @@
|
|
1 |
-
---
|
2 |
-
license: cc-by-nc-4.0
|
3 |
-
datasets:
|
4 |
-
- projecte-aina/ES-AST_Parallel_Corpus
|
5 |
-
language:
|
6 |
-
- es
|
7 |
-
- ast
|
8 |
-
metrics:
|
9 |
-
- bleu
|
10 |
-
- chrf
|
11 |
-
library_name: transformers
|
12 |
-
|
|
|
|
|
13 |
## Projecte Aina’s Spanish-Asturian machine translation model
|
14 |
|
15 |
## Model description
|
16 |
|
17 |
This model was created as part of the participation of Language Technologies Unit at BSC in the WMT24 Shared Task:
|
18 |
[Translation into Low-Resource Languages of Spain](https://www2.statmt.org/wmt24/romance-task.html).
|
19 |
-
It results from a full fine-tuning of the NLLB-200-600M model with a Spanish-Asturian corpus.
|
20 |
Specifically, we used the [transformers library](https://huggingface.co/docs/transformers/) from Hugging Face and a filtered version
|
21 |
of the [Spanish-Asturian dataset](https://huggingface.co/datasets/projecte-aina/ES-AST_Parallel_Corpus) to fine-tune the model.
|
22 |
The model was evaluated using the Flores evaluation datasets.
|
|
|
1 |
+
---
|
2 |
+
license: cc-by-nc-4.0
|
3 |
+
datasets:
|
4 |
+
- projecte-aina/ES-AST_Parallel_Corpus
|
5 |
+
language:
|
6 |
+
- es
|
7 |
+
- ast
|
8 |
+
metrics:
|
9 |
+
- bleu
|
10 |
+
- chrf
|
11 |
+
library_name: transformers
|
12 |
+
base_model:
|
13 |
+
- facebook/nllb-200-distilled-600M
|
14 |
+
---
|
15 |
## Projecte Aina’s Spanish-Asturian machine translation model
|
16 |
|
17 |
## Model description
|
18 |
|
19 |
This model was created as part of the participation of Language Technologies Unit at BSC in the WMT24 Shared Task:
|
20 |
[Translation into Low-Resource Languages of Spain](https://www2.statmt.org/wmt24/romance-task.html).
|
21 |
+
It results from a full fine-tuning of the [NLLB-200-600M](https://huggingface.co/facebook/nllb-200-distilled-600M) model with a Spanish-Asturian corpus.
|
22 |
Specifically, we used the [transformers library](https://huggingface.co/docs/transformers/) from Hugging Face and a filtered version
|
23 |
of the [Spanish-Asturian dataset](https://huggingface.co/datasets/projecte-aina/ES-AST_Parallel_Corpus) to fine-tune the model.
|
24 |
The model was evaluated using the Flores evaluation datasets.
|