Revert to fine-tuned CL model
- README.md +11 -33
- model.safetensors +1 -1
- training_args.bin +1 -1
README.md
CHANGED
@@ -16,10 +16,10 @@ should probably proofread and complete it, then remove this comment. -->
 
 # helsinki-biomedical-finetuned
 
-This model is a fine-tuned version of [Helsinki-NLP/opus-mt-en-es](https://huggingface.co/Helsinki-NLP/opus-mt-en-es) on
+This model is a fine-tuned version of [Helsinki-NLP/opus-mt-en-es](https://huggingface.co/Helsinki-NLP/opus-mt-en-es) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.
-- Bleu:
+- Loss: 0.0875
+- Bleu: 43.9070
 
 ## Model description
 
@@ -38,46 +38,24 @@ More information needed
 ### Training hyperparameters
 
 The following hyperparameters were used during training:
-- learning_rate:
+- learning_rate: 8e-07
 - train_batch_size: 8
 - eval_batch_size: 16
 - seed: 42
 - gradient_accumulation_steps: 4
 - total_train_batch_size: 32
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
-- lr_scheduler_type:
-- num_epochs:
+- lr_scheduler_type: cosine
+- num_epochs: 3
 - mixed_precision_training: Native AMP
 
 ### Training results
 
-| Training Loss | Epoch
-
-|
-|
-| 0.
-| 0.0206 | 4.0 | 13175 | 0.0226 | 54.4549 |
-| 0.0192 | 4.9998 | 16468 | 0.0224 | 54.5222 |
-| 0.0176 | 5.9998 | 19762 | 0.0222 | 54.6624 |
-| 0.0167 | 6.9999 | 23056 | 0.0221 | 55.0200 |
-| 0.0154 | 8.0 | 26350 | 0.0223 | 53.3307 |
-| 0.0147 | 8.9998 | 29643 | 0.0223 | 55.2185 |
-| 0.0138 | 9.9998 | 32937 | 0.0224 | 54.9215 |
-| 0.0133 | 10.9999 | 36231 | 0.0225 | 55.3672 |
-| 0.0122 | 12.0 | 39525 | 0.0229 | 55.2831 |
-| 0.0115 | 12.9998 | 42818 | 0.0231 | 55.2310 |
-| 0.0108 | 13.9998 | 46112 | 0.0233 | 55.3215 |
-| 0.0103 | 14.9999 | 49406 | 0.0234 | 55.3170 |
-| 0.0096 | 16.0 | 52700 | 0.0237 | 55.3158 |
-| 0.0089 | 16.9998 | 55993 | 0.0242 | 55.0178 |
-| 0.0084 | 17.9998 | 59287 | 0.0243 | 55.1974 |
-| 0.0072 | 18.9999 | 62581 | 0.0244 | 55.6011 |
-| 0.007 | 20.0 | 65875 | 0.0245 | 55.5510 |
-| 0.0069 | 20.9998 | 69168 | 0.0246 | 55.6178 |
-| 0.0068 | 21.9998 | 72462 | 0.0246 | 55.7191 |
-| 0.0068 | 22.9999 | 75756 | 0.0247 | 55.6917 |
-| 0.0066 | 24.0 | 79050 | 0.0247 | 55.6962 |
-| 0.0067 | 24.9943 | 82325 | 0.0247 | 55.6929 |
+| Training Loss | Epoch | Step | Validation Loss | Bleu |
+|:-------------:|:------:|:----:|:---------------:|:-------:|
+| No log | 0.9987 | 187 | 0.0890 | 43.7030 |
+| No log | 1.9973 | 374 | 0.0880 | 43.7818 |
+| 0.0947 | 2.9960 | 561 | 0.0875 | 43.9070 |
 
 
 ### Framework versions
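The hyperparameters on the `+` side of the README hunk above can be sanity-checked with a little arithmetic. The sketch below is illustrative only and is not taken from the training script: it assumes a single training device, no learning-rate warmup (none is listed in the model card), and that the schedule length equals the last logged step (561). The helper names `effective_batch_size` and `cosine_lr` are hypothetical.

```python
import math

# Values copied from the "+" side of the README diff.
PER_DEVICE_BATCH = 8      # train_batch_size
GRAD_ACCUM_STEPS = 4      # gradient_accumulation_steps
PEAK_LR = 8e-07           # learning_rate
TOTAL_STEPS = 561         # last logged step; assumed to be the schedule length


def effective_batch_size(per_device, accum_steps, num_devices=1):
    """Examples consumed per optimizer step.

    num_devices=1 is an assumption; the model card does not state it.
    """
    return per_device * accum_steps * num_devices


def cosine_lr(step, peak_lr=PEAK_LR, total_steps=TOTAL_STEPS, warmup_steps=0):
    """Cosine decay from peak_lr down to 0 over total_steps.

    warmup_steps=0 is an assumption; with warmup the rate would first
    ramp up linearly from 0 to peak_lr.
    """
    if step < warmup_steps:
        return peak_lr * step / max(1, warmup_steps)
    progress = (step - warmup_steps) / max(1, total_steps - warmup_steps)
    return peak_lr * 0.5 * (1.0 + math.cos(math.pi * progress))


if __name__ == "__main__":
    print("effective batch size:", effective_batch_size(PER_DEVICE_BATCH, GRAD_ACCUM_STEPS))
    for step in (0, 187, 374, 561):
        print(f"step {step:3d}: lr = {cosine_lr(step):.3e}")
```

Under these assumptions the product of `train_batch_size` and `gradient_accumulation_steps` reproduces the listed `total_train_batch_size` of 32, and the learning rate falls from 8e-07 at step 0 to zero at step 561.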
model.safetensors
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:8270085986f0253d764e56a4ca7d1dc839d0c9345d7e50b6957bfd72cbce7388
 size 309965092
training_args.bin
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:1e747c183dae3cb169560061858f8d235fcb25e6555f2915d4b7763b65e64965
 size 5176
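Both binary files above are stored as Git LFS pointers, so the diff only shows a changed `oid` while `size` stays the same. As a minimal sketch of how such a pointer could be checked against a downloaded file, the snippet below parses the three-line pointer text and compares size and SHA-256 digest. The function names `parse_lfs_pointer` and `matches_pointer` are hypothetical helpers, not part of any Git LFS tooling.

```python
import hashlib


def parse_lfs_pointer(text):
    """Parse a Git LFS pointer ("key value" per line) into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid field has the form "<algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(data, pointer):
    """Return True if the bytes have the size and SHA-256 the pointer records."""
    if pointer["algo"] != "sha256":
        raise ValueError("only sha256 pointers are handled in this sketch")
    return (
        len(data) == pointer["size"]
        and hashlib.sha256(data).hexdigest() == pointer["digest"]
    )


# Pointer text for training_args.bin, copied from the "+" side of the diff.
TRAINING_ARGS_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:1e747c183dae3cb169560061858f8d235fcb25e6555f2915d4b7763b65e64965
size 5176
"""

if __name__ == "__main__":
    print(parse_lfs_pointer(TRAINING_ARGS_POINTER))
```

To verify a real download, read the file in binary mode and pass its bytes to `matches_pointer` together with the parsed pointer. For a file as large as `model.safetensors`, hashing in chunks would be kinder to memory than reading it all at once.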