End of training
- README.md +5 -5
- generation_config.json +4 -1
README.md
CHANGED
@@ -67,11 +67,11 @@ The following hyperparameters were used during training:
 
 | Training Loss | Epoch | Step | Validation Loss | Sacrebleu | Gen Len |
 |:-------------:|:-----:|:----:|:---------------:|:---------:|:-------:|
-|
-| 1.
-| 1.
-| 0.
-| 0.
+| 3.6041 | 1.0 | 825 | 1.4552 | 28.698 | 45.0965 |
+| 1.3318 | 2.0 | 1650 | 1.2516 | 30.4316 | 45.683 |
+| 1.103 | 3.0 | 2475 | 1.1934 | 30.9199 | 45.474 |
+| 0.9875 | 4.0 | 3300 | 1.1762 | 31.635 | 45.254 |
+| 0.9204 | 5.0 | 4125 | 1.1713 | 31.5851 | 45.311 |
 
 
 ### Framework versions
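The added rows in the README.md results table can be read programmatically. The following is a minimal sketch, not part of the commit: it parses the five new rows and reports which epoch has the highest Sacrebleu score and which has the lowest validation loss. The names `ROWS`, `COLUMNS`, and `parse_rows` are hypothetical helpers introduced for illustration only.

```python
# Added table rows, copied verbatim from the README.md diff.
ROWS = """
| 3.6041 | 1.0 | 825 | 1.4552 | 28.698 | 45.0965 |
| 1.3318 | 2.0 | 1650 | 1.2516 | 30.4316 | 45.683 |
| 1.103 | 3.0 | 2475 | 1.1934 | 30.9199 | 45.474 |
| 0.9875 | 4.0 | 3300 | 1.1762 | 31.635 | 45.254 |
| 0.9204 | 5.0 | 4125 | 1.1713 | 31.5851 | 45.311 |
"""

# Column names follow the table header; the snake_case keys are an assumption.
COLUMNS = ("training_loss", "epoch", "step",
           "validation_loss", "sacrebleu", "gen_len")


def parse_rows(text):
    """Turn pipe-delimited table rows into a list of dicts of floats."""
    rows = []
    for line in text.strip().splitlines():
        # Drop the outer pipes, then split the remaining cells.
        cells = [cell.strip() for cell in line.strip().strip("|").split("|")]
        rows.append(dict(zip(COLUMNS, map(float, cells))))
    return rows


rows = parse_rows(ROWS)
best_bleu = max(rows, key=lambda r: r["sacrebleu"])
lowest_val = min(rows, key=lambda r: r["validation_loss"])

print("Best Sacrebleu:", best_bleu["sacrebleu"], "at epoch", best_bleu["epoch"])
print("Lowest validation loss:", lowest_val["validation_loss"],
      "at epoch", lowest_val["epoch"])
```

On these rows the two criteria disagree by one epoch: Sacrebleu peaks at epoch 4, while validation loss is still slightly lower at epoch 5.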
generation_config.json
CHANGED
@@ -1,5 +1,8 @@
 {
+  "bos_token_id": 0,
+  "decoder_start_token_id": 2,
+  "eos_token_id": 2,
   "max_length": 200,
-  "
+  "pad_token_id": 1,
   "transformers_version": "4.41.2"
 }
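As a quick check of the updated generation_config.json, the sketch below loads the new file contents with the standard `json` module and collects the special token ids. It is an illustration under stated assumptions, not part of the commit: the variable names are hypothetical, and the JSON text is the post-commit side of the diff with two-space indentation assumed.

```python
import json

# Post-commit contents of generation_config.json, taken from the diff.
NEW_CONFIG = """
{
  "bos_token_id": 0,
  "decoder_start_token_id": 2,
  "eos_token_id": 2,
  "max_length": 200,
  "pad_token_id": 1,
  "transformers_version": "4.41.2"
}
"""

config = json.loads(NEW_CONFIG)

# Collect every key that names a special token id.
special_tokens = {key: value for key, value in config.items()
                  if key.endswith("_token_id")}

# In this config the decoder start token and the EOS token share one id.
decoder_starts_with_eos = (
    config["decoder_start_token_id"] == config["eos_token_id"]
)

print(special_tokens)
print("decoder starts with EOS id:", decoder_starts_with_eos)
```

The check only inspects the values present in the file; it makes no claim about which tokenizer or model architecture these ids belong to.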