ahmed792002 commited on
Commit
8cb98cb
·
verified ·
1 Parent(s): 52c587d

ahmed792002/Finetuning_MBart_English_Arabic_Translation

Browse files
README.md CHANGED
@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
16
 
17
  This model is a fine-tuned version of [google-t5/t5-base](https://huggingface.co/google-t5/t5-base) on the None dataset.
18
  It achieves the following results on the evaluation set:
19
- - Loss: 0.0334
20
 
21
  ## Model description
22
 
@@ -47,9 +47,9 @@ The following hyperparameters were used during training:
47
 
48
  | Training Loss | Epoch | Step | Validation Loss |
49
  |:-------------:|:-----:|:----:|:---------------:|
50
- | No log | 1.0 | 250 | 0.0409 |
51
- | 0.2199 | 2.0 | 500 | 0.0348 |
52
- | 0.2199 | 3.0 | 750 | 0.0334 |
53
 
54
 
55
  ### Framework versions
 
16
 
17
  This model is a fine-tuned version of [google-t5/t5-base](https://huggingface.co/google-t5/t5-base) on the None dataset.
18
  It achieves the following results on the evaluation set:
19
+ - Loss: 0.0368
20
 
21
  ## Model description
22
 
 
47
 
48
  | Training Loss | Epoch | Step | Validation Loss |
49
  |:-------------:|:-----:|:----:|:---------------:|
50
+ | No log | 1.0 | 250 | 0.0448 |
51
+ | 0.6626 | 2.0 | 500 | 0.0381 |
52
+ | 0.6626 | 3.0 | 750 | 0.0368 |
53
 
54
 
55
  ### Framework versions
config.json CHANGED
@@ -7,7 +7,7 @@
7
  "d_ff": 3072,
8
  "d_kv": 64,
9
  "d_model": 768,
10
- "decoder_start_token_id": 0,
11
  "dense_act_fn": "relu",
12
  "dropout_rate": 0.1,
13
  "eos_token_id": 1,
 
7
  "d_ff": 3072,
8
  "d_kv": 64,
9
  "d_model": 768,
10
+ "decoder_start_token_id": 3,
11
  "dense_act_fn": "relu",
12
  "dropout_rate": 0.1,
13
  "eos_token_id": 1,
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:ff124e3c31ccd87cb470e09a49de1e7e823a9cc5592bd77b9b38da1c02acb406
3
  size 891644712
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c64e53ebf7c02d9ad76ac913a68438646fc31f8ff736b67dcdcf06a8345ca413
3
  size 891644712
runs/Dec07_20-07-52_715ba240964e/events.out.tfevents.1733603509.715ba240964e.447.2 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3c617d5fba6b6862eff791f871f9a163186b2cd2206be0a11b284fe6336e776b
3
+ size 359
runs/Dec07_20-47-38_715ba240964e/events.out.tfevents.1733604464.715ba240964e.9203.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a23e4a61dfbbd21cdb3213c11e74151c6e7e395106f45f8a61820e7fc9e7c8be
3
+ size 7520
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:da65253b4c8c35572efc2545d46f0e720dd6af731e62847046a5a67ffb97cd7e
3
  size 5496
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:898d6af1a588df69f3b67a9a5d13a295658ae42bada99e95d7dd9a82179f1b29
3
  size 5496