srajwal1 committed on
Commit d24340f · 1 Parent(s): b9da33b

End of training

README.md CHANGED
@@ -16,9 +16,9 @@ should probably proofread and complete it, then remove this comment. -->

  This model is a fine-tuned version of [google/flan-t5-base](https://huggingface.co/google/flan-t5-base) on the None dataset.
  It achieves the following results on the evaluation set:
- - Loss: 1.7845
- - F1: 54.9677
- - Gen Len: 2.3746
+ - Loss: 1.7965
+ - F1: 52.4905
+ - Gen Len: 2.3990

  ## Model description

@@ -43,7 +43,7 @@ The following hyperparameters were used during training:
  - seed: 42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
- - num_epochs: 10
+ - num_epochs: 2

  ### Training results

logs/events.out.tfevents.1714512605.0ce1cdb67813.20904.6 CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:fa4191dc2facefbc3bce9c8146d1f46e93a4acaa21d8377c6810c0bd4503c3ec
- size 5387
+ oid sha256:2ecf35b5bf1de236e68572b57f735576eb545a0b38b65f0d9cd88825153a55bc
+ size 5741
logs/events.out.tfevents.1714513414.0ce1cdb67813.20904.8 ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:f865efb6085d11087db67f916f5769884d6bb5b9914bc214f0c1b2a9fa318681
+ size 456