srajwal1 committed · Commit 4fa7a90 · 1 Parent(s): deda48d

End of training

README.md CHANGED
@@ -16,9 +16,9 @@ should probably proofread and complete it, then remove this comment. -->
 
  This model is a fine-tuned version of [google/flan-t5-base](https://huggingface.co/google/flan-t5-base) on the None dataset.
  It achieves the following results on the evaluation set:
- - Loss: 0.2759
- - F1: 51.5181
- - Gen Len: 2.1307
+ - Loss: 1.7845
+ - F1: 54.9677
+ - Gen Len: 2.3746
 
  ## Model description
 
@@ -43,7 +43,7 @@ The following hyperparameters were used during training:
  - seed: 42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
- - num_epochs: 2
+ - num_epochs: 10
 
  ### Training results
 
logs/events.out.tfevents.1714508539.0ce1cdb67813.20904.3 CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:75e09164bc1bcd6f3b80efbde6e62a38077b0e743d84108f1ef39dac03523c61
- size 6644
+ oid sha256:d951aefcc217bb9bb162df117278725c1b5495de2c648e099261ad1c343e074d
+ size 6998
logs/events.out.tfevents.1714512482.0ce1cdb67813.20904.5 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:9c797ca68ea8a8a6d5afeb3383ea91a133bcc0a2874e2068c81e33f4cb479216
+ size 456
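
The two log entries above are Git LFS pointer files, not the TensorBoard event data itself: each records a spec version, a SHA-256 object id, and a byte size. As a rough illustration of how such a pointer could be read and checked against a downloaded blob, here is a minimal Python sketch. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical and not part of Git LFS or any Hugging Face tooling.

```python
import hashlib


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file (hypothetical helper)."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid field has the form '<hash algorithm>:<hex digest>'.
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(blob: bytes, pointer: dict) -> bool:
    """Return True if the blob has the size and SHA-256 digest recorded in the pointer."""
    if pointer["hash_algo"] != "sha256":
        raise ValueError("this sketch only handles sha256 object ids")
    same_size = len(blob) == pointer["size"]
    same_hash = hashlib.sha256(blob).hexdigest() == pointer["digest"]
    return same_size and same_hash


# Pointer contents copied from the added log file in this commit.
pointer = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:9c797ca68ea8a8a6d5afeb3383ea91a133bcc0a2874e2068c81e33f4cb479216\n"
    "size 456\n"
)
print(pointer["hash_algo"], pointer["size"])
```

A changed `oid` and `size`, as in the first log file, only tells you that the underlying event file was rewritten; the diff does not show what changed inside it.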
tokenizer.json CHANGED
@@ -1,14 +1,9 @@
  {
    "version": "1.0",
-   "truncation": {
-     "direction": "Right",
-     "max_length": 3,
-     "strategy": "LongestFirst",
-     "stride": 0
-   },
+   "truncation": null,
    "padding": {
      "strategy": {
-       "Fixed": 3
+       "Fixed": 512
      },
      "direction": "Right",
      "pad_to_multiple_of": null,