dlwen committed · Commit 48967ea · verified · 1 Parent(s): f331855

End of training

README.md CHANGED
@@ -1,7 +1,7 @@
  ---
  library_name: transformers
  license: mit
- base_model: microsoft/MiniLM-L12-H384-uncased
+ base_model: EleutherAI/gpt-neo-125M
  tags:
  - generated_from_trainer
  datasets:
@@ -16,9 +16,9 @@ should probably proofread and complete it, then remove this comment. -->

  # my_awesome_eli5_clm-model

- This model is a fine-tuned version of [microsoft/MiniLM-L12-H384-uncased](https://huggingface.co/microsoft/MiniLM-L12-H384-uncased) on the eli5_category dataset.
+ This model is a fine-tuned version of [EleutherAI/gpt-neo-125M](https://huggingface.co/EleutherAI/gpt-neo-125M) on the eli5_category dataset.
  It achieves the following results on the evaluation set:
- - Loss: 0.2947
+ - Loss: 3.6742

  ## Model description

@@ -49,9 +49,9 @@ The following hyperparameters were used during training:

  | Training Loss | Epoch | Step | Validation Loss |
  |:-------------:|:-----:|:----:|:---------------:|
- | 1.498 | 1.0 | 1369 | 0.6652 |
- | 0.5345 | 2.0 | 2738 | 0.3477 |
- | 0.3861 | 3.0 | 4107 | 0.2947 |
+ | 3.6598 | 1.0 | 1308 | 3.6774 |
+ | 3.5083 | 2.0 | 2616 | 3.6719 |
+ | 3.4294 | 3.0 | 3924 | 3.6742 |


  ### Framework versions
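Reviewer note: for a causal language model the evaluation loss is usually easier to read as perplexity. The sketch below converts the final validation losses from the README table, assuming they are mean per-token cross-entropy values in nats (the usual Trainer convention, but not confirmed by this commit). The `perplexity` helper is a hypothetical name used for illustration. Because the two runs use different base models and tokenizers, the two figures are probably not directly comparable.

```python
import math

# Final validation losses copied from the old and new README tables.
# Assumption: each is a mean per-token cross-entropy in nats.
old_eval_loss = 0.2947  # run based on microsoft/MiniLM-L12-H384-uncased
new_eval_loss = 3.6742  # run based on EleutherAI/gpt-neo-125M


def perplexity(loss: float) -> float:
    """Convert a mean cross-entropy loss (nats) to perplexity."""
    return math.exp(loss)


old_ppl = perplexity(old_eval_loss)
new_ppl = perplexity(new_eval_loss)
print(f"old run perplexity: {old_ppl:.2f}")
print(f"new run perplexity: {new_ppl:.2f}")
```

The new validation loss is nearly flat across the three epochs (3.6774, 3.6719, 3.6742) while the training loss keeps falling, so the perplexity on the evaluation set changes very little after the first epoch.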
generation_config.json CHANGED
@@ -1,5 +1,6 @@
  {
  "_from_model_config": true,
- "pad_token_id": 0,
+ "bos_token_id": 50256,
+ "eos_token_id": 50256,
  "transformers_version": "4.46.3"
  }
runs/Dec10_09-59-49_38cecd41f61e/events.out.tfevents.1733824790.38cecd41f61e.283.2 CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:fd379e4aac4ee91fcbd985bd9b84ac8199df3379b4b4323b4f5439bfe940daba
- size 7538
+ oid sha256:68fbd61a7d88ed9b846d84076016a4922b98d1c1d52a682aa6e3c16dd4c8eec7
+ size 8163
runs/Dec10_09-59-49_38cecd41f61e/events.out.tfevents.1733826300.38cecd41f61e.283.3 ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:e845ab6088b98cadee2ab648e425127e7313b22f00486a5a82faff435733ea21
+ size 359
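Reviewer note: the TensorBoard event files in this commit are stored as Git LFS pointers, so the diff shows only a spec version, a content hash, and a byte size rather than the logs themselves. A minimal sketch of reading such a pointer follows, assuming the three-line `key value` layout seen above. `parse_lfs_pointer` is a hypothetical helper written for this note, not part of any Git LFS tooling.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse a Git LFS pointer file made of 'key value' lines."""
    fields = {}
    for line in text.strip().splitlines():
        # Split on the first space only; the value may contain ':' or '/'.
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid has the form '<hash algorithm>:<hex digest>'.
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),  # size of the real file in bytes
    }


# Pointer contents copied from the newly added events file in this commit.
pointer = """version https://git-lfs.github.com/spec/v1
oid sha256:e845ab6088b98cadee2ab648e425127e7313b22f00486a5a82faff435733ea21
size 359
"""

info = parse_lfs_pointer(pointer)
print(info["hash_algo"], info["size"])
```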