dariuslimzh committed on
Commit
4ce8feb
1 Parent(s): eebe68a

Training completed

Files changed (3)
  1. README.md +11 -8
  2. model.safetensors +1 -1
  3. tokenizer.json +2 -16
README.md CHANGED
@@ -4,6 +4,8 @@ license: mit
 base_model: EleutherAI/gpt-neo-125M
 tags:
 - generated_from_trainer
+metrics:
+- bleu
 model-index:
 - name: temp_gpt_neo
   results: []
@@ -16,7 +18,8 @@ should probably proofread and complete it, then remove this comment. -->

 This model is a fine-tuned version of [EleutherAI/gpt-neo-125M](https://huggingface.co/EleutherAI/gpt-neo-125M) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.3250
+- Loss: 3.3814
+- Bleu: 0.0

 ## Model description

@@ -47,13 +50,13 @@ The following hyperparameters were used during training:

 ### Training results

-| Training Loss | Epoch | Step | Validation Loss |
-|:-------------:|:------:|:----:|:---------------:|
-| No log | 0.9091 | 5 | 5.9921 |
-| No log | 2.0 | 11 | 3.0731 |
-| No log | 2.9091 | 16 | 1.7838 |
-| No log | 4.0 | 22 | 1.3521 |
-| No log | 4.5455 | 25 | 1.3250 |
+| Training Loss | Epoch | Step | Validation Loss | Bleu |
+|:-------------:|:------:|:----:|:---------------:|:----:|
+| No log | 0.9091 | 5 | 7.9635 | 0.0 |
+| No log | 2.0 | 11 | 5.3680 | 0.0 |
+| No log | 2.9091 | 16 | 4.0323 | 0.0 |
+| No log | 4.0 | 22 | 3.4255 | 0.0 |
+| No log | 4.5455 | 25 | 3.3814 | 0.0 |


 ### Framework versions
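The Epoch and Step columns in the training results table appear to be related by a constant number of optimizer steps per epoch. The sketch below checks that reading against the table rows. The value of 5.5 steps per epoch is inferred from the row where epoch 2.0 coincides with step 11; it is an assumption, not something the commit states, and the helper name `epoch_at` is hypothetical.

```python
# Rows copied from the "Training results" table: (epoch, step).
rows = [(0.9091, 5), (2.0, 11), (2.9091, 16), (4.0, 22), (4.5455, 25)]

# Assumption: epoch = step / steps_per_epoch with a constant ratio,
# inferred from the row where the epoch is a whole number (2.0 at step 11).
steps_per_epoch = 11 / 2.0  # 5.5 optimizer steps per epoch


def epoch_at(step: int) -> float:
    """Fractional epoch reached after `step` optimizer steps (4 decimals)."""
    return round(step / steps_per_epoch, 4)


for epoch, step in rows:
    # Print the step, the recomputed epoch, and the epoch reported in the table.
    print(step, epoch_at(step), epoch)
```

If the assumption holds, the final evaluation at step 25 falls partway through the fifth epoch, which would explain why the last row reports a fractional epoch.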
model.safetensors CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:022efb5e7b8828635658edac523279a6e193937e4ef5a67d2b2e0f49825a096f
+oid sha256:1a3eb7c7e81bfc9c8ca65d4a1b619654112375af48297f5468b002842edc3ca3
 size 500811336
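The model.safetensors entry is a Git LFS pointer, so the diff only shows the SHA-256 digest changing while the size stays at 500811336 bytes. As a minimal sketch of how such a pointer could be checked against a downloaded file, the helpers below parse the pointer text and compare size and digest. The function names `parse_lfs_pointer` and `matches_pointer` are hypothetical, and reading the whole blob into memory is a simplification; a file of this size would more sensibly be hashed in chunks.

```python
import hashlib


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(blob: bytes, pointer: dict) -> bool:
    """True if the blob has the size and SHA-256 digest the pointer records."""
    return (
        len(blob) == pointer["size"]
        and hashlib.sha256(blob).hexdigest() == pointer["digest"]
    )


# Pointer content as it appears on the new side of the diff.
new_pointer = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:1a3eb7c7e81bfc9c8ca65d4a1b619654112375af48297f5468b002842edc3ca3\n"
    "size 500811336\n"
)
print(new_pointer["algo"], new_pointer["size"])
```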
tokenizer.json CHANGED
@@ -1,21 +1,7 @@
 {
   "version": "1.0",
-  "truncation": {
-    "direction": "Right",
-    "max_length": 512,
-    "strategy": "LongestFirst",
-    "stride": 0
-  },
-  "padding": {
-    "strategy": {
-      "Fixed": 512
-    },
-    "direction": "Right",
-    "pad_to_multiple_of": null,
-    "pad_id": 50256,
-    "pad_type_id": 0,
-    "pad_token": "<|endoftext|>"
-  },
+  "truncation": null,
+  "padding": null,
   "added_tokens": [
     {
       "id": 50256,
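The tokenizer.json change replaces the serialized truncation and padding blocks (fixed length 512, pad token `<|endoftext|>`, pad id 50256) with `null`, so the saved tokenizer no longer carries those settings itself. The sketch below, using only the standard library, reports whether a tokenizer.json string has either setting baked in. The two JSON strings are abridged stand-ins for the old and new files (the `added_tokens` section and everything after it are omitted), and `baked_in_settings` is a hypothetical helper name.

```python
import json


def baked_in_settings(tokenizer_json: str) -> dict:
    """Report whether truncation/padding settings are serialized in the file."""
    config = json.loads(tokenizer_json)
    return {key: config.get(key) is not None for key in ("truncation", "padding")}


# Abridged old side of the diff: both settings are present.
old = json.dumps({
    "version": "1.0",
    "truncation": {"direction": "Right", "max_length": 512,
                   "strategy": "LongestFirst", "stride": 0},
    "padding": {"strategy": {"Fixed": 512}, "direction": "Right",
                "pad_to_multiple_of": None, "pad_id": 50256,
                "pad_type_id": 0, "pad_token": "<|endoftext|>"},
})

# Abridged new side of the diff: both settings are null.
new = json.dumps({"version": "1.0", "truncation": None, "padding": None})

print(baked_in_settings(old))
print(baked_in_settings(new))
```

With the settings removed from the file, any padding or truncation would presumably have to be requested by the caller at tokenization time rather than inherited from the saved tokenizer.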