b09501048 committed on
Commit 3557896
1 Parent(s): 5bb1746

Training complete

Files changed (1)
  1. README.md +17 -0
README.md CHANGED
@@ -3,6 +3,7 @@ library_name: transformers
  license: gpl-3.0
  base_model: ckiplab/gpt2-tiny-chinese
  tags:
+ - summarization
  - generated_from_trainer
  model-index:
  - name: ADL_HW2_GPT2
@@ -18,6 +19,8 @@ should probably proofread and complete it, then remove this comment. -->
  # ADL_HW2_GPT2

  This model is a fine-tuned version of [ckiplab/gpt2-tiny-chinese](https://huggingface.co/ckiplab/gpt2-tiny-chinese) on an unknown dataset.
+ It achieves the following results on the evaluation set:
+ - Loss: 3.4578

  ## Model description

@@ -44,6 +47,20 @@ The following hyperparameters were used during training:
  - lr_scheduler_type: linear
  - num_epochs: 8

+ ### Training results
+
+ | Training Loss | Epoch | Step | Validation Loss |
+ |:-------------:|:-----:|:----:|:---------------:|
+ | 4.02 | 1.0 | 543 | 3.6837 |
+ | 3.8249 | 2.0 | 1086 | 3.5804 |
+ | 3.7431 | 3.0 | 1629 | 3.5336 |
+ | 3.6958 | 4.0 | 2172 | 3.5036 |
+ | 3.6633 | 5.0 | 2715 | 3.4815 |
+ | 3.639 | 6.0 | 3258 | 3.4722 |
+ | 3.622 | 7.0 | 3801 | 3.4597 |
+ | 3.6095 | 8.0 | 4344 | 3.4578 |
+
+
  ### Framework versions

  - Transformers 4.44.2
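
The training results added in this commit are easier to read as perplexity and per-epoch change. The sketch below is not part of the commit. It copies the eight table rows and assumes the reported losses are mean token-level cross-entropy in nats, which is the usual convention for causal language model training but is not stated in the model card. The names `RESULTS`, `perplexity`, and `summarize` are illustrative only.

```python
import math

# (epoch, step, training_loss, validation_loss), copied from the table in the diff.
RESULTS = [
    (1, 543, 4.02, 3.6837),
    (2, 1086, 3.8249, 3.5804),
    (3, 1629, 3.7431, 3.5336),
    (4, 2172, 3.6958, 3.5036),
    (5, 2715, 3.6633, 3.4815),
    (6, 3258, 3.639, 3.4722),
    (7, 3801, 3.622, 3.4597),
    (8, 4344, 3.6095, 3.4578),
]


def perplexity(loss: float) -> float:
    """Convert a mean cross-entropy loss to perplexity (assumes the loss is in nats)."""
    return math.exp(loss)


def summarize(results):
    """Return one dict per epoch with validation perplexity and change in validation loss."""
    rows = []
    previous_val = None
    for epoch, step, train_loss, val_loss in results:
        # The first epoch has no predecessor, so its delta is left as None.
        delta = None if previous_val is None else val_loss - previous_val
        rows.append(
            {
                "epoch": epoch,
                "step": step,
                "train_loss": train_loss,
                "val_loss": val_loss,
                "val_ppl": perplexity(val_loss),
                "val_delta": delta,
            }
        )
        previous_val = val_loss
    return rows


if __name__ == "__main__":
    for row in summarize(RESULTS):
        delta = "n/a" if row["val_delta"] is None else f"{row['val_delta']:+.4f}"
        print(f"epoch {row['epoch']}: val_ppl={row['val_ppl']:.2f} val_delta={delta}")
```

Under that assumption, the validation loss falls every epoch, but the gain shrinks from about 0.10 between epochs 1 and 2 to about 0.002 between epochs 7 and 8. The step column is 543 per epoch throughout, consistent with `num_epochs: 8` and the final step of 4344.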