apv53-fit committed on
Commit be5ceca · verified · 1 Parent(s): 086841e

End of training

Files changed (2)
  1. README.md +15 -15
  2. model.safetensors +1 -1
README.md CHANGED
@@ -23,7 +23,7 @@ model-index:
  metrics:
  - name: Accuracy
  type: accuracy
- value: 0.79
+ value: 0.83
  ---
 
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -33,8 +33,8 @@ should probably proofread and complete it, then remove this comment. -->
 
  This model is a fine-tuned version of [ntu-spml/distilhubert](https://huggingface.co/ntu-spml/distilhubert) on the GTZAN dataset.
  It achieves the following results on the evaluation set:
- - Loss: 0.6878
- - Accuracy: 0.79
+ - Loss: 0.5416
+ - Accuracy: 0.83
 
  ## Model description
 
@@ -53,13 +53,13 @@ More information needed
  ### Training hyperparameters
 
  The following hyperparameters were used during training:
- - learning_rate: 3e-05
+ - learning_rate: 5e-05
  - train_batch_size: 8
  - eval_batch_size: 8
  - seed: 42
  - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
  - lr_scheduler_type: cosine
- - lr_scheduler_warmup_ratio: 0.1
+ - lr_scheduler_warmup_ratio: 0.3
  - num_epochs: 10
  - mixed_precision_training: Native AMP
 
@@ -67,16 +67,16 @@ The following hyperparameters were used during training:
 
  | Training Loss | Epoch | Step | Validation Loss | Accuracy |
  |:-------------:|:-----:|:----:|:---------------:|:--------:|
- | 2.091 | 1.0 | 113 | 2.0139 | 0.6 |
- | 1.4629 | 2.0 | 226 | 1.4632 | 0.63 |
- | 1.2623 | 3.0 | 339 | 1.1626 | 0.74 |
- | 0.9162 | 4.0 | 452 | 0.9752 | 0.68 |
- | 0.7541 | 5.0 | 565 | 0.8230 | 0.81 |
- | 0.7539 | 6.0 | 678 | 0.7603 | 0.78 |
- | 0.564 | 7.0 | 791 | 0.7347 | 0.81 |
- | 0.3841 | 8.0 | 904 | 0.6810 | 0.79 |
- | 0.5169 | 9.0 | 1017 | 0.6859 | 0.79 |
- | 0.4292 | 10.0 | 1130 | 0.6878 | 0.79 |
+ | 2.217 | 1.0 | 113 | 2.1703 | 0.41 |
+ | 1.6344 | 2.0 | 226 | 1.6105 | 0.65 |
+ | 1.2861 | 3.0 | 339 | 1.1849 | 0.71 |
+ | 0.8584 | 4.0 | 452 | 0.8745 | 0.71 |
+ | 0.6935 | 5.0 | 565 | 0.7215 | 0.84 |
+ | 0.4175 | 6.0 | 678 | 0.6174 | 0.8 |
+ | 0.3046 | 7.0 | 791 | 0.5329 | 0.85 |
+ | 0.121 | 8.0 | 904 | 0.5489 | 0.82 |
+ | 0.1203 | 9.0 | 1017 | 0.5513 | 0.83 |
+ | 0.0848 | 10.0 | 1130 | 0.5416 | 0.83 |
 
 
  ### Framework versions
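
Reviewer note: the two hyperparameter changes in README.md (learning_rate 3e-05 to 5e-05, lr_scheduler_warmup_ratio 0.1 to 0.3) interact, so it can help to look at the schedule each run would have followed. The sketch below is a plain-Python approximation, not the Trainer's own code. It assumes linear warmup followed by a single half-cosine decay to zero, and that the warmup length is the ceiling of `total_steps * warmup_ratio`. The function name `cosine_lr` is hypothetical. The step counts (113 per epoch, 1130 total) are taken from the results table.

```python
import math


def cosine_lr(step, base_lr, total_steps, warmup_ratio):
    """Learning rate at `step` for linear warmup followed by cosine decay."""
    # Assumption: warmup length is ceil(total_steps * warmup_ratio).
    warmup_steps = math.ceil(total_steps * warmup_ratio)
    if step < warmup_steps:
        # Linear ramp from 0 up to base_lr.
        return base_lr * step / max(1, warmup_steps)
    # Fraction of the post-warmup phase that has elapsed, in [0, 1].
    progress = (step - warmup_steps) / max(1, total_steps - warmup_steps)
    # Half-cosine decay from base_lr down to 0.
    return base_lr * 0.5 * (1.0 + math.cos(math.pi * progress))


TOTAL_STEPS = 1130  # last Step value in the results table (10 epochs x 113 steps)

old_run = dict(base_lr=3e-05, total_steps=TOTAL_STEPS, warmup_ratio=0.1)
new_run = dict(base_lr=5e-05, total_steps=TOTAL_STEPS, warmup_ratio=0.3)

# Print the learning rate at the end of every epoch for both runs.
for step in range(113, TOTAL_STEPS + 1, 113):
    old_lr = cosine_lr(step, **old_run)
    new_lr = cosine_lr(step, **new_run)
    print(f"step {step:4d}  old {old_lr:.2e}  new {new_lr:.2e}")
```

Under these assumptions the old run reaches its peak rate at the end of epoch 1, while the new run is still warming up until the end of epoch 3. That would be consistent with the lower epoch-1 accuracy in the new table (0.41 versus 0.6), though the commit itself does not state a cause.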
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:a1ffa4cd902484fc5b42a515668df95b79fe14c730668389436557bd798a2f38
+ oid sha256:ff6ca39095ed17e0a16d2bbdaeff2be787255c253552a4769309acecdbcfb014
  size 94771728
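
Reviewer note: the model card reports the final-epoch metrics (Loss 0.5416, Accuracy 0.83), which are not the best rows in the new results table. The sketch below copies both tables from the README.md diff and picks the best epoch of each run. The helper names `best_epoch` and `final_gap` are hypothetical, and the tie-break rule (prefer lower validation loss at equal accuracy) is an assumption made for this comparison, not something the commit specifies.

```python
# Rows are (epoch, training loss, validation loss, accuracy),
# copied from the removed (old) and added (new) table rows in the diff.
old_run = [
    (1, 2.091, 2.0139, 0.60), (2, 1.4629, 1.4632, 0.63),
    (3, 1.2623, 1.1626, 0.74), (4, 0.9162, 0.9752, 0.68),
    (5, 0.7541, 0.8230, 0.81), (6, 0.7539, 0.7603, 0.78),
    (7, 0.564, 0.7347, 0.81), (8, 0.3841, 0.6810, 0.79),
    (9, 0.5169, 0.6859, 0.79), (10, 0.4292, 0.6878, 0.79),
]
new_run = [
    (1, 2.217, 2.1703, 0.41), (2, 1.6344, 1.6105, 0.65),
    (3, 1.2861, 1.1849, 0.71), (4, 0.8584, 0.8745, 0.71),
    (5, 0.6935, 0.7215, 0.84), (6, 0.4175, 0.6174, 0.80),
    (7, 0.3046, 0.5329, 0.85), (8, 0.121, 0.5489, 0.82),
    (9, 0.1203, 0.5513, 0.83), (10, 0.0848, 0.5416, 0.83),
]


def best_epoch(rows):
    """Row with the highest accuracy; ties go to the lower validation loss."""
    return max(rows, key=lambda row: (row[3], -row[2]))


def final_gap(rows):
    """Validation loss minus training loss at the last epoch."""
    _, train_loss, val_loss, _ = rows[-1]
    return val_loss - train_loss


for name, rows in (("old", old_run), ("new", new_run)):
    epoch, _, val_loss, accuracy = best_epoch(rows)
    print(f"{name}: best epoch {epoch} (val loss {val_loss}, accuracy {accuracy}), "
          f"final val-train gap {final_gap(rows):.4f}")
```

In the new run the best row is epoch 7 (validation loss 0.5329, accuracy 0.85), and the gap between validation and training loss at epoch 10 is larger than in the old run. If the uploaded model.safetensors corresponds to the last epoch, which the commit message "End of training" suggests but does not confirm, an earlier checkpoint might score slightly higher on this evaluation set.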