zkdeng commited on
Commit
b2a4a0a
·
verified ·
1 Parent(s): 74b783b

Model save

Browse files
Files changed (1) hide show
  1. README.md +4 -4
README.md CHANGED
@@ -33,11 +33,11 @@ More information needed
33
 
34
  The following hyperparameters were used during training:
35
  - learning_rate: 0.0005
36
- - train_batch_size: 8
37
- - eval_batch_size: 8
38
  - seed: 42
39
  - gradient_accumulation_steps: 4
40
- - total_train_batch_size: 32
41
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
42
  - lr_scheduler_type: linear
43
  - lr_scheduler_warmup_ratio: 0.1
@@ -47,7 +47,7 @@ The following hyperparameters were used during training:
47
 
48
  | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1 |
49
  |:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|
50
- | No log | 0.92 | 6 | 2.1353 | 0.68 | 0.4048 |
51
 
52
 
53
  ### Framework versions
 
33
 
34
  The following hyperparameters were used during training:
35
  - learning_rate: 0.0005
36
+ - train_batch_size: 16
37
+ - eval_batch_size: 16
38
  - seed: 42
39
  - gradient_accumulation_steps: 4
40
+ - total_train_batch_size: 64
41
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
42
  - lr_scheduler_type: linear
43
  - lr_scheduler_warmup_ratio: 0.1
 
47
 
48
  | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1 |
49
  |:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|
50
+ | No log | 0.92 | 3 | 0.5751 | 0.68 | 0.4048 |
51
 
52
 
53
  ### Framework versions