timjwhite commited on
Commit
6f2ca9b
·
1 Parent(s): f5bf5d7

update model card README.md

Browse files
Files changed (1) hide show
  1. README.md +4 -4
README.md CHANGED
@@ -33,20 +33,20 @@ More information needed
33
  ### Training hyperparameters
34
 
35
  The following hyperparameters were used during training:
36
- - learning_rate: 1e-05
37
  - train_batch_size: 4
38
- - eval_batch_size: 2
39
  - seed: 42
40
  - gradient_accumulation_steps: 8
41
  - total_train_batch_size: 32
42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
43
  - lr_scheduler_type: linear
44
- - lr_scheduler_warmup_steps: 500
45
  - training_steps: 4000
46
 
47
  ### Framework versions
48
 
49
  - Transformers 4.30.2
50
- - Pytorch 1.13.1+cu117
51
  - Datasets 2.13.1
52
  - Tokenizers 0.13.3
 
33
  ### Training hyperparameters
34
 
35
  The following hyperparameters were used during training:
36
+ - learning_rate: 0.0001
37
  - train_batch_size: 4
38
+ - eval_batch_size: 4
39
  - seed: 42
40
  - gradient_accumulation_steps: 8
41
  - total_train_batch_size: 32
42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
43
  - lr_scheduler_type: linear
44
+ - lr_scheduler_warmup_steps: 50
45
  - training_steps: 4000
46
 
47
  ### Framework versions
48
 
49
  - Transformers 4.30.2
50
+ - Pytorch 2.0.1+cu117
51
  - Datasets 2.13.1
52
  - Tokenizers 0.13.3