Delta-Vector commited on
Commit
4ac573a
·
verified ·
1 Parent(s): c01028f

Model save

Browse files
Files changed (1) hide show
  1. README.md +2 -2
README.md CHANGED
@@ -83,7 +83,7 @@ micro_batch_size: 2
83
  num_epochs: 2
84
  optimizer: adamw_bnb_8bit
85
  lr_scheduler: cosine
86
- learning_rate: 4e-5
87
  max_grad_norm: 1.0
88
 
89
  train_on_inputs: false
@@ -134,7 +134,7 @@ More information needed
134
  ### Training hyperparameters
135
 
136
  The following hyperparameters were used during training:
137
- - learning_rate: 4e-05
138
  - train_batch_size: 2
139
  - eval_batch_size: 2
140
  - seed: 42
 
83
  num_epochs: 2
84
  optimizer: adamw_bnb_8bit
85
  lr_scheduler: cosine
86
+ learning_rate: 6e-6
87
  max_grad_norm: 1.0
88
 
89
  train_on_inputs: false
 
134
  ### Training hyperparameters
135
 
136
  The following hyperparameters were used during training:
137
+ - learning_rate: 6e-06
138
  - train_batch_size: 2
139
  - eval_batch_size: 2
140
  - seed: 42