Model save
Browse files
README.md
CHANGED
@@ -83,7 +83,7 @@ micro_batch_size: 2
|
|
83 |
num_epochs: 2
|
84 |
optimizer: adamw_bnb_8bit
|
85 |
lr_scheduler: cosine
|
86 |
-
learning_rate:
|
87 |
max_grad_norm: 1.0
|
88 |
|
89 |
train_on_inputs: false
|
@@ -134,7 +134,7 @@ More information needed
|
|
134 |
### Training hyperparameters
|
135 |
|
136 |
The following hyperparameters were used during training:
|
137 |
-
- learning_rate:
|
138 |
- train_batch_size: 2
|
139 |
- eval_batch_size: 2
|
140 |
- seed: 42
|
|
|
83 |
num_epochs: 2
|
84 |
optimizer: adamw_bnb_8bit
|
85 |
lr_scheduler: cosine
|
86 |
+
learning_rate: 6e-6
|
87 |
max_grad_norm: 1.0
|
88 |
|
89 |
train_on_inputs: false
|
|
|
134 |
### Training hyperparameters
|
135 |
|
136 |
The following hyperparameters were used during training:
|
137 |
+
- learning_rate: 6e-06
|
138 |
- train_batch_size: 2
|
139 |
- eval_batch_size: 2
|
140 |
- seed: 42
|