Ammar-alhaj-ali
committed on
Commit · b6df0b8
1 Parent(s): 420551c
Update README.md
README.md
CHANGED
@@ -55,7 +55,7 @@ The following hyperparameters were used during training:
 - learning_rate: 1.5e-05
 - train_batch_size: 2
 - eval_batch_size: 2
-- optimizer:
+- optimizer: epsilon=1e-08
 - lr_scheduler_type: cosine
 - training_steps: 1000
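To make the `lr_scheduler_type: cosine` entry concrete, here is a minimal sketch of how a cosine schedule would decay the listed `learning_rate` of 1.5e-05 over the listed 1000 `training_steps`. The function name `cosine_lr` and the `WARMUP_STEPS = 0` setting are assumptions for illustration; the README does not state a warmup value, and this is not necessarily the exact scheduler implementation used in training.

```python
import math

BASE_LR = 1.5e-05       # learning_rate from the hyperparameter list
TRAINING_STEPS = 1000   # training_steps from the hyperparameter list
WARMUP_STEPS = 0        # assumption: no warmup value is given in the README


def cosine_lr(step, base_lr=BASE_LR, total_steps=TRAINING_STEPS,
              warmup_steps=WARMUP_STEPS):
    """Learning rate at `step` for a half-cosine decay with optional linear warmup."""
    if step < warmup_steps:
        # Linear ramp from 0 up to base_lr during warmup.
        return base_lr * step / max(1, warmup_steps)
    # Fraction of the decay phase completed, clamped to [0, 1].
    progress = (step - warmup_steps) / max(1, total_steps - warmup_steps)
    progress = min(1.0, progress)
    # Half cosine: starts at base_lr and reaches 0 at the final step.
    return base_lr * 0.5 * (1.0 + math.cos(math.pi * progress))


if __name__ == "__main__":
    for step in (0, 250, 500, 750, 1000):
        print(step, cosine_lr(step))
```

Under these assumptions the rate starts at the full 1.5e-05, passes through half that value at step 500, and reaches zero at step 1000.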