pere commited on
Commit
1302198
1 Parent(s): 7b1c102
Files changed (1) hide show
  1. README.md +2 -2
README.md CHANGED
@@ -55,8 +55,8 @@ The following hyperparameters were used during training:
55
  - seed: 42
56
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
57
  - lr_scheduler_type: constant with warmpu
58
- - lr_scheduler_warmup_steps: 100
59
- - training_steps: 10.000 (currently 1.000)
60
  - mixed_precision_training: fp16
61
  - deepspee: true
62
 
 
55
  - seed: 42
56
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
57
  - lr_scheduler_type: constant with warmpu
58
+ - lr_scheduler_warmup_steps: 1000
59
+ - training_steps: 50.000 (currently @1.000)
60
  - mixed_precision_training: fp16
61
  - deepspee: true
62