MicroPanda123 commited on
Commit
cd468f5
·
1 Parent(s): 9ff3927

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -0
README.md CHANGED
@@ -13,6 +13,7 @@ batch_size=2
13
  gradient_accumulation_steps = 64
14
  ```
15
  This was because I was training it locally on RTX2060 and did not have enough power to train it on higher settings.
 
16
  Model is stored in "model" folder that contains model itself and "info.txt" file containing:
17
  - iter_num - number of iterations
18
  - train_loss - training loss at time of checkpoint
 
13
  gradient_accumulation_steps = 64
14
  ```
15
  This was because I was training it locally on RTX2060 and did not have enough power to train it on higher settings.
16
+
17
  Model is stored in "model" folder that contains model itself and "info.txt" file containing:
18
  - iter_num - number of iterations
19
  - train_loss - training loss at time of checkpoint