MicroPanda123
commited on
Commit
·
cd468f5
1
Parent(s):
9ff3927
Update README.md
Browse files
README.md
CHANGED
@@ -13,6 +13,7 @@ batch_size=2
|
|
13 |
gradient_accumulation_steps = 64
|
14 |
```
|
15 |
This was because I was training it locally on RTX2060 and did not have enough power to train it on higher settings.
|
|
|
16 |
Model is stored in "model" folder that contains model itself and "info.txt" file containing:
|
17 |
- iter_num - number of iterations
|
18 |
- train_loss - training loss at time of checkpoint
|
|
|
13 |
gradient_accumulation_steps = 64
|
14 |
```
|
15 |
This was because I was training it locally on RTX2060 and did not have enough power to train it on higher settings.
|
16 |
+
|
17 |
Model is stored in "model" folder that contains model itself and "info.txt" file containing:
|
18 |
- iter_num - number of iterations
|
19 |
- train_loss - training loss at time of checkpoint
|