MicroPanda123
/

PythonBasic

Text Generation

Model card Files Files and versions Community

MicroPanda123 commited on Jul 14, 2023

Commit

cd468f5

·

1 Parent(s): 9ff3927

Update README.md

Files changed (1) hide show

README.md +1 -0

README.md CHANGED Viewed

@@ -13,6 +13,7 @@ batch_size=2
 gradient_accumulation_steps = 64
 ```
 This was because I was training it locally on RTX2060 and did not have enough power to train it on higher settings.
 Model is stored in "model" folder that contains model itself and "info.txt" file containing:
 - iter_num - number of iterations
 - train_loss - training loss at time of checkpoint

 gradient_accumulation_steps = 64
 ```
 This was because I was training it locally on RTX2060 and did not have enough power to train it on higher settings.
 Model is stored in "model" folder that contains model itself and "info.txt" file containing:
 - iter_num - number of iterations
 - train_loss - training loss at time of checkpoint