Update README.md
README.md
CHANGED
@@ -56,7 +56,7 @@ print(output)
 
 ## Training Details
 
-The model was trained for 20k steps with a batch size of 16. The optimizer used is AdamW with learning rate = 1.75e-5, \\(\beta_{1} = 0.9\\) and \\(\beta_{2} = 0.999\\), and a weight decay of 0.01
+The model was trained on questions from a real-world ITS, Lernnavi, for 20k steps with a batch size of 16. The optimizer used is AdamW with learning rate = 1.75e-5, \\(\beta_{1} = 0.9\\) and \\(\beta_{2} = 0.999\\), and a weight decay of 0.01
 
 
 ## Citation
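
For readers who want to see what the optimizer settings in the changed line mean in practice, here is a minimal pure-Python sketch of a single AdamW update on one scalar parameter. It is not the authors' training code. The learning rate, betas, and weight decay are taken from the README text; `EPSILON` and the function name `adamw_step` are assumptions introduced for illustration only.

```python
import math

# Hyperparameters as stated in the README's Training Details section.
LEARNING_RATE = 1.75e-5
BETA_1 = 0.9
BETA_2 = 0.999
WEIGHT_DECAY = 0.01
EPSILON = 1e-8  # assumption: not stated in the README; a commonly used default


def adamw_step(param, grad, m, v, step):
    """Apply one AdamW update to a single scalar parameter (hypothetical helper).

    `step` is the 1-based update count. Returns (new_param, new_m, new_v).
    """
    # Decoupled weight decay: shrink the parameter directly, not via the gradient.
    param = param * (1.0 - LEARNING_RATE * WEIGHT_DECAY)
    # Exponential moving averages of the gradient and of its square.
    m = BETA_1 * m + (1.0 - BETA_1) * grad
    v = BETA_2 * v + (1.0 - BETA_2) * grad * grad
    # Bias correction for the zero-initialised moment estimates.
    m_hat = m / (1.0 - BETA_1 ** step)
    v_hat = v / (1.0 - BETA_2 ** step)
    # Adam step scaled by the learning rate.
    param = param - LEARNING_RATE * m_hat / (math.sqrt(v_hat) + EPSILON)
    return param, m, v


# Usage: first update of a parameter at 1.0 with gradient 0.5.
new_param, new_m, new_v = adamw_step(param=1.0, grad=0.5, m=0.0, v=0.0, step=1)
print(new_param, new_m, new_v)
```

On the first step the bias-corrected ratio `m_hat / sqrt(v_hat)` has magnitude close to 1, so the parameter moves by roughly one learning rate, plus the small shrinkage from weight decay. In a real training run, a framework optimizer would apply the same rule to every model parameter.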