Update README.md
Browse files
README.md
CHANGED
@@ -52,7 +52,7 @@ The optimizer used is AdaFactor with inverse square root learning rate schedule
|
|
52 |
|
53 |
### Fine-tuning
|
54 |
|
55 |
-
This model was then fine-tuned on a single TPU Pod
|
56 |
|
57 |
|
58 |
## Evaluation results
|
|
|
52 |
|
53 |
### Fine-tuning
|
54 |
|
55 |
+
This model was then fine-tuned on a single TPU Pod V3-8 for 80,000 steps in total, using sequence length 512 (batch size 256), using only the dataset only containing java code.
|
56 |
|
57 |
|
58 |
## Evaluation results
|