File size: 313 Bytes
07423df |
1 2 3 |
Defines the number of training examples a mini-batch uses during an iteration of the training model to estimate the error gradient before updating the model weights. **Batch size** defines the batch size used per a single GPU.
During model training, the training data is packed into mini-batches of a fixed size. |