python_and_text_pythia_160m / train_results.json
gbemilekeonilude: End of training (commit c072d12, verified)
266 Bytes
{
  "epoch": 3.0,
  "num_input_tokens_seen": 1941504,
  "total_flos": 990864117596160.0,
  "train_loss": 2.1522638506024196,
  "train_runtime": 52.4434,
  "train_samples": 631,
  "train_samples_per_second": 36.096,
  "train_steps_per_second": 4.519
}
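
The fields above can be cross-checked against each other. The following is a minimal sketch, not part of the repository: the helper name `derive_stats` is hypothetical, and it assumes that `train_runtime` is in seconds, that throughput is computed as samples times epochs divided by runtime, and that `train_loss` is a mean per-token cross-entropy in nats (so its exponential can be read as a perplexity).

```python
import json
import math

# The metrics exactly as they appear in train_results.json.
TRAIN_RESULTS = """
{
  "epoch": 3.0,
  "num_input_tokens_seen": 1941504,
  "total_flos": 990864117596160.0,
  "train_loss": 2.1522638506024196,
  "train_runtime": 52.4434,
  "train_samples": 631,
  "train_samples_per_second": 36.096,
  "train_steps_per_second": 4.519
}
"""


def derive_stats(results: dict) -> dict:
    """Derive secondary quantities from the reported metrics (hypothetical helper)."""
    runtime = results["train_runtime"]  # assumed to be seconds
    # Assumption: every sample is seen once per epoch.
    samples_processed = results["train_samples"] * results["epoch"]
    # Recover the optimizer step count from the reported step rate.
    total_steps = round(results["train_steps_per_second"] * runtime)
    return {
        "samples_per_second": samples_processed / runtime,
        "total_steps": total_steps,
        "steps_per_epoch": total_steps / results["epoch"],
        "tokens_per_step": results["num_input_tokens_seen"] / total_steps,
        # Assumption: train_loss is mean cross-entropy in nats per token.
        "perplexity": math.exp(results["train_loss"]),
    }


stats = derive_stats(json.loads(TRAIN_RESULTS))
for name, value in stats.items():
    print(f"{name}: {value:.3f}")
```

Under these assumptions the recomputed throughput agrees with the reported `train_samples_per_second`, and the token count divides evenly into the recovered number of steps, which suggests a fixed number of tokens per step. The batch size and sequence length themselves are not recorded in this file, so any specific split of that figure would be a guess.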