llama-160m-sst2 / train_results.json
Cheng98's picture
End of training
8589bcc
raw
history blame contribute delete
196 Bytes
{
"epoch": 4.0,
"train_loss": 0.6729737561011949,
"train_runtime": 2059.9591,
"train_samples": 67349,
"train_samples_per_second": 130.777,
"train_steps_per_second": 1.021
}