llama-160m-sst2 / all_results.json
Cheng98's picture
End of training
8589bcc
raw
history blame contribute delete
403 Bytes
{
"epoch": 4.0,
"eval_accuracy": 0.6444954128440367,
"eval_loss": 0.6368556618690491,
"eval_runtime": 2.6708,
"eval_samples": 872,
"eval_samples_per_second": 326.49,
"eval_steps_per_second": 10.484,
"train_loss": 0.6729737561011949,
"train_runtime": 2059.9591,
"train_samples": 67349,
"train_samples_per_second": 130.777,
"train_steps_per_second": 1.021
}