DebertaV2-Base-10M_babylm-A__sst2 / train_results.json
Ar4l's picture
Upload folder using huggingface_hub
a12f512 verified
raw
history blame contribute delete
238 Bytes
{
"epoch": 8.0,
"total_flos": 3.178750951973683e+16,
"train_loss": 0.16765205063521402,
"train_runtime": 3789.576,
"train_samples": 67349,
"train_samples_per_second": 355.443,
"train_steps_per_second": 44.432
}