DebertaV2-Base-10M_babylm-A__rte / train_results.json
{
  "epoch": 4.0,
  "total_flos": 587617475420160.0,
  "train_loss": 0.5243261960836557,
  "train_runtime": 82.8963,
  "train_samples": 2490,
  "train_samples_per_second": 600.751,
  "train_steps_per_second": 75.275
}
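
The throughput fields can be cross-checked against the runtime with a little arithmetic. The following is a minimal sketch, not part of the original file: it assumes `train_runtime` is in seconds and that each `*_per_second` value is a total divided by that runtime. The variable names (`implied_samples`, `implied_steps`, `implied_passes`) are illustrative only.

```python
import json

# Metrics copied verbatim from train_results.json above.
raw = """
{
  "epoch": 4.0,
  "total_flos": 587617475420160.0,
  "train_loss": 0.5243261960836557,
  "train_runtime": 82.8963,
  "train_samples": 2490,
  "train_samples_per_second": 600.751,
  "train_steps_per_second": 75.275
}
"""
metrics = json.loads(raw)

# Assumption: train_runtime is measured in seconds.
runtime = metrics["train_runtime"]

# Multiply each rate by the runtime to recover the total it was derived from.
implied_samples = metrics["train_samples_per_second"] * runtime
implied_steps = metrics["train_steps_per_second"] * runtime

# Number of passes over the training set that the sample total corresponds to.
implied_passes = implied_samples / metrics["train_samples"]

print(f"implied samples: {implied_samples:.0f}")
print(f"implied steps:   {implied_steps:.0f}")
print(f"implied passes:  {implied_passes:.2f} (recorded epoch: {metrics['epoch']})")
```

With these values the products come out close to 49,800 samples and 6,240 steps, which is about 20 passes over the 2,490 training samples rather than the 4.0 epochs recorded. One possible reading is that the rates were computed against a planned total and the run stopped early, but the file itself does not say so.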