smol_llama-220M-GQA-fineweb_edu / eval_results.json
End of training
83c5d1d verified
{
  "epoch": 0.9999939379610938,
  "eval_accuracy": 0.4560332193453835,
  "eval_loss": 2.741572141647339,
  "eval_runtime": 5.7613,
  "eval_samples": 300,
  "eval_samples_per_second": 52.072,
  "eval_steps_per_second": 6.596,
  "num_input_tokens_seen": 10810818560,
  "perplexity": 15.511351979678839
}
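
The fields above are related to one another, and a short sanity check can make that explicit. The sketch below assumes that `eval_loss` is a mean cross-entropy in nats, so that `perplexity` is `exp(eval_loss)`, and that `eval_samples_per_second` is `eval_samples / eval_runtime` rounded to three decimals. Both are common conventions but are not stated in the file itself. The helper names `perplexity_from_loss` and `samples_per_second` are hypothetical, introduced only for illustration.

```python
import json
import math

# Subset of the values copied from the eval_results.json shown above.
RESULTS_JSON = """
{
  "eval_loss": 2.741572141647339,
  "eval_runtime": 5.7613,
  "eval_samples": 300,
  "eval_samples_per_second": 52.072,
  "perplexity": 15.511351979678839
}
"""


def perplexity_from_loss(eval_loss: float) -> float:
    """Assumed convention: perplexity = exp(mean cross-entropy in nats)."""
    return math.exp(eval_loss)


def samples_per_second(num_samples: int, runtime_s: float) -> float:
    """Assumed convention: throughput = samples / wall-clock seconds."""
    return num_samples / runtime_s


results = json.loads(RESULTS_JSON)

derived_ppl = perplexity_from_loss(results["eval_loss"])
derived_sps = samples_per_second(results["eval_samples"], results["eval_runtime"])

# Compare the derived values with the ones stored in the file.
ppl_matches = math.isclose(derived_ppl, results["perplexity"], rel_tol=1e-6)
sps_matches = math.isclose(
    derived_sps, results["eval_samples_per_second"], abs_tol=1e-3
)

print(f"perplexity from loss: {derived_ppl:.6f} (matches file: {ppl_matches})")
print(f"samples per second:   {derived_sps:.3f} (matches file: {sps_matches})")
```

If either check failed on another run's results file, that would suggest the metrics were computed under a different convention (for example, a loss in bits rather than nats).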