llm_model_m_unigram_hm_1e_v0.1 / train_results.json
RefalMachine's picture
load model
061a7e0
raw
history blame contribute delete
200 Bytes
{
"epoch": 1.0,
"train_loss": 2.8584754099769967,
"train_runtime": 318077.2613,
"train_samples": 26545790,
"train_samples_per_second": 83.457,
"train_steps_per_second": 0.348
}